Sep 20, 2005 at 10:31 AM

3rd Party Search Engine integration and crawling

I'm experimenting with integrating the Google Search Web Service into KM index management, following the documentation and the example code by Thilo Brandt.

I can successfully submit a search query and get results, but as soon as the index is created, a very long series of the following errors starts to be logged:

Location : com.sapportals.wcm.service.indexmanagement.XIndexCrawlerReceiver
Message : XIndexing documents failed. Continue crawling... - com.sapportals.wcm.WcmException: Method internalIndexDocuments() not supported on CustomerSearchIndex implementation.
	at it.trilog.km.search.google.ws.GoogleSearchIndex.internalIndexDocuments(GoogleSearchIndex.java:83)
	at com.sapportals.wcm.service.indexmanagement.AbstractIndex.indexDocuments(AbstractIndex.java:555)
	at com.sapportals.wcm.service.indexmanagement.AbstractIndex.indexDocuments(AbstractIndex.java:570)
	at com.sapportals.wcm.service.indexmanagement.XIndexCrawlerReceiver.addIndexResource(XIndexCrawlerReceiver.java:267)
	at com.sapportals.wcm.service.indexmanagement.XIndexCrawlerReceiver.receive(XIndexCrawlerReceiver.java:211)
	at com.sapportals.wcm.service.xcrawler.thread.CrawlerProviderThread.callReceive(CrawlerProviderThread.java:299)
	at com.sapportals.wcm.service.xcrawler.thread.CrawlerProviderThread.provideStd(CrawlerProviderThread.java:271)
	at com.sapportals.wcm.service.xcrawler.thread.CrawlerProviderThread.provide(CrawlerProviderThread.java:200)
	at com.sapportals.wcm.service.xcrawler.thread.CrawlerProviderThread.run(CrawlerProviderThread.java:123)
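
My reading of the trace, as a minimal stand-alone sketch: a crawler thread hands document batches to the index, the index's public indexDocuments() delegates to internalIndexDocuments(), and my search-only implementation throws there. The receiver appears to catch the exception, log it, and carry on with the next batch, which would explain the long series of identical entries. All class and method names below are hypothetical stand-ins written for illustration; they are not the real SAP KM classes, and the actual internals may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CrawlLoopSketch {

    // Hypothetical stand-in for com.sapportals.wcm.WcmException.
    static class SketchException extends Exception {
        SketchException(String message) {
            super(message);
        }
    }

    // Hypothetical stand-in for AbstractIndex: the public method
    // simply delegates to the implementation-specific hook.
    static abstract class SketchIndex {
        void indexDocuments(List<String> docs) throws SketchException {
            internalIndexDocuments(docs);
        }

        abstract void internalIndexDocuments(List<String> docs) throws SketchException;
    }

    // Stand-in for a search-only index: it cannot index documents itself,
    // so the hook rejects every call.
    static class SearchOnlyIndex extends SketchIndex {
        @Override
        void internalIndexDocuments(List<String> docs) throws SketchException {
            throw new SketchException(
                "Method internalIndexDocuments() not supported on CustomerSearchIndex implementation.");
        }
    }

    // Stand-in for the crawler receiver (assumed behaviour): each failed
    // batch is logged and the loop continues with the next batch.
    static List<String> crawl(SketchIndex index, List<List<String>> batches) {
        List<String> log = new ArrayList<>();
        for (List<String> batch : batches) {
            try {
                index.indexDocuments(batch);
            } catch (SketchException e) {
                log.add("XIndexing documents failed. Continue crawling... - " + e.getMessage());
            }
        }
        return log;
    }

    public static void main(String[] args) {
        List<List<String>> batches = Arrays.asList(
            Arrays.asList("doc1"), Arrays.asList("doc2"), Arrays.asList("doc3"));
        // One log entry is produced per batch handed to the index.
        for (String entry : crawl(new SearchOnlyIndex(), batches)) {
            System.out.println(entry);
        }
    }
}
```

If this picture is right, the number of log entries grows with the number of batches the crawler delivers, so something must be feeding documents to the index even though I did not assign a crawler to it.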

This eventually locks up the server completely, and I have to restart it. After the restart, the index continues to work, even though its status is red, and no crawling errors are reported. However, if I delete the index and recreate it, the problem occurs again.

The system seems to behave as if a crawler were assigned to the index, but this is not the case. Does anyone know what process is running here?

Thanks a lot, Davide