Hello,
I have a web repository configured in my SAP portal that fails to index one set of documents: those stored in a CM repository on the portal server. These documents are referenced in the portal with anchor tags like the following:
<a href="https://portal.domain.com/irj/servlet/prt/portal/prtroot/com.sap.km.cm.docs/folder/myfile.htm">
The crawler log file shows errors like the following for these documents:
ERROR Apr 24, 2009 2:14:03 PM /webdynamic/portal.domain.com/irj/servlet/prt/portal/prtroot/com.sap.km.cm.docs/folder/myfile.htm http://portal.domain.com/irj/servlet/prt/portal/prtroot/com.sap.km.cm.docs/folder/myfile.htm processing failed com.sapportals.wcm.repository.AuthorizationRequiredException
I gather this means that when the indexing service tries to access these files, the portal expects a username and password to be passed. I created an HTTP System definition with the following settings:
Description:
Same User Domain: + <unchecked>
Max Connections: + 0
Password: + <password>
Server Aliases: +
Server URL: * http://portal.domain.com/irj/portal
User: + mydomain\myuser
This did not work. I also tried changing the Server URL setting to:
http://portal.domain.com/irj/servlet/prt/portal/prtroot/com.sap.km.cm.docs
This also did not work.
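In case it helps to reproduce the problem outside the crawler, here is a minimal sketch (Python, standard library only) of how the same document URL could be requested with and without credentials. It assumes the portal accepts HTTP Basic authentication, which may not be the case (it could equally use form-based logon or NTLM). The helper names basic_auth_header, classify, and probe are made up for illustration, and the password is a placeholder.

```python
import base64
import urllib.error
import urllib.request


def basic_auth_header(user: str, password: str) -> str:
    """Build the value of an HTTP Basic 'Authorization' header."""
    raw = f"{user}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def classify(status: int, final_url: str, requested_url: str) -> str:
    """Roughly interpret a response (hypothetical categories)."""
    if status in (401, 403):
        return "rejected"      # the server refused the request
    if final_url != requested_url:
        return "redirected"    # possibly sent to a logon page
    if 200 <= status < 300:
        return "ok"            # the document itself was returned
    return "other"


def probe(url, user=None, password=None, timeout=10):
    """Request url, optionally with Basic credentials, and classify the result."""
    request = urllib.request.Request(url)
    if user is not None:
        # Assumption: the portal honours Basic authentication.
        request.add_header("Authorization", basic_auth_header(user, password or ""))
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return classify(response.status, response.geturl(), url)
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as exceptions by urllib.
        return classify(err.code, url, url)


# Example (needs network access to the portal; password is a placeholder):
# url = ("http://portal.domain.com/irj/servlet/prt/portal/prtroot/"
#        "com.sap.km.cm.docs/folder/myfile.htm")
# print(probe(url))
# print(probe(url, "mydomain\\myuser", "<password>"))
```

If the anonymous request comes back as "rejected" or "redirected" and the authenticated one comes back as "ok", the credentials themselves work and the problem is more likely in how the HTTP System is matched to the crawled URLs.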
What do I need to do to get content like this to be indexed?
Thanks!
-StephenS