Former Member

TREX failing to index Word docs in FS repository

Hi,

I am getting errors while indexing an FS repository.

The errors in the TREX monitor are:

return message:Content-Length -vs- Actual Read mismatch

return code:8030

Document Status: Preparation Failed

I tried reindexing the failed entries, but that did not help.

All of them are Word documents. Crawler errors for folders also appear in the error list.

Looking into the TREX trace, I found the following:

HTTP-GET failed for URL <............>with Errorcode -30 , but HTTP-HEAD worked, trying again,

Mimetype application/msword is not based on TEXT, but was detected as text type; content might be corrupted and will be ignored
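
The return message suggests that the number of bytes actually read for a document differed from the Content-Length header the server announced. A minimal sketch of how that comparison could be reproduced outside TREX is shown here. It is not how TREX itself performs the check; the function names `length_mismatch` and `probe` are hypothetical, and the URL passed to `probe` would be the (elided) document URL from the trace.

```python
import http.client
import urllib.request
from typing import Optional


def length_mismatch(declared: Optional[str], body: bytes) -> Optional[int]:
    """Return declared Content-Length minus bytes actually read.

    Returns None when the header is missing or not a plain number,
    because then there is nothing to compare against.
    """
    if declared is None or not declared.strip().isdigit():
        return None
    return int(declared) - len(body)


def probe(url: str, timeout: float = 30.0) -> Optional[int]:
    """GET a document and compare the announced length with what arrived.

    Hypothetical helper for manual checks; it is not part of TREX or KM.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        declared = resp.headers.get("Content-Length")
        try:
            body = resp.read()
        except http.client.IncompleteRead as exc:
            # The connection ended early; keep the bytes that did arrive.
            body = exc.partial
    return length_mismatch(declared, body)
```

A result of 0 means the lengths agree; a positive value means fewer bytes arrived than the server announced, which would be consistent with the mismatch reported in the monitor.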

When I tried opening some of these docs from Windows, I did not have access to them and got an error. Would this also be the case for the index_service user?
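
One way to answer the access question would be to run a small read check under the same operating system account that the repository uses to reach the file share. The sketch below is only an illustration under that assumption: `unreadable_paths` is a hypothetical helper, and which account actually reads the files depends on how the FS repository is configured.

```python
import os
from typing import List


def unreadable_paths(root: str) -> List[str]:
    """List files and folders under root that the current account cannot read."""
    failed: List[str] = []

    def on_error(err: OSError) -> None:
        # os.walk reports folders it cannot list through this callback.
        failed.append(err.filename or str(err))

    for dirpath, _dirs, files in os.walk(root, onerror=on_error):
        for name in files:
            path = os.path.join(dirpath, name)
            try:
                # Reading one byte is enough to prove read access.
                with open(path, "rb") as handle:
                    handle.read(1)
            except OSError:
                failed.append(path)
    return failed
```

Running it against the repository root as that account and comparing the output with the failed entries in the TREX monitor would show whether the failures line up with missing read permissions.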

Also, not all docs that are visible on the Windows side are visible from the portal in the KM navigation iView.

Platform: EP 7.0 SP13, TREX 7.0

Any help would be appreciated.

Rgds


3 Answers

  • Former Member
    Posted on Feb 13, 2008 at 06:00 AM

    Hello,

    A related problem with search on this FS repository is that it takes a very long time to display results.

    The data source is quite big, with nearly 90,000 docs, and the index is supposed to index external links too, with the indexContentOfExternalLink property set. The search scope in the search iView is based on indexes.

    When a normal user who has the relevant role for the search iView runs a search, it takes a very long time, nearly 30 minutes.

    But if a super admin runs the same search, the results come up immediately.

    Is this some kind of authorization issue? The index gives Everyone full control, and I was not able to see anything relevant in the default trace either.

    Is there a particular trace or log file to check for this? Has something been missed in the index creation process?

    Hope someone can comment on this

    Rgds


    • Former Member

      I have a link in the FS repository to an external URL. This URL opens a web page that has links to some documents. Is the search then supposed to pick up only the documents, or will it pick up content within the documents too?

      The properties indexContentOfExternalLink and showWithoutDatasource are set for the index, so I am not sure whether the search is giving wrong results or whether I misunderstood the concept of external links from the help link.

      Appreciate any help on this

      Rgds

  • Former Member
    Posted on May 14, 2008 at 09:51 AM

    Hello,

    I have the same problem in the TREX monitor in our EP 6:

    return message:Content-Length -vs- Actual Read mismatch

    Could it have to do with the Support Package we installed in February (SP21 for EP6)?

    Is there already a solution for this problem?

    Thanks for any help!

    Marc


  • Former Member
    Posted on Jul 08, 2008 at 11:10 AM

    Used web repositories for this.
