Former Member

HTML property extractor for web repository

Hi All,

I was wondering whether anyone has worked on this.

We are using EP 2004s, and for one of our web servers we have created a web repository. We now want to use the 'HTML property extractor' to extract the values of the META tags of the HTML documents.

This will help us filter the search results, so we followed these steps:

1. Created an HTML property extractor for the meta tags (META all =<meta tag name>).

2. Assigned this extractor to the web repository.

3. Crawled the website.

After this, we tried to search for the documents using the meta tag values, but we were not able to find any documents.

We also tried to filter the results by adding this custom property under 'custom properties', but that did not work either.

Is there anything we are missing, or does this have to be done some other way?

Note: We have tried an HTML extractor with the <title> tag, and the title was extracted successfully.
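As a sanity check outside the portal, it can help to confirm that the crawled pages really expose the META tags under the names configured in the extractor. The following is only a minimal sketch in Python using the standard library; it is not the portal's extractor, and the class name `MetaTagCollector`, the function `extract_properties`, and the sample page with its `Department` tag are all hypothetical names made up for illustration.

```python
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collects the <title> text and all <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.meta = {}          # meta name (lowercased) -> content value
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        attr_map = dict(attrs)
        if tag == "meta":
            name = attr_map.get("name")
            if name:
                self.meta[name.lower()] = attr_map.get("content") or ""
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_properties(html_text):
    """Return the title and meta name/content pairs found in html_text."""
    parser = MetaTagCollector()
    parser.feed(html_text)
    parser.close()
    return {"title": parser.title.strip(), "meta": parser.meta}


# Hypothetical sample page; replace with the HTML of a crawled document.
SAMPLE_PAGE = """<html><head>
<title>Sample page</title>
<META NAME="Department" CONTENT="Finance">
<meta name="keywords" content="budget, report">
</head><body>Hello</body></html>"""

if __name__ == "__main__":
    print(extract_properties(SAMPLE_PAGE))
```

If a tag that should be extracted does not show up in the `meta` dictionary for a real page, the problem is likely in the page markup (for example, the tag uses a different attribute than `name`) rather than in the repository configuration.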

Useful answers will be rewarded with points!

Thanks & Regards,

Amit Kade


1 Answer

  • Former Member
    Posted on Jan 10, 2008 at 09:37 AM

    Hi Amit

    I'm working on a solution using Web Property Extractors as well, and would like to know if you managed to find a solution to your problem.

    Kind regards,

    Martin Søgaard

