PDF as Source in Text Data Processing

Jul 18, 2017 at 12:52 PM

Hi All,

I have read that Text Data Processing supports PDF, Word, and other binary formats, but I don't understand how to use a PDF or Word file as a source.

Can anyone explain how to do this, or suggest a workaround?

Thanks,

srinivas


1 Answer

Best Answer
Dirk Venken
Jul 20, 2017 at 07:03 AM

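In the file format definition, set Type to Unstructured Text (for binary sources such as PDF or Word, Unstructured Binary should be the matching type). The output schema will look like this:

[screenshot of the output schema]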

Then use a TDP Entity_Extraction transform to process the contents.
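
For readers who think in code, the data flow described above can be sketched roughly as follows. This is only a conceptual illustration in plain Python, not Data Services code: the row fields `file_name` and `data`, the helper names, and the e-mail regex standing in for entity extraction are all hypothetical. The sketch also uses a plain text file; real PDF or Word content would need the binary handling that the transform provides.

```python
import re
import tempfile
from pathlib import Path


def read_unstructured(directory, pattern="*"):
    """Yield one row per matching file: its name and its raw bytes.

    Hypothetical stand-in for an unstructured file format source,
    which produces one record per file rather than one per line.
    """
    for path in sorted(Path(directory).glob(pattern)):
        if path.is_file():
            yield {"file_name": path.name, "data": path.read_bytes()}


# Toy stand-in for entity extraction: it only finds e-mail addresses.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def extract_entities(row):
    """Return one output record per entity found in the file's content."""
    text = row["data"].decode("utf-8", errors="ignore")
    return [
        {"file_name": row["file_name"], "type": "EMAIL", "value": m.group()}
        for m in EMAIL.finditer(text)
    ]


def demo():
    """Write a sample file, read it as one row, and extract entities."""
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "note.txt").write_text(
            "Contact user@example.com for details.", encoding="utf-8"
        )
        rows = list(read_unstructured(tmp, "*.txt"))
        return [entity for row in rows for entity in extract_entities(row)]


if __name__ == "__main__":
    print(demo())
```

The point of the sketch is the shape of the flow: the source emits one record per document, and the extraction step turns each document into zero or more entity records that still carry the file name.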

