
Dictionary Issue - Base Data Cleanse

Former Member

Hello Experts,

Using Data Services 12.2. My dictionary has 590 classifications (I believe this is the maximum limit; beyond it I get the error "unable to allocate index, indexes are over"). My source contains 16,500 records. I ran the job on a server with 4 GB RAM; after 6 hours it had loaded only 10,000 records, so I forcibly killed it. I also tried "Run as separate process", but no luck. Please let me know how to tune the Data Cleanse transform to improve performance. For testing purposes I am using a Query transform, followed by the Base Data Cleanse (BDC) transform, and then the output table.

Please give your inputs.

Would it be possible to split the dictionary, or would doing so cause a big performance hit?

Thanks

Accepted Solutions (1)


Former Member

Hi Felix,

Could you provide some more information so I can understand the problem better?

1. Are you sure you have 590 "classifications" and not some other dictionary attribute like Primary_Entry?

2. Are you using a shipped cleansing package or your own custom dictionary? If it is a custom dictionary, can you give an idea of its size in terms of the number of primary entries? A ballpark number would be sufficient. It would be best if you could export your dictionary and pass it along so I can take a look.

3. What kind of RDBMS is your dictionary repository based on?

Based on your response I may have some follow-up questions. In the meantime, can you also describe what you are trying to achieve in your dataflow? Is it Name/Title/Firm processing or custom data processing?

To answer some of your queries: you can't really split the dictionary, and given the number of records you are handling, it shouldn't take that long. There might be something else in the setup that we need to look into.

Thanks,

Sarthak

venkataramana_paidi
Contributor

Hi Sarthak,

Here is a brief description of my dictionary.

My input is a material description, which I am parsing into different attributes like size, color, width, length, outer diameter, speed, etc. Some of these attributes are static, meaning we enter their values into the dictionary directly; for example, for color we enter the color values into the dictionary as primary entries. I loaded this type of static attribute into one dictionary. So far I have entered more than 12,000 entries and it is working fine.
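(As a side note, the reason such static entries scale well can be sketched in a few lines of Python; this is a conceptual illustration only, with made-up names, not how Data Services is implemented:)

    # Conceptual sketch only (not Data Services internals): each primary entry
    # maps a token straight to its classification, so lookup is a constant-time
    # hash probe no matter how many thousands of entries the dictionary holds.
    color_entries = {"RED": "COLOR", "BLUE": "COLOR", "GREEN": "COLOR"}

    def classify_static(token):
        return color_entries.get(token.upper())  # O(1) regardless of entry count

    print(classify_static("red"))  # -> COLOR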

Other attributes are dynamic. For example, take the speed of a motor, 1000 rpm: the unit of measure (rpm) is constant, but the speed value may change. That is why each dynamic attribute value is classified as pattern + uom + id, where "pattern" means numeric or alphanumeric values, "uom" means cm, mm, rpm, and so on, and "id" is a qualifier such as F or C for temperature, or ID for inside diameter.

Here is one example.

A description may contain an inside diameter. This attribute can appear in any of these forms:

10 IN ID

10INID

10IN ID

10 INID

10 ID

10ID

I am using "break on white space" in my Base Data Cleanse transform. That is why I sometimes need to write 10 classifications for a single attribute, and this is how the number of classifications keeps increasing.
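For illustration only (a rough Python sketch with a made-up pattern, not Data Services dictionary syntax): a single pattern that treats the whitespace as optional covers all six variants above, which is exactly the flexibility that is lost when the input is first broken on white space:

    import re

    # Hypothetical sketch: one pattern with optional whitespace matches all six
    # spacing variants of "10 IN ID"; after break-on-white-space, each variant
    # tokenizes differently and ends up needing its own classification.
    INSIDE_DIAMETER = re.compile(r"^(\d+)\s*(IN)?\s*ID$")

    for text in ["10 IN ID", "10INID", "10IN ID", "10 INID", "10 ID", "10ID"]:
        m = INSIDE_DIAMETER.match(text)
        print(text, "->", m.group(1) if m else None)  # prints the value 10 each time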

2) I am using a MySQL database for the dictionary repository.

Please look into this and give me your valuable suggestions.

Thanks,

Ramana.

Former Member

Hi Ramana,

Thanks for the detailed description of your project.

Here is what I think is happening:

1. You are trying to bulkload a LOT of classifications using the bulkload tool in Designer, and you get an error saying "App failed to assign classification index". Is that correct?

2. If you go to the Dictionary Menu (in DS Designer), select your dictionary and just hit the "Search" button, you get nothing back. Could you verify this as well?

If the above two points are true, then the issue you are seeing is because the dictionary is not loading correctly. At this point we will need to evaluate the approach used in your project. It is not typical to have this many classifications; usually the same result can be achieved with a significantly smaller number of classifications.

Would it be possible for you to send/attach your rule file? This will help me evaluate your usage of these classifications and suggest an alternate approach.

Thanks,

Sarthak

Former Member

Just wanted to add that if you manually entered the dictionary entries, then doing a Search on them will probably return your primary entries. I would be curious how you are loading your primary entries and classifications: manually through the Dictionary menu, or via Bulkload.

Former Member

Hello,

While we work through this issue, it might be easier to channel it through our customer support. For this, could you create an SAP message (under component BOJ-EIM-DS)?

If you are unsure about how to create an SAP support message, please contact me directly (see the address in my profile - https://forums.sdn.sap.com/profile.jspa?userID=3850954) and I can work with you to get there.

Thanks,

Sarthak

Answers (2)


Former Member

As a follow-up for anyone else who might run into the same issue: the cause of the error "Unable to allocate index..." is the limit on the number of classifications allowed in a dictionary. Once that limit is hit, any attempt to add new classifications will result in that message.

Also, regarding the performance hit in this case: it is related to heavy usage of pattern-based classifications. Each token in the record is matched against every pattern-based classification (PBC), which causes a performance hit, so it is not a recommended best practice to have a huge number of PBCs. PBCs are mainly in place to handle special cases such as parsing $10,000 and similar notations. A better approach is to leverage the dictionary and use custom classifications to optimize performance.
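To make that cost concrete, here is a rough Python sketch of the matching behaviour described above (an illustration with assumed names, not DS internals): every token is tested against every PBC, so the work grows with tokens times patterns, whereas a custom classification in the dictionary is a single hash lookup per token.

    import re

    # Rough cost-model sketch (hypothetical names, not Data Services internals).
    pbcs = [re.compile(p) for p in (r"\d+RPM", r"\d+MM", r"\d+CM")]  # pattern-based
    custom = {"RED": "COLOR", "STEEL": "MATERIAL"}                   # dictionary entries

    def classify(token):
        hit = custom.get(token)         # one hash lookup per token
        if hit:
            return hit
        for p in pbcs:                  # worst case: every token tries every PBC
            if p.fullmatch(token):
                return "PBC:" + p.pattern
        return None

    print(classify("1000RPM"))  # -> PBC:\d+RPM, found only after scanning patterns
    print(classify("RED"))      # -> COLOR, resolved by a single dictionary lookup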

Former Member

Thank you.

I guess we will probably speak to SAP on this.