Solved: Reading multiple records from an XML doc (redux)

Former Member · ‎01-23-2017

A little over a year ago, I asked this question:

https://archive.sap.com/discussions/thread/3818944

Essentially, how do I read (or flatten) an XML document that has multiple tags into one or more input streams? I realize that the adapter to stream mechanism doesn't permit one adapter to send input to multiple streams, so I'm at a loss on how the XML based adapters are meant to be used. There's not much information in the docs and no usable sample code.

Say for example, an XML doc that looks like:

<?xml version="1.0" encoding="utf-8"?><br><InputRequest>
    <Protocol>
        <IP/>
        <Port/>
    </Protoco>
    <MessageDetails>
        <Length>0123</Length>
        <header>HEAD123</header>
        <routing></routing>
    </MessageDetails><br>    <ResponseDetails><br>        <Length>1234</Length><br>        <header>HEAD456</header><br>    </ResponseDetails><br>           .<br>           .<br>           .<br>       Many more tags<br></InputRequest>

How would I define an ESP schema to receive all the data? I've talked with the people who are sending this data, and they've agreed to include every tag that might possibly occur, even if it has no data (as shown above). Some child tag names will repeat when they're used in parent tags that have different names (e.g. Length and header above).

In the referenced question above, I was using the XML file adapter and the suggestion was to read the same file multiple times for different schemas. That really wasn't a workable solution. And for this project, I'm going to receive this data via the socket adapter, and it will only be sent once, so I have to get all the data in a single pass.

Please help!

Thanks,

Dan

JWootton · ‎01-26-2017

Why the reluctance to engage SAP customer support? Is the process that burdensome? is it really significantly harder than posting in this forum?

JWootton · ‎01-25-2017

Unless someone jumps in here to offer assistance, I suggest you open a support ticket for this

Former Member · ‎01-24-2017

Hi again,

I ran some tests with the 3rd party's XML and it isn't too hard to flatten it, so maybe I can convince them to do that and avoid the custom adapter route.

I was using the xml file adapter and not the socket adapter so maybe it behaves differently, but I have some questions about how the adapter is behaving. I'm running ESP 5.1 SP12 PL03.

1. Every time I start the adapter, it sends a record that has nulls in all the columns. If this is the same in the socket adapter, it's likely there will be only one record per xml doc, effectively doubling the input data. Is this a bug? Should I report it in an incident?

2. It seems the data can only be specified in attributes. Nested tags don't work. (look at data where ID=4)

3. A properly defined XML tag (I think) is not being sent to ESP (look at data where ID=3). Note the closing bracket following the attributes.

Thanks again for the suggestions and help. It's really helped clear my mind about how to proceed.

ESP Code:

CREATE SCHEMA S_xmlIn (
    ID string,
    Port string,
    Length string,
    header string
);

CREATE INPUT STREAM xmlIn SCHEMA S_xmlIn;

ATTACH INPUT ADAPTER XML_IN TYPE toolkit_file_xmllist_input to xmlIn
PROPERTIES 
	dir = 'C:/data' , 
	file = 'tx1Attr.xml'
;

tx1Attr.xml

<?xml version="1.0" encoding="US-ASCII"?>
<ThisWorks ID="1" Port="333" Length="0439" header="00121" />
<ThisWorks ID="2" Port="444" Length="0439" header="00121" </ThisWorks>
<ThisWorks ID="5" Port="444" header="00121" </ThisWorks>
<ThisDoesntWork ID="3" Port="555" Length="0439" header="00121"></ThisDoesntWork>
<ThisDoesntWork> <ID>4</ID> <Port>666</Port> <Length>0439</Length> <header>00121</header> </ThisDoesntWork>

RobertWaywell · ‎01-23-2017

Hi Dan,

It sounds like you have a decent understanding of the functionality of the 'out-of-the-box' adapters and that they do not fit the requirements for the application that you are currently working on. In that case, the recommended approach is to build a custom adapter using your choice of the available SDK's (C/C++, C# .Net, Java). With a custom adapter you will be able to implement the XML parsing logic that you require and then connect to and publish to as many separate streams within the ESP project as needed.

JWootton · ‎01-23-2017

It doesn't sound like any of the pre-configured adapters will do what you need but you have several options here...

Let's start with the challenge of repeating elements. This is not so much as an adapter question but a CCL schema question. As you know, CCL requires a fixed schema for each input stream/window - i.e. fixed number of columns. So in a situation where the number of columns in each "event" will vary, you have a few choices:

1. Define an input stream with the max number of columns any event can have. You can input events that don't have all columns - missing columns will just be "NULL"
2. Define multiple input streams, each with a different schema, and "route" the event to the appropriate input stream

3. Pack multiple values in a single CCL string column, using a delimiter, and then in the CCL you can parse the string as needed

Now to the adapters: most of the pre-build adapters are based on the Adapter Framework. The adapter framework lets you combine transport modules, parsing modules, and ESP/SDS connector modules in different combinations to achieve the desired result. SDS and ESP ship with a set of modules and a pre-configured set of adapters. However, you can define many additional adapters without writing new custom modules, just by combining existing modules in different combinations.

Have look at the guide for building custom adapters. Modules of interest:

- XMLDoc parser - lets you parse an XML doc and map XML elements to CCL rows/columns

- ESPMultiStreamPublisher, which lets a single adapter publish to mulitple streams using filters to determine which events go to which input stream

In the end, it may not be possible to achieve what you want with existing modules, in which case you could consider writing a custom parsing module.

And the doc for the adapters, and adapter modules can be difficult to decipher (especially around these more complex modules) but there are examples included in download packages (location varies by package - sounds like you are using ESP. Poke around a little but somewhere in the ESP installation directory you should find a folder called adapters and if you drill down you'll find examples)

Reading multiple records from an XML doc (redux)

Accepted Solutions (1)

Accepted Solutions (1)

Answers (4)

Answers (4)

Re: [XSLT]: Dynamic rows creation and values(in co...

Re: Connecting SAP CAP with SAP Cloud ALM

Re: How to configure alerts in Time and Material J...

How to configure alerts in Time and Material Journ...

Re: How to use node modules in SAPUI5 Fiori Javasc...