Branch document test performance

Michael_Lemaire · June 26, 2020, 12:38am

(webMethods 10.3)
The following type of Flow code is very common, but I have hit a significant performance issue with it:

Doing some performance testing, I found that if the document is large (say 20Mb) with many fields, then this test can take 200-300 milliseconds! Clearly webMethods is not just checking for the document’s presence in the pipeline - I assume it is converting the entire document to a string representation prior to carrying out the test, which would explain why tests like the following work:

The same performance hit occurs for evaluated labels e.g. %theDocument% == $null.
While the Service Development guide does state the variable being tested must be a String or “constrained” Object, it seems surprising that WM can’t efficiently handle a simple null test for a document.

I am aware of couple of workarounds:

Instead of checking whether the document exists, check for a document field we are confident will have a value if the document exists. This is fast.
Call a Java service that checks for the document’s presence in the pipeline (bizarrely yes this would be faster if workaround#1 can’t be relied on).

Has anyone else encountered this behavior? How did you resolve it?

Gerardo_Lisboa · June 26, 2020, 9:11am

Have you tested this case?

BRANCH on '/theDocument/someField'
>> $null: SEQUENCE

Michael_Lemaire · June 26, 2020, 9:49am

Thanks Gerardo,
Yes, that’s my workaround #1. I have found it to be hugely faster. Failing anything else I will look into the application to see whether the documents being tested have a field that is guaranteed to be populated whenever the document exists, and change to this type of test.

Gerardo_Lisboa · June 26, 2020, 10:01am

Created a new request on Brainstorm: 08226 “new BRANCH labels”

Gerardo_Lisboa · June 26, 2020, 10:08am

Created a new request on Brainstorm: 08227 “require a warning on BRANCH cases for non-trivial documents”

Sander_Brinkhuis · June 26, 2020, 11:12am

Did you try:
BRANCH evaluate labels true
%theDocument%

This only tests if the variable exists.

Gerardo_Lisboa · June 26, 2020, 2:52pm

Hi @Sander_Brinkhuis, thanks for your suggestion.

@Michael_Lemaire, does this approach has the same performance problems you saw before?

Best regards and Keep Safe!

Michael_Lemaire · June 28, 2020, 11:51pm

Sander, yes that’s the solution - thank you! That drops the test time down to 0 milliseconds (I’m only measuring to millisecond precision for the moment).

It’s interesting that the moment the expression contains anything else at all e.g. %theDocument% == $null, WM reverts to taking 200-300mS. So the solution to test the opposite condition (where the document does not exist) is fastest when implemented as a default clause on what you proposed, i.e.

Gerardo_Lisboa · June 29, 2020, 8:09am

Hi,

@Michael_Lemaire, do you want to share your test code?

Have you created a test set or is it automatically generated?

A second question: does it make any difference if the document does not exist (e.g. dropped) vs $null (e.g. set to nothing) in this test you just showed?

Best regards,

Sander_Brinkhuis · June 29, 2020, 9:04am

Nice, I would use

BRANCH evaluate labels true
!%theDocument%

Saves a branch.

Gerardo_Lisboa · June 29, 2020, 10:51am

New feature request: #08229 BRANCH should be visually different on “Evaluate Labels”
New feature request: #08230 BRANCH negated step label should be visually noticeable

Camilo_Arbelaez · July 1, 2020, 2:37am

@Michael_Lemaire

Hi Michael,

Thanks for reporting this issue. I am part of the Integration Server R&D team.

I am trying to reproduce this performance issue. Can you share a test package to repro this issue?

I probably don’t have a large enough document in my own example, but I haven’t been able to see the same performance results. I would love to see exactly what you are doing since it sounds very reproducible.

Thanks,
Camilo

Michael_Lemaire · July 6, 2020, 5:10am

Hi Camilo,

Sorry I can’t send you a test package, but here’s a clip of the relevant flow. I ran a series of flow test steps; each step is bracketed by a ~.processTimer:start service and a ~.processTimer:end service. The “start” service calls pub.date:currentNanoTime and the “end” service calls pub.date:elapsedNanoTime, and then the elapsed time is written to the server log (along with a description so I know which test I’m looking at). The main challenge is to generate large enough test documents; in my case I have a document tree with perhaps a million fields that generates about 23Mb of JSON.

I ran the tests on a WM10.3 integration server on a (VM) Intel Xeon E5-2699 v4 @ 2.20GHz with 16Gb RAM, Windows Server 2016 Datacenter,

Michael_Lemaire · July 6, 2020, 5:16am

Hi Gerardo, sorry I can’t share the code. The document is live production data so I have been saved the effort of generating it myself The performance test itself pretty simple - please see my response to Camilo. Yes the delay only occurs if the document exists.

Topic		Replies	Views
LargeDoc bizDoc EDI	6	1541	April 2, 2021
How to automatically sync Documents for CI/CD git Version Control git	20	639	November 12, 2024
BRANCH Condition behaviour post Migration from IS Version 10.7 to 10.15 CONNX	4	45	February 26, 2025
Need performance advice for translating very large EDI files EDI	14	4343	April 2, 2021
Null elements in output array when mapping EDI to output FF schema EDI	58	10973	April 2, 2021

Branch document test performance

Related topics