Does BatchInsert has any limit

Kumar_Kumar · October 4, 2018, 10:30pm

Experts,

we are processing very large flatfile which can be close to 700-1gb. So, we validate the flatfiles ( reading as stream ) , once the output list is generated we are simply doing a batchInsert to the target table.

Do you see any issues here? I mean, does batchInsert has any limits?

Thanks.

Spurti_Kumarsarj_Gadag · October 11, 2018, 8:09am

Hi Kumar,
This should work as there is no limit on insert operation.

Regards,
Spurti

r_eamon · October 11, 2018, 5:19pm

How are you processing the file with the flat file services? Are you using pub.flatFile:convertToValues with iterate set to true? Based upon the comment “once the output list is generated” is seems like you’re loading all records into memory at once, which may be an issue.

Mahesh_K_Sreenivas · October 13, 2018, 1:36pm

Good point, you must revisit your design approach and also see how much jvm is assigned to your IS.

r_eamon · October 15, 2018, 4:39pm

By “how much jvm” I assume you mean CPU and memory allocated to the JVM.

While this is a factor to a degree, the other concern is loading a 700MB - 1GB file completely into memory. And having the content of that file replicated 2 or more times within memory – e.g. during read, copied during mapping, etc. The data could be in memory 2-3 times.

The flat file and batch insert services (and XML and document services) tend to lead to solutions that load everything into memory at once. For most “event-driven” solutions (using this term loosely) this isn’t a concern. But when dealing with a large amount of data, such as in this case, it may cause a failure by exhausting memory, particularly if the JVM is doing other work at the same time.

The key here is to structure the reading and writing of the data in chunks, never all at once. If you have a single document list that holds all of the records, you’ve read everything into memory. Using a stream to read the data is but one step to follow.

Use stream to read the data.
Use iteration to read X records at a time. The number can vary, depending on record size, memory available to the JVM, etc. Testing will help you determine the “optimal” batch size.
Write those records using batch insert.
Repeat 2 - 3 until stream is EOF.

Topic		Replies	Views
Large Flat File Parsing and Publishing the Generated Document EDI	7	2594	April 2, 2021
Data insertion from flat file EDI	4	2128	April 2, 2021
WebMethods EDI data handling capacity EDI	3	1612	April 2, 2021
Max EDI File Size EDI	6	1993	May 24, 2021
Large file handling EDI	11	5514	May 14, 2021

Does BatchInsert has any limit

Related topics