Controlling order of files that DataSync processes

Sam_Longoria2 · February 24, 2025, 3:46pm

Maybe there is already a way to do this, but I haven’t found it yet. As I am sitting here, waiting for the only remaining single, large file to be synchronized on a 4 thread machine, I am thinking that it would be great to have a way to control the order of the synchronization. Admins could then select large files to start on some of the available threads at the beginning of the sync and process many small files on the other threads at the same time, instead of waiting to the middle or end of an alphabetical sync list to start a massive file sync.
I am thinking of a priority system, but any method of controlling the order of files to be synchronized would be helpful. The Groups option doesn’t seem very helpful when you have to sync 400+ files in one maintenance window as quickly as possible.

sally_swan · February 28, 2025, 12:58pm

Yeah, I get what you mean. Right now, AWS DataSync processes files in a mostly alphabetical order, and there’s no built-in way to prioritize large files upfront. One workaround could be splitting the sync job into multiple tasks—one for large files and another for smaller ones—so they run in parallel. You could also try using filters to exclude large files initially, sync the smaller ones first, and then run a second sync for the big ones. Not perfect, but it might help optimize the process.

Topic		Replies	Views
MFT synchronous processing of files Managed-File-Transfer	4	740	April 2, 2021
Reordering Managed-File-Transfer	2	465	April 2, 2021
AT file processing mode Managed-File-Transfer	2	544	April 2, 2021
Filtering out Matching records from two work files Adabas-Natural , Natural , Natural-on-Mainframes	10	1318	April 2, 2021
Sort of Workfiles Adabas-Natural , Natural , Natural-on-Linux	7	10458	April 2, 2021

Controlling order of files that DataSync processes

Related topics