How to check duplicate data

Hi All,

I want to know about checking duplicate data. My requirement is: there are 2 files (work files). Read files 1 and 2 and validate for errors such as duplicate data in a file, non-numeric values, etc.
If an error is encountered, write the record to an error file with the corresponding message.
For example, suppose there are 2 records with the same data in file 1. How can I check for duplicates?

Thanks,
Rohan.

Natural does not have a file COMPARE function, so it does not work like that…
Try: IF #FIELD1 = #FIELD2 WRITE (1) 'Duplicate' END-IF
and it may help to first sort both work files by the same key to assist the compare.

I have written a program which checks for duplicates; it runs once a year.
The trick is to sort both files on the same key. Read file 1 (preferably the smaller file), move its key to a variable, then read file 2.
If the saved key is greater than the file 2 key, read file 2 again; if it is less, read file 1; and of course, if they are equal, write 'error - record key'.
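A sketch of that two-file match in Natural (the field names, the A8 key format, and work file 3 as the error file are all hypothetical; both work files are assumed already sorted ascending on the key):

```natural
* Hypothetical layout: each file carries an 8-byte key, sorted ascending
DEFINE DATA LOCAL
1 #KEY1 (A8)
1 #KEY2 (A8)
1 #EOF1 (L)
1 #EOF2 (L)
END-DEFINE
*
PERFORM READ-FILE-1
PERFORM READ-FILE-2
REPEAT UNTIL #EOF1 OR #EOF2
  IF #KEY1 < #KEY2
    PERFORM READ-FILE-1              /* file 1 is behind: advance it
  ELSE
    IF #KEY1 > #KEY2
      PERFORM READ-FILE-2            /* file 2 is behind: advance it
    ELSE
      WRITE WORK FILE 3 'ERROR - DUPLICATE KEY' #KEY1
      PERFORM READ-FILE-1            /* keys equal: report, then move on
    END-IF
  END-IF
END-REPEAT
*
DEFINE SUBROUTINE READ-FILE-1
  READ WORK FILE 1 ONCE #KEY1
    AT END OF FILE
      #EOF1 := TRUE
    END-ENDFILE
END-SUBROUTINE
*
DEFINE SUBROUTINE READ-FILE-2
  READ WORK FILE 2 ONCE #KEY2
    AT END OF FILE
      #EOF2 := TRUE
    END-ENDFILE
END-SUBROUTINE
END
```

Note this finds keys that appear in both files; duplicates within a single file need the successive-record test discussed further down the thread.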

Have fun.

Thanks Kazi… I got your point. But in my case I want to eliminate duplicates within the same file. For example: if file 1 contains identical records, I have to eliminate them.

Requirement :

Read files 1 and 2 and validate for errors such as duplicate data in a file, non-numeric values, etc.
If an error is encountered, write the record to an error file with the corresponding message.

If all the data looks okay, then process the records. I need an output report sorted by employee ID, showing only employees in Active status.

Thanks,
Rohan.

You are either describing your problem inadequately, or you have never programmed.

The two posted answers, by Verne and Kazi, both suffice to answer your problem as stated.

Assuming you have a work file sorted by Employee ID, and you should not have duplicate Employee IDs, you can simply do an IF test between successive records.
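A minimal sketch of that successive-record test (the field names, the A8 format, and work file 2 as the error file are hypothetical; file 1 is assumed sorted by employee ID):

```natural
DEFINE DATA LOCAL
1 #EMP-ID  (A8)
1 #PREV-ID (A8)
END-DEFINE
*
READ WORK FILE 1 #EMP-ID
  IF #EMP-ID = #PREV-ID              /* same key as previous record
    WRITE WORK FILE 2 'ERROR - DUPLICATE EMPLOYEE ID' #EMP-ID
  END-IF
  #PREV-ID := #EMP-ID                /* save key for next comparison
END-WORK
END
```

The first record is compared against a blank #PREV-ID, which is harmless as long as a blank employee ID is not a legal value.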

In Natural, you could also use AT BREAK to discover duplicates, or PERFORM BREAK, or even IF BREAK.
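For instance, an AT BREAK version might look like this (a sketch with hypothetical names; it counts the records in each key group and reports any group larger than one when the value changes — again assuming a sorted file):

```natural
DEFINE DATA LOCAL
1 #EMP-ID (A8)
1 #COUNT  (I4)
END-DEFINE
*
READ WORK FILE 1 #EMP-ID
  AT BREAK OF #EMP-ID                /* value changed (or end of data)
    IF #COUNT > 1
      WRITE 'DUPLICATE EMPLOYEE ID' OLD(#EMP-ID)
    END-IF
    RESET #COUNT
  END-BREAK
  ADD 1 TO #COUNT
END-WORK
END
```

Break processing also fires at end of data, so the last key group is checked as well.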

steve

To delete (or simply identify) duplicate key values on a WORK file, use a sort utility (such as DFSORT or SYNCSORT) rather than Natural. SORT outperforms Natural when it comes to sequential I/O.

To delete the duplicate records, your sort parameters will look similar to this:

SORT FIELDS=(start,length,format,A),NOEQUALS
SUM FIELDS=NONE
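
For example, if the key were an 8-byte character field starting in column 1 (a hypothetical layout), the control statements would be:

```text
SORT FIELDS=(1,8,CH,A),EQUALS
SUM FIELDS=NONE
```

One design point: with SUM FIELDS=NONE, coding EQUALS guarantees that the first record of each set of duplicates is the one kept; with NOEQUALS, which duplicate survives is unpredictable, in exchange for somewhat better sort performance.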