How do you remove duplicates in a dataset?

In how many ways can we delete a dataset? If a record is duplicated 3 times, how do we get the middle duplicate record? Is it advisable to use a BASIC Transformer in parallel jobs?

Questions by pradeep.dwh   answers by pradeep.dwh

srkreddy111

  • Aug 12th, 2011
 

First, open the data set and click on Partitioning. Select Hash as the partition type, then check Perform sort, then check Unique, and click OK. Compile and run the job. When you open the target output, the duplicate records have been removed.
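To make the logic behind those steps concrete, here is a rough sketch in plain Python of what hash partition + sort + unique does. It is not DataStage code and not how the engine is implemented; the function name `remove_duplicates`, the `num_partitions` parameter, and the sample rows are all made up for illustration.

```python
from itertools import groupby
from operator import itemgetter


def remove_duplicates(records, key, num_partitions=4):
    """Hash-partition on `key`, sort each partition, keep one row per key."""
    # Hash partitioning: rows with the same key value always land in the
    # same partition, so duplicates can never be split across partitions.
    partitions = [[] for _ in range(num_partitions)]
    for row in records:
        partitions[hash(row[key]) % num_partitions].append(row)

    result = []
    for part in partitions:
        # Sorting puts duplicates next to each other. Python's sort is
        # stable, so rows with equal keys keep their arrival order.
        part.sort(key=itemgetter(key))
        # "Unique": keep only the first row of each run of equal keys.
        for _, group in groupby(part, key=itemgetter(key)):
            result.append(next(group))
    return result


# Hypothetical sample data: ids 1 and 2 each appear twice.
rows = [
    {"id": 2, "name": "b"},
    {"id": 1, "name": "a"},
    {"id": 2, "name": "b-dup"},
    {"id": 1, "name": "a-dup"},
    {"id": 3, "name": "c"},
]
unique_rows = remove_duplicates(rows, key="id")
```

The point of hashing on the key first is that each partition can then be sorted and de-duplicated on its own without missing duplicates that sit in another partition. The order of the output rows depends on how the keys fall into partitions.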
