Dataset stage/lookup file stage/file stage

When do one use dataset stage/lookup file stage/file stage in a parallel job?or in other way what is the significance of these files...wat are their differences?

Questions by chackozach

Showing Answers 1 - 1 of 1 Answers

DataSetFile: DataStage PX jobs uses the dataset file to organize the data.Dataset files preserves partitioning means u need not to partition the data when ever u read the data.It maintaines the persistent form.These are the operating system files.we can't read directly content of the file.Not in human readable form.There will be a control file whcih contains the path of the actual individual file.(extension .ds).
FileSet:A file set is collection of individual files distributed over the partitions.This is helpfull because some opeating systems impose the file size to 2 GB.This also preserves partitioning as dataset but it carrys formatting information of the data.It is in human readable form.
Lookup:Lookup stage is used to look up for other stage and as referene for database

  Was this answer useful?  Yes

Give your answer:

If you think the above answer is not correct, Please select a reason and add your answer below.

 

Related Answered Questions

 

Related Open Questions