Friday, January 7, 2011

DS - Debug and Development stages

Row generator produces a set of test data which fits the specified metadata (can be random or cycled through a specified list of values). Useful for testing and development.
Column generator adds one or more column to the incoming flow and generates test data for this column.
Peek stage prints record column values to the job log which can be viewed in Director. It can have a single input link and multiple output links.
Sample stage samples an input data set. Operates in two modes: percent mode and period mode.

Head selects the first N rows from each partition of an input data set and copies them to an output data set.

Tail is similiar to the Head stage. It select the last N rows from each partition.
Write Range Map writes a data set in a form usable by the range partitioning method.

No comments:

Post a Comment