Scenario: Iterate on files and merge the content
The following Job iterates on a list of files then merges their content and displays the final 2-column content on the console.
Drop the following components onto the design workspace: tFileList, tFileInputDelimited, tUnite and tLogRow.
Connect the tFileList to the tFileInputDelimited using an Iterate connection and connect the other component using a row main link.
In the tFileList Basic settings view, browse to the directory, where the files to merge are stored.
The files are pretty basic and contain a list of countries and their respective score.
In the Case Sensitive field, select Yes to consider the letter case.
Select the tFileInputDelimited component, and display this component's Basic settings view.
Fill in the File Name/Stream field by using the Ctrl+Space bar combination to access the variable completion list, and selecting
tFileList.CURRENT_FILEPATH
from the global variable list to process all files from the directory defined in the tFileList.Click the Edit Schema button and set manually the 2-column schema to reflect the input files' content.
For this example, the 2 columns are Country and Points. They are both nullable. The Country column is of String type and the Points column is of Integer type.
Click OK to validate the setting and accept to propagate the schema throughout the Job.
Then select the tUnite component and display the Component view. Notice that the output schema strictly reflects the input schema and is read-only.
In the Basic settings view of tLogRow, select the Table option to display properly the output values.