Data Sink¶
You want to save results to a file - scoring a full dataset and writing the output to parquet or CSV for downstream analysis.
Spreadsheet equivalent
Like "Save As CSV" in Excel, but built into your pipeline so it runs automatically.
When to use
Use this for batch scoring - processing a full dataset and saving the results. For live API responses, use Output instead.
This node accepts a single input.
| Config | Description |
|---|---|
path |
Required. Output file path (e.g. outputs/scored_policies) |
format |
Required. "parquet" or "csv" |
If you provide a filename without a directory, it's written to outputs/. The format extension is added automatically if missing.
Example:
This writes the full scored dataset to outputs/scored_policies.parquet.
All columns are written
All columns from the input are written to the file. To control which columns are saved, add a Polars node upstream with selected_columns.
Overwrites existing files
If the file already exists, it is overwritten.
Multiple sinks
You can have multiple Data Sink nodes in a pipeline - for example, to write both a parquet file and a CSV, or to save results at different stages.
See also:
- Output - define the API response for live pricing