Data Flow service

The Data Flow service enables running data flow instances on decision data nodes. Data flows are rules that sequence and combine data from various sources and write the results to a destination.

The Data Flow service is divided into Batch and Real Time services to better handle different types of data flow runs. When you run a data flow, you select whether it runs on the nodes in the Batch service or in the Real Time service. Data processing operations on the Batch nodes and the Real Time nodes are independent and do not affect each other. The more nodes a service has, the more processing capacity is available to the data flow runs on that service. For example, adding nodes is useful for batch runs that require data-intensive computing.

Depending on the partitioning configuration of data flow instances, the data flow can process data on a different number of nodes than the number configured for the Data Flow service.
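
The relationship between partitions and nodes can be illustrated with a small model. The following Python sketch is not part of the product and does not use any product API; the function names, the node names, and the round-robin assignment are assumptions chosen only to show why a run with fewer partitions than configured nodes ends up processing data on fewer nodes.

```python
from typing import Dict, List


def assign_partitions(partition_count: int, node_names: List[str]) -> Dict[str, List[int]]:
    """Spread partition indexes over nodes in round-robin order.

    Hypothetical model: the real service may distribute work differently.
    """
    if partition_count < 0:
        raise ValueError("partition_count must not be negative")
    assignment: Dict[str, List[int]] = {name: [] for name in node_names}
    if not node_names:
        return assignment
    for partition in range(partition_count):
        node = node_names[partition % len(node_names)]
        assignment[node].append(partition)
    return assignment


def active_nodes(assignment: Dict[str, List[int]]) -> List[str]:
    """Return the nodes that received at least one partition."""
    return [name for name, partitions in assignment.items() if partitions]


# Hypothetical Batch service with four configured nodes.
batch_nodes = ["batch-1", "batch-2", "batch-3", "batch-4"]

# A run with only two partitions keeps two of the four nodes busy.
two_partitions = assign_partitions(2, batch_nodes)

# A run with ten partitions uses all four nodes.
ten_partitions = assign_partitions(10, batch_nodes)
```

In this model, the number of nodes that process data is the smaller of the partition count and the configured node count, which is one way the two numbers can differ.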