Data Flow service

The Data Flow service runs data flows in batch and real-time (stream) modes. Data flows are data pipelines that read, transform, and write data. With data flows, you can, for example, run decisions, perform text analysis, and execute real-time aggregations.

Each data flow runs in either batch mode or real-time (stream) mode, depending on the type of data flow. Batch and real-time modes process data independently and do not affect each other. The more nodes you assign to a mode, the more processing capacity is available to data flows that run in that mode. For example, assigning more nodes to batch mode is useful for batch runs that require data-intensive computing.

Depending on the partitioning configuration of data flow instances, a data flow can process data on a different number of nodes than the number configured for the Data Flow service.
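As an illustration of why the two numbers can differ, the following minimal sketch assumes that work is split into partitions and that partitions are distributed across the available nodes in round-robin order. The function names, the node names, and the round-robin strategy are hypothetical and are not taken from the Data Flow service itself. The sketch only shows that a node with no partition assigned does no work.

```python
from collections import defaultdict


def assign_partitions(partition_count, node_names):
    """Distribute partition indexes across nodes in round-robin order.

    Illustrative assumption only: the real service may use a
    different assignment strategy.
    """
    if not node_names:
        return {}
    assignment = defaultdict(list)
    for partition in range(partition_count):
        node = node_names[partition % len(node_names)]
        assignment[node].append(partition)
    return dict(assignment)


def active_node_count(partition_count, node_names):
    """Count the nodes that received at least one partition."""
    return len(assign_partitions(partition_count, node_names))


if __name__ == "__main__":
    # Hypothetical node names for a service configured with four nodes.
    nodes = ["node-1", "node-2", "node-3", "node-4"]
    print(active_node_count(2, nodes))  # two partitions occupy two nodes
    print(active_node_count(8, nodes))  # eight partitions occupy all four nodes
```

Under these assumptions, a data flow with two partitions uses only two of the four configured nodes, while a data flow with eight partitions uses all four.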