Flink Runtime Architecture (2)

Execution graph and task chains

Every Flink program consists of roughly three parts: source, transformation, and sink. Between operators, a stream can transfer data in one of two patterns:

One-to-one: The stream preserves its partitioning and the order of its elements (for example, between a source and a map operator). This means that a subtask of the map operator sees the same elements, in the same order, as the corresponding subtask of the source operator produced. Operators such as map, filter, and flatMap are one-to-one.
Redistributing: The partitioning of the stream changes. Each operator subtask sends data to different target subtasks depending on the selected transformation. For example, keyBy repartitions by the hash of the key, broadcast sends every element to all downstream subtasks, and rebalance redistributes elements round-robin. All of these operators trigger a redistribution, which is similar to the shuffle process in Spark.
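
The two patterns can be illustrated with a small simulation in plain Python. This is only a sketch and does not use the Flink API: the function names are hypothetical, the CRC32 hash is a stand-in for Flink's own key hashing, and rebalance is modeled with a single round-robin counter, which is a simplification. Each inner list represents the elements held by one subtask.

```python
import zlib


def forward(subtask_outputs):
    """One-to-one: downstream subtask i gets exactly what upstream subtask i produced, in order."""
    return [list(elements) for elements in subtask_outputs]


def key_by(subtask_outputs, key_fn, parallelism):
    """Redistributing by key: all elements with the same key go to the same target subtask."""
    targets = [[] for _ in range(parallelism)]
    for elements in subtask_outputs:
        for element in elements:
            # Assumption: CRC32 of the key's repr stands in for Flink's real key hashing.
            key_hash = zlib.crc32(repr(key_fn(element)).encode("utf-8"))
            targets[key_hash % parallelism].append(element)
    return targets


def rebalance(subtask_outputs, parallelism):
    """Redistributing round-robin: elements are spread evenly over the target subtasks."""
    targets = [[] for _ in range(parallelism)]
    next_target = 0
    for elements in subtask_outputs:
        for element in elements:
            targets[next_target].append(element)
            next_target = (next_target + 1) % parallelism
    return targets


def broadcast(subtask_outputs, parallelism):
    """Redistributing by replication: every target subtask receives every element."""
    all_elements = [e for elements in subtask_outputs for e in elements]
    return [list(all_elements) for _ in range(parallelism)]


if __name__ == "__main__":
    source = [[1, 2, 3], [4, 5, 6]]  # two source subtasks
    print("forward:  ", forward(source))
    print("keyBy:    ", key_by(source, lambda x: x % 2, 2))
    print("rebalance:", rebalance(source, 3))
    print("broadcast:", broadcast(source, 2))
```

Only forward keeps both the partitioning and the per-subtask order; the other three functions change which subtask an element ends up on, which is what makes them redistributing.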

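Condition: operators can be chained into a single task when they are connected one-to-one and have the same parallelism.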
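
A rough sketch of how operators might be grouped into chains is given below. It assumes the commonly cited rule that adjacent operators are chained when the connection between them is one-to-one and their parallelism is equal. Real Flink applies further checks (for example, chaining can be disabled by the user), so the `Operator` class and `build_chains` function here are hypothetical illustrations, not Flink code.

```python
from dataclasses import dataclass


@dataclass
class Operator:
    name: str
    parallelism: int
    # How data arrives from the previous operator: "forward" (one-to-one)
    # or "redistribute". Ignored for the first operator in the pipeline.
    input_pattern: str


def build_chains(operators):
    """Group a linear pipeline into chains; each chain would run as one task."""
    chains = []
    for op in operators:
        can_chain = (
            bool(chains)
            and op.input_pattern == "forward"
            and op.parallelism == chains[-1][-1].parallelism
        )
        if can_chain:
            chains[-1].append(op)  # same task as the previous operator
        else:
            chains.append([op])    # start a new task
    return [[op.name for op in chain] for chain in chains]


if __name__ == "__main__":
    pipeline = [
        Operator("source", 2, "forward"),
        Operator("map", 2, "forward"),
        Operator("keyBy/window", 2, "redistribute"),
        Operator("sink", 1, "forward"),
    ]
    print(build_chains(pipeline))
```

In this example the source and map operators end up in one chain, while the redistributing edge and the change in parallelism each start a new task.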

This article is reproduced from my personal blog post "Flink’s operating architecture (ii)" under the CC 4.0 BY-SA license.
