Flink - 3. Splitting, Merging, and Joining Streams - 《大数据》 (Big Data)

Splitting streams
Merging streams
- window coGroup / join
- interval join
Dimension enrichment (enrich)

join (window join or interval join) emits only matched pairs, so it cannot output unmatched events. connect can.
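
As a rough illustration of how connect can surface unmatched events, the sketch below keys two connected streams by id, remembers whichever side arrives first in keyed state, and uses an event-time timer to report anything still waiting. The class name, the order/payment data, and the 5-second wait are all hypothetical, and the code assumes the Flink 1.13-era DataStream API (open(Configuration), executeAndCollect).

```java
import java.util.List;

import org.apache.flink.api.common.eventtime.WatermarkStrategy;
import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.configuration.Configuration;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.api.functions.co.KeyedCoProcessFunction;
import org.apache.flink.util.Collector;

// Hypothetical example: both streams carry (id, eventTimeMillis) pairs.
class ConnectUnmatchedSketch {
    // Assumed time to wait for the other side before reporting "unmatched".
    static final long WAIT_MS = 5_000L;

    static List<String> run() throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setParallelism(1);

        // Use the second tuple field as the event timestamp.
        WatermarkStrategy<Tuple2<String, Long>> byField = WatermarkStrategy
                .<Tuple2<String, Long>>forMonotonousTimestamps()
                .withTimestampAssigner((e, ts) -> e.f1);

        DataStream<Tuple2<String, Long>> orders = env
                .fromElements(Tuple2.of("order-1", 1000L), Tuple2.of("order-2", 2000L))
                .assignTimestampsAndWatermarks(byField);
        DataStream<Tuple2<String, Long>> payments = env
                .fromElements(Tuple2.of("order-1", 3000L), Tuple2.of("order-3", 4000L))
                .assignTimestampsAndWatermarks(byField);

        return orders.connect(payments)
                .keyBy(o -> o.f0, p -> p.f0)   // same key (the id) on both sides
                .process(new MatchFunction())
                .executeAndCollect(100);
    }

    static class MatchFunction
            extends KeyedCoProcessFunction<String, Tuple2<String, Long>, Tuple2<String, Long>, String> {
        private transient ValueState<Boolean> orderPending;
        private transient ValueState<Boolean> paymentPending;

        @Override
        public void open(Configuration parameters) {
            orderPending = getRuntimeContext().getState(
                    new ValueStateDescriptor<>("order-pending", Boolean.class));
            paymentPending = getRuntimeContext().getState(
                    new ValueStateDescriptor<>("payment-pending", Boolean.class));
        }

        @Override
        public void processElement1(Tuple2<String, Long> order, Context ctx, Collector<String> out)
                throws Exception {
            if (paymentPending.value() != null) {
                out.collect("matched " + order.f0);
                paymentPending.clear();
            } else {
                // Remember the order and wait for its payment.
                orderPending.update(true);
                ctx.timerService().registerEventTimeTimer(order.f1 + WAIT_MS);
            }
        }

        @Override
        public void processElement2(Tuple2<String, Long> payment, Context ctx, Collector<String> out)
                throws Exception {
            if (orderPending.value() != null) {
                out.collect("matched " + payment.f0);
                orderPending.clear();
            } else {
                // Remember the payment and wait for its order.
                paymentPending.update(true);
                ctx.timerService().registerEventTimeTimer(payment.f1 + WAIT_MS);
            }
        }

        @Override
        public void onTimer(long timestamp, OnTimerContext ctx, Collector<String> out) throws Exception {
            // Whatever is still pending when the timer fires never found a partner.
            if (orderPending.value() != null) {
                out.collect("unmatched order " + ctx.getCurrentKey());
                orderPending.clear();
            }
            if (paymentPending.value() != null) {
                out.collect("unmatched payment " + ctx.getCurrentKey());
                paymentPending.clear();
            }
        }
    }

    public static void main(String[] args) throws Exception {
        run().forEach(System.out::println);
    }
}
```

A window join or interval join over the same input would only produce the pair for order-1. Here the timer callback is what makes the two unmatched records visible; the output order is not guaranteed.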


Splitting streams

Method 1: split (deprecated; no longer available in Flink 1.13)

```java
SplitStream<SensorReading> splitStream = mapStream.split(new OutputSelector<SensorReading>() {
    @Override
    public Iterable<String> select(SensorReading value) {
        // Tag each reading as "high" or "low" by temperature.
        return value.getTemperature() > 30 ? Collections.singletonList("high") : Collections.singletonList("low");
    }
});

DataStream<SensorReading> highStream = splitStream.select("high");
DataStream<SensorReading> lowStream = splitStream.select("low");
```

Method 2: side outputs in a ProcessFunction

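```java
import org.apache.flink.streaming.api.datastream.DataStreamSource;
import org.apache.flink.streaming.api.datastream.SingleOutputStreamOperator;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.api.functions.ProcessFunction;
import org.apache.flink.util.Collector;
import org.apache.flink.util.OutputTag;

// Event (with a public `user` field) and ClickSource are assumed to be defined elsewhere in the project.
public class SplitStreamByOutputTag {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        DataStreamSource<Event> stream = env.addSource(new ClickSource());
        // An OutputTag must be an anonymous subclass (note the trailing {}) so Flink can capture its element type.
        OutputTag<Event> maryOutPut = new OutputTag<Event>("Mary-pv") {};
        OutputTag<Event> bobOutPut = new OutputTag<Event>("Bob-pv") {};
        SingleOutputStreamOperator<Event> processedStream = stream.process(new ProcessFunction<Event, Event>() {
            @Override
            public void processElement(Event event, Context context, Collector<Event> collector) throws Exception {
                if ("Mary".equals(event.user)) {
                    // Route Mary's events to the first side output.
                    context.output(maryOutPut, event);
                } else if ("Bob".equals(event.user)) {
                    // Route Bob's events to the second side output.
                    context.output(bobOutPut, event);
                } else {
                    // Everything else stays on the main output.
                    collector.collect(event);
                }
            }
        });
        processedStream.getSideOutput(maryOutPut).print("mary");
        processedStream.getSideOutput(bobOutPut).print("bob");
        processedStream.print("else");
        env.execute();
    }
}
```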

Merging streams

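Method 1: connect + coMap / coFlatMap / CoProcessFunction
- After connect, the two streams are only placed side by side in one ConnectedStreams. Each keeps its own data and type unchanged, and the two streams remain independent of each other.
- coMap / coFlatMap: apply a separate map or flatMap function to each of the two streams in the ConnectedStreams.
Method 2: union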

Differences between connect and union:

With union, all streams must have the same type. With connect, the types may differ and can be unified afterwards in the coMap.

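connect can only operate on two streams, while union can combine multiple streams.

```java
// connect + coMap/coFlatMap
ConnectedStreams<..., SensorReading> connectedStreams = warningStream.connect(lowTempStream);
// CoMapFunction takes three type parameters: the first stream's type, the second stream's type, and the merged output type
DataStream ...
```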
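
To make the connect/union contrast concrete, here is a minimal self-contained sketch. It unions three Integer streams in one call, then connects the result to a String stream and uses a CoMapFunction to bring both sides to a common String type. The class name and sample data are made up for illustration, and executeAndCollect assumes Flink 1.12 or later.

```java
import java.util.List;

import org.apache.flink.streaming.api.datastream.ConnectedStreams;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.api.functions.co.CoMapFunction;

// Hypothetical example contrasting union (same type, many streams)
// with connect (two streams, possibly different types).
class UnionVsConnectSketch {

    static List<String> run() throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setParallelism(1);

        DataStream<Integer> a = env.fromElements(1, 2);
        DataStream<Integer> b = env.fromElements(3);
        DataStream<Integer> c = env.fromElements(4);

        // union: any number of streams, but all must share one element type.
        DataStream<Integer> numbers = a.union(b, c);

        DataStream<String> labels = env.fromElements("x");

        // connect: exactly two streams, and their types may differ.
        ConnectedStreams<Integer, String> connected = numbers.connect(labels);

        // CoMapFunction<IN1, IN2, OUT>: map1 handles the first stream,
        // map2 the second, and both return the shared output type.
        DataStream<String> merged = connected.map(new CoMapFunction<Integer, String, String>() {
            @Override
            public String map1(Integer value) {
                return "int:" + value;
            }

            @Override
            public String map2(String value) {
                return "str:" + value;
            }
        });

        // Collect at most 100 results; ordering across inputs is not guaranteed.
        return merged.executeAndCollect(100);
    }

    public static void main(String[] args) throws Exception {
        run().forEach(System.out::println);
    }
}
```

Trying to union numbers with labels directly would not compile, because union requires a DataStream of the same element type.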
