当前位置:   article > 正文

Kafka与Flink的整合 -- sink、source_kafka source flink

kafka source flink
1、首先导入依赖:
  1. <dependency>
  2. <groupId>org.apache.flink</groupId>
  3. <artifactId>flink-connector-kafka</artifactId>
  4. <version>1.15.2</version>
  5. </dependency>
2、 source:Flink从Kafka中读取数据
  1. public class Demo01KafkaSource {
  2. public static void main(String[] args) throws Exception{
  3. //构建环境
  4. StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
  5. //构建kafka source 环境
  6. KafkaSource<String> source = KafkaSource.<String>builder()
  7. //指定broker列表
  8. .setBootstrapServers("master:9092,node1:9092,node2:9092")
  9. //指定topic
  10. .setTopics("bigdata")
  11. //指定消费组
  12. .setGroupId("my-group")
  13. //指定数据的读取的位置,earliest指的是读取最早的数据,latest:指定的读取的是最新的数据
  14. .setStartingOffsets(OffsetsInitializer.earliest())
  15. //读取数据格式:
  16. .setValueOnlyDeserializer(new SimpleStringSchema())
  17. .build();
  18. //使用kafka数据源
  19. DataStreamSource<String> kafkaSourceDS = env.
  20. fromSource(source, WatermarkStrategy.noWatermarks(), "Kafka Source");
  21. kafkaSourceDS.print();
  22. //启动flink
  23. env.execute();
  24. }
  25. }
        启动生产kafka:
kafka-console-producer.sh --broker-list master:9092,node1:9092,node2:9092 --topic bigdata
3、sink:Flink向Kafka中写入数据
  1. public class Demo02KafkaSink {
  2. public static void main(String[] args) throws Exception{
  3. //构建flink的环境
  4. StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
  5. //读取数据文件:
  6. DataStreamSource<String> studentDS = env.readTextFile("flink/data/students.txt");
  7. //创建kafka sink
  8. KafkaSink<String> sink = KafkaSink.<String>builder()
  9. //指定flink broker列表
  10. .setBootstrapServers("master:9092,node1:9092,node2:9092")
  11. //指定数据的格式:
  12. .setRecordSerializer(KafkaRecordSerializationSchema.builder()
  13. //指定topic,如果topic不存在就会自动的创建一个分区是1个副本是1个的topic
  14. .setTopic("student")
  15. //指定数据的格式
  16. .setValueSerializationSchema(new SimpleStringSchema())
  17. .build()
  18. )
  19. //指定数据处理的语义:
  20. .setDeliverGuarantee(DeliveryGuarantee.AT_LEAST_ONCE)
  21. .build();
  22. //执行flink
  23. studentDS.sinkTo(sink);
  24. //构建flink环境
  25. env.execute();
  26. }
  27. }
        启动消费kafka:
kafka-console-consumer.sh --bootstrap-server  master:9092,node1:9092,node2:9092 --from-beginning --topic student

声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/article/detail/44163
推荐阅读
相关标签
  

闽ICP备14008679号