
A First Look at the ChunJun Data Synchronization Tool

ChunJun

ChunJun (纯钧): see the official ChunJun documentation.

ChunJun supports four run modes: local, standalone, yarn session, and yarn per-job.

Run mode / dependency   Flink environment   Hadoop environment
local                   ×                   ×
standalone              √                   ×
yarn session            √                   √
yarn per-job            √                   √

1. Download

The project publishes a precompiled plugin archive that can be downloaded directly from the releases page: https://github.com/DTStack/chunjun/releases

chunjun-dist-1.12-SNAPSHOT.tar.gz
 

2. Extract

First create the chunjun directory.

Then extract chunjun-dist-1.12-SNAPSHOT.tar.gz into that chunjun directory:

 tar -zxvf chunjun-dist-1.12-SNAPSHOT.tar.gz -C ../module/chunjun
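The two steps above (create the directory, then extract into it) can also be scripted. The following is a minimal sketch using only the Python standard library; the function name extract_dist is hypothetical, and the paths in the usage note are simply the ones used in this article.

```python
import tarfile
from pathlib import Path


def extract_dist(archive, target_dir):
    """Extract a .tar.gz archive into target_dir, creating the directory first.

    Returns the sorted names of the top-level entries found in target_dir
    after extraction.
    """
    target = Path(target_dir)
    # Equivalent of `mkdir -p`: `tar -C` fails if the directory is missing.
    target.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, mode="r:gz") as tar:
        tar.extractall(path=target)
    return sorted(entry.name for entry in target.iterdir())
```

For the archive in this article the call would be extract_dist("chunjun-dist-1.12-SNAPSHOT.tar.gz", "../module/chunjun").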

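Check the directory layout. The config directory is one I created myself (any name will do); it holds the job script files.

3. Example

MySQL -> HDFS (local)

Following the examples shipped with ChunJun (纯钧),

write the job script that syncs MySQL data to HDFS: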

vim  config/mysql_hdfs_polling.json

Script:

{
  "job": {
    "content": [
      {
        "reader": {
          "name": "mysqlreader",
          "parameter": {
            "column": [
              {
                "name": "group_id",
                "type": "varchar"
              },
              {
                "name": "company_id",
                "type": "varchar"
              },
              {
                "name": "group_name",
                "type": "varchar"
              }
            ],
            "username": "root",
            "password": "000000",
            "queryTimeOut": 2000,
            "connection": [
              {
                "jdbcUrl": [
                  "jdbc:mysql://192.168.233.130:3306/gmall?characterEncoding=UTF-8&autoReconnect=true&failOverReadOnly=false"
                ],
                "table": [
                  "cus_group_info"
                ]
              }
            ],
            "polling": false,
            "pollingInterval": 3000
          }
        },
        "writer": {
          "name": "hdfswriter",
          "parameter": {
            "fileType": "text",
            "path": "hdfs://192.168.233.130:8020/user/hive/warehouse/stg.db/cus_group_info",
            "defaultFS": "hdfs://192.168.233.130:8020",
            "fileName": "cus_group_info",
            "fieldDelimiter": ",",
            "encoding": "utf-8",
            "writeMode": "overwrite",
            "column": [
              {
                "name": "group_id",
                "type": "VARCHAR"
              },
              {
                "name": "company_id",
                "type": "VARCHAR"
              },
              {
                "name": "group_name",
                "type": "VARCHAR"
              }
            ]
          }
        }
      }
    ],
    "setting": {
      "speed": {
        "readerChannel": 1,
        "writerChannel": 1
      }
    }
  }
}
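In the script above, the reader and the writer list the same three columns in the same order. A small pre-flight check can catch a mismatch before the job is submitted. The sketch below is not part of ChunJun; the helper names (column_names, check_columns, check_job_file) are hypothetical, and the rule that names must match position by position is an assumption modeled on this example, not a documented ChunJun requirement.

```python
import json


def column_names(plugin):
    """Return the column names declared in a reader or writer block."""
    return [col["name"] for col in plugin["parameter"]["column"]]


def check_columns(job):
    """Compare reader and writer columns for every entry in job.content.

    Returns a list of human-readable problems; an empty list means the
    column lists have the same length and the same names in the same order.
    """
    problems = []
    for i, content in enumerate(job["job"]["content"]):
        reader_cols = column_names(content["reader"])
        writer_cols = column_names(content["writer"])
        if len(reader_cols) != len(writer_cols):
            problems.append(
                f"content[{i}]: column count differs "
                f"(reader {len(reader_cols)}, writer {len(writer_cols)})"
            )
        # zip stops at the shorter list, so only shared positions are compared.
        for pos, (r, w) in enumerate(zip(reader_cols, writer_cols)):
            if r != w:
                problems.append(
                    f"content[{i}] column {pos}: reader has '{r}', writer has '{w}'"
                )
    return problems


def check_job_file(path):
    """Load a job JSON file and run check_columns on it."""
    with open(path, encoding="utf-8") as fh:
        return check_columns(json.load(fh))
```

Calling check_job_file("config/mysql_hdfs_polling.json") on the script above should return an empty list, since both sides declare group_id, company_id, and group_name in that order.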

Start the job:

sh bin/chunjun-local.sh -job config/mysql_hdfs_polling.json 
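If the job is launched from a script rather than by hand, it can help to confirm that the job file is valid JSON before starting it. The sketch below wraps the same command shown above; build_local_command and run_local are hypothetical helper names, and chunjun_home is an assumed parameter pointing at the directory where the archive was extracted.

```python
import json
import subprocess
from pathlib import Path


def build_local_command(job_file, chunjun_home="."):
    """Return the argument list for a local-mode run of job_file.

    Raises ValueError (json.JSONDecodeError) if the file is not valid JSON,
    so a malformed script is rejected before the job is started.
    """
    job_path = Path(job_file)
    json.loads(job_path.read_text(encoding="utf-8"))
    script = Path(chunjun_home) / "bin" / "chunjun-local.sh"
    return ["sh", str(script), "-job", str(job_path)]


def run_local(job_file, chunjun_home="."):
    """Run the job in local mode and return the process exit code."""
    command = build_local_command(job_file, chunjun_home)
    return subprocess.run(command, check=False).returncode
```

From the chunjun directory, run_local("config/mysql_hdfs_polling.json") runs the same command as the line above.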

Run log:

Files on HDFS:

Data synchronization succeeded!
