- sqoop job增量导入mysql数据到hive下的表目录
- 初始化数据
- 创建密码文件,原因:oozie执行sqoop job任务时,使用—password的job任务,会需要手动输入密码,导致失败。
- (猜测,十几次反复测试的结果)
- 创建hive表,未使用password-file
- 删除任务
- 使用password-file,该文件需存放在HDFS上
- 定义job
- 查看job列表
- 执行任务
- 使用—password时,运行过程中(第一次执行时)需要输入,import任务里的数据库连接密码
- 使用—password-file时,不需要输入密码
- 查询
- 查看metastore
- 插入数据
- 再次执行,不需要输入密码
- 查询
- 查看metastore
- 上传任务
- 插入数据,再次测试
- start 需设为未来时间,否则任务失败,UTC时间。此时
- $ date -u
- Sat May 16 09:53:44 UTC 2020
- Hive脚本分析数据
- ${OUTPUT} -> /user/jack/oozie-apps/output-data/sqoop-mysql-hive-incr-cron
- 使用hive分析,使用sqoop导出结果到数据库
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-use-oozie-linux-mac
sqoop job增量导入mysql数据到hive下的表目录
启动sqoop的metastore
安装了sqoop metastore服务的主机上,启动metastore服务。
$ rm -rf metastore/* # 删除元数据$ bin/sqoop metastore &$ jps19796 Sqoop
当 last-value 失控的时候,可以删除元数据,重新规划。
定义sqoop的job
定义sqoop job,测试job并执行。
- https://blog.csdn.net/a904364908/article/details/98355032
- https://www.cnblogs.com/wrencai/p/3912462.html
- 定义sqoop job时,使用绝对路径调用,解决错误“Main class [org.apache.oozie.action.hadoop.SqoopMain], exit code [1]”
https://blog.csdn.net/dz77dz/article/details/86079816 - https://blog.csdn.net/Abysscarry/article/details/82120855
- 密码问题
https://blog.csdn.net/shujuelin/article/details/100265415 - Oozie Sqoop任务Mysql to Hive(with kerberos)
https://www.jianshu.com/p/f2ba19a01c57 ```bash $ ~/Documents/hive/bin/hive hive> drop table user; hive> dfs -rm -r -f /user/hive/warehouse/user;初始化数据
$ mysql -uroot -p123456 DROP TABLE IF EXISTSuser; CREATE TABLEuser(idint(11) NOT NULL,namevarchar(20) DEFAULT NULL, PRIMARY KEY (id) ) ENGINE=MyISAM DEFAULT CHARSET=latin1; INSERT INTOuserVALUES (‘1’, ‘jack’); INSERT INTOuserVALUES (‘2’, ‘tom’); INSERT INTOuserVALUES (‘3’, ‘white’); INSERT INTOuserVALUES (‘4’, ‘black’);
创建密码文件,原因:oozie执行sqoop job任务时,使用—password的job任务,会需要手动输入密码,导致失败。
(猜测,十几次反复测试的结果)
$ echo -n “123456” > 130mysql.pwd
创建hive表,未使用password-file
$ bin/sqoop create-hive-table \ —connect jdbc:mysql://master:3306/mydb \ —username root \ —password 123456 \ —table user \ —hive-table user \ —mysql-delimiters
删除任务
$ bin/sqoop job —delete user_import_incr
使用password-file,该文件需存放在HDFS上
$ ~/Documents/hadoop/bin/hadoop fs -put 130mysql.pwd
定义job
$ bin/sqoop job \ —create user_import_incr \ —meta-connect jdbc:hsqldb:hsql://master:16000/sqoop \ — \ import \ —connect jdbc:mysql://master:3306/mydb \ —username root \ —password-file 130mysql.pwd \ —table user \ —mysql-delimiters \ —incremental append \ —check-column id \ —last-value 0 \ —target-dir /user/hive/warehouse/user \ -m 1
查看job列表
$ bin/sqoop job —list Available jobs: user_import_incr
执行任务
$ bin/sqoop job —exec user_import_incr
使用—password时,运行过程中(第一次执行时)需要输入,import任务里的数据库连接密码
Enter password:
使用—password-file时,不需要输入密码
查询
$ ~/Documents/hive/bin/hive hive> select * from user; OK 1 jack 2 tom 3 white 4 black
查看metastore
$ cat metastore/shared.db.log | grep last INSERT INTO SQOOP_SESSIONS VALUES(‘user_import_incr’,’incremental.last.value’,’0’,’SqoopOptions’) DELETE FROM SQOOP_SESSIONS WHERE JOB_NAME=’user_import_incr’ AND PROPNAME=’incremental.last.value’ AND PROPVAL=’0’ AND PROPCLASS=’SqoopOptions’ INSERT INTO SQOOP_SESSIONS VALUES(‘user_import_incr’,’incremental.last.value’,’4’,’SqoopOptions’)
插入数据
mysql> insert into user values (5, ‘ming’),(6, ‘ning’);
再次执行,不需要输入密码
$ bin/sqoop job —exec user_import_incr $ ~/Documents/hive/bin/hive
查询
hive> select * from user_job; OK 1 jack 2 tom 3 white 4 black 5 ming 6 ning hive> dfs -ls /user/hive/warehouse/user; Found 2 items -rw-r—r— 1 jack supergroup 29 2020-05-16 01:46 /user/hive/warehouse/user/part-m-00000 -rw-r—r— 1 jack supergroup 14 2020-05-16 01:47 /user/hive/warehouse/user/part-m-00001
查看metastore
$ cat shared.db.log | grep last DELETE FROM SQOOP_SESSIONS WHERE JOB_NAME=’user_import_incr’ AND PROPNAME=’incremental.last.value’ AND PROPVAL=’4’ AND PROPCLASS=’SqoopOptions’ INSERT INTO SQOOP_SESSIONS VALUES(‘user_import_incr’,’incremental.last.value’,’6’,’SqoopOptions’)
<a name="gGRyb"></a>## 定义oozie的workflow任务oozie任务调度sqoop增量导入任务- [https://blog.csdn.net/eason_oracle/article/details/78330755](https://blog.csdn.net/eason_oracle/article/details/78330755)```bash$ mkdir oozie-apps/sqoop-mysql2hdfs-wf/# 或者$ cp -r oozie-apps/sqoop-freeform-mysql/ oozie-apps/sqoop-mysql2hdfs-wf$ ll oozie-apps/sqoop-freeform-mysql/total 860-rw-r--r--. 1 jack jack 1024 May 13 20:36 job.properties-rw-r--r--. 1 jack jack 872303 May 13 22:08 mysql-connector-java-5.1.27-bin.jar-rw-r--r--. 1 jack jack 2498 May 13 22:09 workflow.xml[jack@master oozie]$ cp -r examples/apps/sqoop oozie-apps/sqoop-mysql-hive-incr[jack@master oozie]$ cd oozie-apps/sqoop-mysql-hive-incr/[jack@master sqoop-mysql-hive-incr]$ lltotal 16-rw-r--r--. 1 jack jack 1208 May 15 22:35 db.hsqldb.properties-rw-r--r--. 1 jack jack 1081 May 15 22:35 db.hsqldb.script-rw-r--r--. 1 jack jack 1002 May 15 22:35 job.properties-rw-r--r--. 1 jack jack 2085 May 15 22:35 workflow.xml[jack@master sqoop-mysql-hive-incr]$ rm -rf db.hsqldb.*
需要两个文件:
- job.properties
- workflow.xml
job.properties
``` nameNode=hdfs://192.168.32.130:8020 jobTracker=192.168.32.130:8032 queueName=default examplesRoot=oozie-apps
oozie.use.system.libpath=true
oozie.wf.application.path=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr
<a name="u4WR6"></a>### workflow.xmlfile元素放在archive元素前面。```xml<workflow-app xmlns="uri:oozie:workflow:0.2" name="sqoop-wf"><start to="sqoop-node"/><action name="sqoop-node"><sqoop xmlns="uri:oozie:sqoop-action:0.2"><job-tracker>${jobTracker}</job-tracker><name-node>${nameNode}</name-node><configuration><property><name>mapred.job.queue.name</name><value>${queueName}</value></property></configuration><arg>job</arg><arg>--meta-connect</arg><arg>jdbc:hsqldb:hsql://192.168.32.130:16000/sqoop</arg><arg>--exec</arg><arg>user_import_incr</arg><archive>mysql-connector-java-5.1.27-bin.jar</archive></sqoop><ok to="end"/><error to="fail"/></action><kill name="fail"><message>Sqoop failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message></kill><end name="end"/></workflow-app>
拷贝必要文件
- mysql-connector-jdbc.jar文件
# jdbc$ cp ~/Documents/sqoop/lib/mysql-connector-java-5.1.27-bin.jar oozie-apps/sqoop-mysql-hive-incr/
执行测试
```bash $ mysql -uroot -p mysql> insert into user values (7, ‘zhangsan’),(8, ‘lisi’); $ ~/Documents/hive/bin/hive hive> dfs -rm -r -f /user/jack/oozie-apps/sqoop-mysql-hive-incr; Deleted /user/jack/oozie-apps/sqoop-mysql-hive-incr上传任务
$ ~/Documents/hadoop/bin/hadoop fs -put -f oozie-apps/sqoop-mysql-hive-incr/ oozie-apps/ $ export OOZIE_URL=”http://192.168.32.130:11000/oozie“ $ bin/oozie job -config oozie-apps/sqoop-mysql-hive-incr/job.properties -run -verbose -debug -auth kerberos
$ bin/oozie job -info 0000017-200513181559469-oozie-jack-W
$ ~/Document/hive/bin/hive
hive> select * from user_job;
OK
2 tom
3 white
4 black
5 ming
6 ning
7 zhangsan
8 lisi
hive> dfs -ls /user/hive/warehouse/user/;
Found 3 items
-rw-r—r— 1 jack supergroup 29 2020-05-15 05:02 /user/hive/warehouse/user/part-m-00000
-rw-r—r— 1 jack supergroup 14 2020-05-15 05:06 /user/hive/warehouse/user/part-m-00001
-rw-r—r— 1 jack supergroup 18 2020-05-15 05:21 /user/hive/warehouse/user/part-m-00002
插入数据,再次测试
mysql> insert into user values (9, ‘wangwu’),(10, ‘zhaoliu’);
$ export OOZIE_URL=”http://192.168.32.130:11000/oozie“
$ bin/oozie job -config oozie-apps/sqoop-mysql-hive-incr/job.properties -run -verbose -debug
$ bin/oozie job -info 0000023-200513181559469-oozie-jack-W
$ ~/Document/hive/bin/hive
hive> dfs -ls /user/hive/warehouse/user/;
Found 4 items
-rw-r—r— 1 jack supergroup 29 2020-05-16 01:46 /user/hive/warehouse/user/part-m-00000
-rw-r—r— 1 jack supergroup 14 2020-05-16 01:47 /user/hive/warehouse/user/part-m-00001
-rw-r—r— 1 jack supergroup 18 2020-05-16 02:17 /user/hive/warehouse/user/part-m-00002
-rw-r—r— 1 jack supergroup 20 2020-05-16 02:21 /user/hive/warehouse/user/part-m-00003
hive> select * from user;
OK
1 jack
2 tom
3 white
4 black
5 ming
6 ning
7 zhangsan
8 lisi
9 wangwu
10 zhaoliu
<a name="qcJuf"></a>## 定义oozie的coordination任务```bash$ cp -r examples/apps/cron-schedule/ oozie-apps/sqoop-mysql-hive-incr-cron# 复制jdbc$ cp oozie-apps/sqoop-mysql-hive-incr/mysql-connector-java-5.1.27-bin.jar \oozie-apps/sqoop-mysql-hive-incr-cron/$ ll oozie-apps/sqoop-mysql-hive-incr-crontotal 864-rw-r--r--. 1 jack jack 1591 May 16 02:49 coordinator.xml-rw-r--r--. 1 jack jack 1285 May 16 02:56 job.properties-rw-r--r--. 1 jack jack 872303 May 16 03:03 mysql-connector-java-5.1.27-bin.jar-rw-r--r--. 1 jack jack 1841 May 16 02:46 workflow.xml
job.properties
时间以UTC为准,本身oozie没有做时间配置,时区默认为UTC。
注意:
- oozie.coord.application.path
- workflowAppUri ``` nameNode=hdfs://192.168.32.130:8020 jobTracker=192.168.32.130:8032 queueName=default examplesRoot=oozie-apps
oozie.use.system.libpath=true
oozie.coord.application.path=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr-cron
start 需设为未来时间,否则任务失败,UTC时间。此时
$ date -u
Sat May 16 09:53:44 UTC 2020
start=2020-05-16T10:00Z end=2020-05-16T010:20Z
workflowAppUri=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr-cron
<a name="LAp32"></a>### workflow.xml内容不变,可直接覆盖。```xml<workflow-app xmlns="uri:oozie:workflow:0.2" name="sqoop-wf"><start to="sqoop-node"/><action name="sqoop-node"><sqoop xmlns="uri:oozie:sqoop-action:0.2"><job-tracker>${jobTracker}</job-tracker><name-node>${nameNode}</name-node><configuration><property><name>mapred.job.queue.name</name><value>${queueName}</value></property></configuration><arg>job</arg><arg>--meta-connect</arg><arg>jdbc:hsqldb:hsql://192.168.32.130:16000/sqoop</arg><arg>--exec</arg><arg>user_import_incr</arg><archive>mysql-connector-java-5.1.27-bin.jar</archive></sqoop><ok to="end"/><error to="fail"/></action><kill name="fail"><message>Sqoop failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message></kill><end name="end"/></workflow-app>
coordinator.xml
5分钟执行一次。
<coordinator-app name="cron-coord" frequency="${coord:minutes(5)}" start="${start}" end="${end}" timezone="UTC"xmlns="uri:oozie:coordinator:0.2"><action><workflow><app-path>${workflowAppUri}</app-path><configuration><property><name>jobTracker</name><value>${jobTracker}</value></property><property><name>nameNode</name><value>${nameNode}</value></property><property><name>queueName</name><value>${queueName}</value></property></configuration></workflow></action></coordinator-app>
frequency="0/10 * * * *" 每10分钟执行一次 。
测试执行
# 每过5分钟插入数据$ mysql -uroot -pmysql> insert into user values (11, 'aaaaa'),(12, 'bbbbb');mysql> insert into user values (13, 'ccccc'),(14, 'ddddd');# 上传,发布任务$ ~/Documents/hadoop/bin/hadoop fs -rm -r -f oozie-apps/sqoop-mysql-hive-incr-cron/$ ~/Documents/hadoop/bin/hadoop fs -put -f oozie-apps/sqoop-mysql-hive-incr-cron/ oozie-apps/$ export OOZIE_URL="http://192.168.32.130:11000/oozie"$ bin/oozie job -config oozie-apps/sqoop-mysql-hive-incr-cron/job.properties -run -verbose -debug$ bin/oozie job -info 0000068-200513181559469-oozie-jack-CJob ID : 0000068-200513181559469-oozie-jack-C------------------------------------------------------------------------------------------------------------------------------------Job Name : cron-coordApp Path : hdfs://192.168.32.130:8020/user/jack/oozie-apps/sqoop-mysql-hive-incr-cronStatus : RUNNINGStart Time : 2020-05-16 10:00 GMTEnd Time : 2020-05-16 10:20 GMTPause Time : -Concurrency : 1------------------------------------------------------------------------------------------------------------------------------------ID Status Ext ID Err Code Created Nominal Time0000068-200513181559469-oozie-jack-C@1 WAITING - - 2020-05-16 09:55 GMT 2020-05-16 10:00 GMT------------------------------------------------------------------------------------------------------------------------------------0000068-200513181559469-oozie-jack-C@2 WAITING - - 2020-05-16 09:55 GMT 2020-05-16 10:05 GMT------------------------------------------------------------------------------------------------------------------------------------0000068-200513181559469-oozie-jack-C@3 WAITING - - 2020-05-16 09:55 GMT 2020-05-16 10:10 GMT------------------------------------------------------------------------------------------------------------------------------------0000068-200513181559469-oozie-jack-C@4 WAITING - - 2020-05-16 09:55 GMT 2020-05-16 10:15 GMT------------------------------------------------------------------------------------------------------------------------------------# 杀死进程$ bin/oozie job --kill 0000068-200513181559469-oozie-jack-C# oozie日志$ tail logs/oozie.log
Hive脚本分析数据
修改workflow任务支持hive hql脚本分析
job-non-coor.properties
定义一个非周期性地workflow任务,进行流程测试。
nameNode=hdfs://192.168.32.130:8020jobTracker=192.168.32.130:8032queueName=defaultexamplesRoot=oozie-appsoozie.use.system.libpath=trueoozie.wf.application.path=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr-cron
workflow.xml
添加hive action,注意输出目录。
- hive.metastore.uris - 不加,报错 【Main class [org.apache.oozie.action.hadoop.HiveMain], exit code [40000]】很莫名其妙!示例没问题!
<workflow-app xmlns="uri:oozie:workflow:0.2" name="sqoop-wf"><start to="sqoop-node"/><action name="sqoop-node"><sqoop xmlns="uri:oozie:sqoop-action:0.2"><job-tracker>${jobTracker}</job-tracker><name-node>${nameNode}</name-node><configuration><property><name>mapred.job.queue.name</name><value>${queueName}</value></property></configuration><arg>job</arg><arg>--meta-connect</arg><arg>jdbc:hsqldb:hsql://192.168.32.130:16000/sqoop</arg><arg>--exec</arg><arg>user_import_incr</arg><archive>mysql-connector-java-5.1.27-bin.jar</archive></sqoop><ok to="hive-node"/><error to="fail-sqoop-node"/></action><action name="hive-node"><hive xmlns="uri:oozie:hive-action:0.2"><job-tracker>${jobTracker}</job-tracker><name-node>${nameNode}</name-node><prepare><delete path="${nameNode}/user/${wf:user()}/${examplesRoot}/output-data/sqoop-mysql-hive-incr-cron"/><mkdir path="${nameNode}/user/${wf:user()}/${examplesRoot}/output-data"/></prepare><configuration><property><name>mapred.job.queue.name</name><value>${queueName}</value></property><property><name>hive.metastore.uris</name><value>thrift://192.168.32.130:9083</value></property></configuration><script>script.q</script><param>OUTPUT=/user/${wf:user()}/${examplesRoot}/output-data/sqoop-mysql-hive-incr-cron</param></hive><ok to="end"/><error to="fail-hive-node"/></action><kill name="fail-sqoop-node"><message>Sqoop failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message></kill><kill name="fail-hive-node"><message>Hive failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message></kill><end name="end"/></workflow-app>
script.q
-- hive> desc user;-- OK-- id int-- name string-- 查询所有name长度大于4的记录INSERT OVERWRITE DIRECTORY '${OUTPUT}' SELECT id FROM default.user WHERE length(name) > 4;
测试(修改
${OUTPUT}进行测试) ```bash $ cp ~/Documents/oozie/oozie-apps/sqoop-mysql-hive-incr-cron/script.q ~/Documents/hive/hql
${OUTPUT} -> /user/jack/oozie-apps/output-data/sqoop-mysql-hive-incr-cron
$ ~/Documents/hive/bin/hive -f ~/Documents/hive/hql/script.q
<a name="JL2Ev"></a>### 执行流程测试```bash# 启动hive metastore$ ~/Documents/hive/bin/hive --service metastore# 启动sqoop metastore$ ~/Documents/sqoop/bin/sqoop metastore# 上传,发布任务$ ~/Documents/hadoop/bin/hadoop fs -rm -r -f oozie-apps/sqoop-mysql-hive-incr-cron/$ ~/Documents/hadoop/bin/hadoop fs -put -f oozie-apps/sqoop-mysql-hive-incr-cron/ oozie-apps/$ export OOZIE_URL="http://192.168.32.130:11000/oozie"$ bin/oozie job -config oozie-apps/sqoop-mysql-hive-incr-cron/job-non-coor.properties -run -verbose -debug$ $ bin/oozie job -info 0000105-200513181559469-oozie-jack-WJob ID : 0000105-200513181559469-oozie-jack-W------------------------------------------------------------------------------------------------------------------------------------Workflow Name : sqoop-wfApp Path : hdfs://192.168.32.130:8020/user/jack/oozie-apps/sqoop-mysql-hive-incr-cronStatus : SUCCEEDEDRun : 0User : jackGroup : -Created : 2020-05-18 06:34 GMTStarted : 2020-05-18 06:34 GMTLast Modified : 2020-05-18 06:34 GMTEnded : 2020-05-18 06:34 GMTCoordAction ID: -Actions------------------------------------------------------------------------------------------------------------------------------------ID Status Ext ID Ext Status Err Code------------------------------------------------------------------------------------------------------------------------------------0000105-200513181559469-oozie-jack-W@:start: OK - OK -------------------------------------------------------------------------------------------------------------------------------------0000105-200513181559469-oozie-jack-W@sqoop-node OK job_1588552715711_0212 SUCCEEDED -------------------------------------------------------------------------------------------------------------------------------------0000105-200513181559469-oozie-jack-W@hive-node OK job_1588552715711_0213 SUCCEEDED -------------------------------------------------------------------------------------------------------------------------------------0000105-200513181559469-oozie-jack-W@end OK - OK -------------------------------------------------------------------------------------------------------------------------------------# 查看结果$ ~/Documents/hadoop/bin/hadoop fs -cat /user/jack/oozie-apps/output-data/sqoop-mysql-hive-incr-cron/*34791011121314
执行周期流程测试
job.properties
nameNode=hdfs://192.168.32.130:8020jobTracker=192.168.32.130:8032queueName=defaultexamplesRoot=oozie-appsoozie.use.system.libpath=trueoozie.coord.application.path=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr-cron# start 需设为未来时间,否则任务失败,UTC时间。此时# $ date -u# Mon May 18 03:46:36 UTC 2020start=2020-05-18T03:50Zend=2020-05-18T04:10ZworkflowAppUri=${nameNode}/user/${user.name}/${examplesRoot}/sqoop-mysql-hive-incr-cron
执行测试
此时,时间已过,仍然直接运行。很莫名其妙!
$ bin/oozie job -config oozie-apps/sqoop-mysql-hive-incr-cron/job.properties -run -verbose -debug$ date -uMon May 18 06:44:12 UTC 2020$ bin/oozie job -info 0000106-200513181559469-oozie-jack-CJob ID : 0000106-200513181559469-oozie-jack-C------------------------------------------------------------------------------------------------------------------------------------Job Name : cron-coordApp Path : hdfs://192.168.32.130:8020/user/jack/oozie-apps/sqoop-mysql-hive-incr-cronStatus : RUNNINGStart Time : 2020-05-18 03:50 GMTEnd Time : 2020-05-18 04:10 GMTPause Time : -Concurrency : 1------------------------------------------------------------------------------------------------------------------------------------ID Status Ext ID Err Code Created Nominal Time0000106-200513181559469-oozie-jack-C@1 RUNNING 0000107-200513181559469-oozie-jack-W - 2020-05-18 06:43 GMT 2020-05-18 03:50 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@2 READY - - 2020-05-18 06:43 GMT 2020-05-18 03:55 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@3 READY - - 2020-05-18 06:43 GMT 2020-05-18 04:00 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@4 READY - - 2020-05-18 06:43 GMT 2020-05-18 04:05 GMT------------------------------------------------------------------------------------------------------------------------------------
# 上传任务,测试,每过5分钟插入2条数据$ mysql -uroot -pmysql> insert into user values (15, '222'),(16, '222222');mysql> insert into user values (17, '333'),(18, '333333');$ bin/oozie job -info 0000106-200513181559469-oozie-jack-C------------------------------------------------------------------------------------------------------------------------------------ID Status Ext ID Err Code Created Nominal Time0000106-200513181559469-oozie-jack-C@1 SUCCEEDED 0000107-200513181559469-oozie-jack-W - 2020-05-18 06:43 GMT 2020-05-18 03:50 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@2 SUCCEEDED 0000108-200513181559469-oozie-jack-W - 2020-05-18 06:43 GMT 2020-05-18 03:55 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@3 SUCCEEDED 0000109-200513181559469-oozie-jack-W - 2020-05-18 06:43 GMT 2020-05-18 04:00 GMT------------------------------------------------------------------------------------------------------------------------------------0000106-200513181559469-oozie-jack-C@4 SUCCEEDED 0000110-200513181559469-oozie-jack-W - 2020-05-18 06:43 GMT 2020-05-18 04:05 GMT------------------------------------------------------------------------------------------------------------------------------------$ ~/Documents/hadoop/bin/hadoop fs -cat /user/jack/oozie-apps/output-data/sqoop-mysql-hive-incr-cron/*# 此时多了,16/18347910111213141618
