FAQ-TaskSetManager: Lost task 0.0 in stage 0.0

问题描述/异常栈

TaskSetManager: Lost task 0.0 in stage 0.0
21/06/17 15:27:36 WARN TaskSetManager: Lost task 0.0 in stage 0.0 (TID 0, bigdata016.dmp.jztweb.com, executor 1): ru.yandex.clickhouse.except.ClickHouseException: ClickHouse exception, code: 27, host: 10.3.82.23, port: 8123; Code: 27, e.displayText() = DB::ParsingException: Cannot parse input: expected '\t' before: '.0000000000\t50.0000000000\t\\N\t华北制药股份有限公司\t华北制药股份\t03-S13047\t80万U\tAB\t处方药\t有\t\\N\t\\N\t抗生素类\t\\N\t2\t24个月\t\\N\t\\N\t\\N\t\\N\tXY':
Row 1:
Column 0,   name: pk,                          type: Int64,              parsed text: "1"
Column 1,   name: createtime,                  type: DateTime,           parsed text: "2010-12-03 21:28:27"
Column 2,   name: lastmodifytime,              type: DateTime,           parsed text: "2021-06-17 11:22:35"
Column 3,   name: branchid,                    type: String,             parsed text: "FH1"
Column 4,   name: deleteflag,                  type: Nullable(Int8),     parsed text: "0"
Column 5,   name: note,                        type: Nullable(String),   parsed text: "<BACKSLASH>N"
Column 6,   name: prodid,                      type: String,             parsed text: "SPH00000001"
Column 7,   name: blocid,                      type: Nullable(String),   parsed text: "45762"
Column 8,   name: nationcode,                  type: Nullable(String),   parsed text: "AAA01011D"
Column 9,   name: prodno,                      type: Nullable(String),   parsed text: "AAA001030D"
Column 10,  name: prodbarcode,                 type: Nullable(String),   parsed text: "6938588800016"
Column 11,  name: prodname,                    type: Nullable(String),   parsed text: "注射用青霉素钠"
Column 12,  name: prodlocalname,               type: Nullable(String),   parsed text: "注射用青霉素钠"
Column 13,  name: prodenglishname,             type: Nullable(String),   parsed text: "<BACKSLASH>N"
Column 14,  name: prodmemorycode,              type: Nullable(String),   parsed text: "ZSYQMSN"
Column 15,  name: proddosageformno,            type: Nullable(String),   parsed text: "009"
Column 16,  name: proddosageformnotext,        type: Nullable(String),   parsed text: "粉针剂"
Column 17,  name: packageunit,                 type: Nullable(String),   parsed text: "支"
Column 18,  name: bigpackagequantity,          type: Int64,              parsed text: "1000"
ERROR: garbage after Int64: ".000000000"

: (at row 1)
 (version 21.6.4.26 (official build))

    at ru.yandex.clickhouse.except.ClickHouseExceptionSpecifier.specify(ClickHouseExceptionSpecifier.java:58)
    at ru.yandex.clickhouse.except.ClickHouseExceptionSpecifier.specify(ClickHouseExceptionSpecifier.java:28)
    at ru.yandex.clickhouse.ClickHouseStatementImpl.checkForErrorAndThrow(ClickHouseStatementImpl.java:875)
    at ru.yandex.clickhouse.ClickHouseStatementImpl.sendStream(ClickHouseStatementImpl.java:851)
    at ru.yandex.clickhouse.ClickHouseStatementImpl.sendStream(ClickHouseStatementImpl.java:824)
    at ru.yandex.clickhouse.ClickHouseStatementImpl.sendStream(ClickHouseStatementImpl.java:817)
    at ru.yandex.clickhouse.ClickHousePreparedStatementImpl.executeBatch(ClickHousePreparedStatementImpl.java:335)
    at ru.yandex.clickhouse.ClickHousePreparedStatementImpl.executeBatch(ClickHousePreparedStatementImpl.java:320)
    at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$.savePartition(JdbcUtils.scala:654)
    at com.netease.music.da.transfer.jdbc.writer.JDBCWriter$$anonfun$doWrite$1.apply(JDBCWriter.scala:128)
    at com.netease.music.da.transfer.jdbc.writer.JDBCWriter$$anonfun$doWrite$1.apply(JDBCWriter.scala:126)
    at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$29.apply(RDD.scala:935)
    at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$29.apply(RDD.scala:935)
    at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:2082)
    at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:2082)
    at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)
    at org.apache.spark.scheduler.Task.run(Task.scala:109)
    at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:345)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.Throwable: Code: 27, e.displayText() = DB::ParsingException: Cannot parse input: expected '\t' before: '.0000000000\t50.0000000000\t\\N\t华北制药股份有限公司\t华北制药股份\t03-S13047\t80万U\tAB\t处方药\t有\t\\N\t\\N\t抗生素类\t\\N\t2\t24个月\t\\N\t\\N\t\\N\t\\N\tXY':
Row 1:
Column 0,   name: pk,                          type: Int64,              parsed text: "1"
Column 1,   name: createtime,                  type: DateTime,           parsed text: "2010-12-03 21:28:27"
Column 2,   name: lastmodifytime,              type: DateTime,           parsed text: "2021-06-17 11:22:35"
Column 3,   name: branchid,                    type: String,             parsed text: "FH1"
Column 4,   name: deleteflag,                  type: Nullable(Int8),     parsed text: "0"
Column 5,   name: note,                        type: Nullable(String),   parsed text: "<BACKSLASH>N"
Column 6,   name: prodid,                      type: String,             parsed text: "SPH00000001"
Column 7,   name: blocid,                      type: Nullable(String),   parsed text: "45762"
Column 8,   name: nationcode,                  type: Nullable(String),   parsed text: "AAA01011D"
Column 9,   name: prodno,                      type: Nullable(String),   parsed text: "AAA001030D"
Column 10,  name: prodbarcode,                 type: Nullable(String),   parsed text: "6938588800016"
Column 11,  name: prodname,                    type: Nullable(String),   parsed text: "注射用青霉素钠"
Column 12,  name: prodlocalname,               type: Nullable(String),   parsed text: "注射用青霉素钠"
Column 13,  name: prodenglishname,             type: Nullable(String),   parsed text: "<BACKSLASH>N"
Column 14,  name: prodmemorycode,              type: Nullable(String),   parsed text: "ZSYQMSN"
Column 15,  name: proddosageformno,            type: Nullable(String),   parsed text: "009"
Column 16,  name: proddosageformnotext,        type: Nullable(String),   parsed text: "粉针剂"
Column 17,  name: packageunit,                 type: Nullable(String),   parsed text: "支"
Column 18,  name: bigpackagequantity,          type: Int64,              parsed text: "1000"
ERROR: garbage after Int64: ".000000000"

: (at row 1)
 (version 21.6.4.26 (official build))

    at ru.yandex.clickhouse.except.ClickHouseExceptionSpecifier.specify(ClickHouseExceptionSpecifier.java:53)
    ... 20 more

21/06/17 15:27:36 INFO TaskSetManager: Starting task 0.1 in stage 0.0 (TID 1, bigdata016.dmp.jztweb.com, executor 1, partition 0, PROCESS_LOCAL, 327379 bytes)
21/06/17 15:27:40 ERROR RangerAdminRESTClient: Error getting policies. request=http://10.4.14.73:6080, serviceName=easyops-hive
org.apache.ranger.com.sun.jersey.api.client.ClientHandlerException: java.lang.ClassCastException: org.apache.hadoop.fs.FsUrlConnection cannot be cast to java.net.HttpURLConnection
    at org.apache.ranger.com.sun.jersey.client.urlconnection.URLConnectionClientHandler.handle(URLConnectionClientHandler.java:149)
    at org.apache.ranger.com.sun.jersey.api.client.Client.handle(Client.java:648)
    at org.apache.ranger.com.sun.jersey.api.client.WebResource.handle(WebResource.java:670)
    at org.apache.ranger.com.sun.jersey.api.client.WebResource.access$200(WebResource.java:74)
    at org.apache.ranger.com.sun.jersey.api.client.WebResource$Builder.get(WebResource.java:503)
    at org.apache.ranger.admin.client.RangerAdminRESTClient.getServicePoliciesIfUpdated(RangerAdminRESTClient.java:76)
    at org.apache.ranger.plugin.util.PolicyRefresher.loadPolicyfromPolicyAdmin(PolicyRefresher.java:221)
    at org.apache.ranger.plugin.util.PolicyRefresher.loadPolicy(PolicyRefresher.java:189)
    at org.apache.ranger.plugin.util.PolicyRefresher.run(PolicyRefresher.java:162)
Caused by: java.lang.ClassCastException: org.apache.hadoop.fs.FsUrlConnection cannot be cast to java.net.HttpURLConnection
    at org.apache.ranger.com.sun.jersey.client.urlconnection.URLConnectionClientHandler._invoke(URLConnectionClientHandler.java:157)
    at org.apache.ranger.com.sun.jersey.client.urlconnection.URLConnectionClientHandler.handle(URLConnectionClientHandler.java:147)
    ... 8 more
21/06/17 15:27:40 WARN PolicyRefresher: cache file does not exist or not readble '/data01/hadoop/yarn/local/user

解决方案

节点属性或运行时高级配置添加:
ndi.spark.spark-conf.spark.transmit.handler.column.handlerNull=true

问题原因

待补充;

作者:qianzhaoyuan