任务启动时异常:Caused by: org.apache.hadoop.yarn.exceptions.YarnException: Failed to submit application_XXXXXXXXXXXXXX_0397 to YARN : org.apache.hadoop.security.AccessControlException: Queue root.sloth.default already has 30 applications, cannot accept submission of application: application_XXXXXXXXXXXXXX_0397

问题描述/异常栈

 The program finished with the following exception:

org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: Could not deploy Yarn job cluster.
    at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:335)
    at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:205)
    at org.apache.flink.client.ClientUtils.executeProgram(ClientUtils.java:139)
    at org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:682)
    at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:231)
    at org.apache.flink.client.cli.CliFrontend.parseParameters(CliFrontend.java:913)
    at org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:986)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1754)
    at org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
    at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:986)
Caused by: org.apache.flink.client.deployment.ClusterDeploymentException: Could not deploy Yarn job cluster.
    at org.apache.flink.yarn.YarnClusterDescriptor.deployJobCluster(YarnClusterDescriptor.java:398)
    at org.apache.flink.client.deployment.executors.AbstractJobClusterExecutor.execute(AbstractJobClusterExecutor.java:70)
    at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.executeAsync(StreamExecutionEnvironment.java:1733)
    at org.apache.flink.streaming.api.environment.StreamContextEnvironment.executeAsync(StreamContextEnvironment.java:94)
    at org.apache.flink.streaming.api.environment.StreamContextEnvironment.execute(StreamContextEnvironment.java:63)
    at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1620)
    at org.apache.flink.table.planner.delegation.StreamExecutor.execute(StreamExecutor.java:42)
    at org.apache.flink.table.api.internal.TableEnvironmentImpl.execute(TableEnvironmentImpl.java:643)
    at com.netease.sloth.flink.sql.planner.SlothJobExecutor.execute(SlothJobExecutor.java:117)
    at com.netease.sloth.flink.sql.executor.BaseFlinkNotebookExecutor.execute(BaseFlinkNotebookExecutor.java:38)
    at com.netease.sloth.flink.sql.NotebookEntryPoint.main(NotebookEntryPoint.java:47)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:321)
    ... 11 more
Caused by: org.apache.hadoop.yarn.exceptions.YarnException: Failed to submit application_XXXXXXXXXXXXXX_0397 to YARN : org.apache.hadoop.security.AccessControlException: Queue root.sloth.default already has 30 applications, cannot accept submission of application: application_XXXXXXXXXXXXXX_0397
    at org.apache.hadoop.yarn.client.api.impl.YarnClientImpl.submitApplication(YarnClientImpl.java:271)
    at org.apache.flink.yarn.YarnClusterDescriptor.startAppMaster(YarnClusterDescriptor.java:1003)
    at org.apache.flink.yarn.YarnClusterDescriptor.deployInternal(YarnClusterDescriptor.java:489)
    at org.apache.flink.yarn.YarnClusterDescriptor.deployJobCluster(YarnClusterDescriptor.java:391)
    ... 26 more

解决方案

停止测试任务释放资源或扩大队列运行App数量限制。

问题原因

队列运行App数量达到限额。

作者:邓崃翔