Hive Job not running
NickName:knowledge.gatherer.007 Ask DateTime:2014-09-21T18:03:19

Hive Job not running

Whenever I am running a hive query - select * from table. It is running fine. But when i m trying to run a MR job for query - select count(*) from table. It is failing with the following error -

Total MapReduce jobs = 1
Launching Job 1 out of 1
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapred.reduce.tasks=<number>
java.io.IOException: cannot find dir = hdfs://hostname:9000/tmp in pathToPartitionInfo: [hdfs://hostname:9000/data1/]
        at org.apache.hadoop.hive.ql.io.HiveFileFormatUtils.getPartitionDescFromPathRecursively(HiveFileFormatUtils.java:298)
        at org.apache.hadoop.hive.ql.io.HiveFileFormatUtils.getPartitionDescFromPathRecursively(HiveFileFormatUtils.java:260)
        at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:293)
        at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:1051)
        at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:1043)
        at org.apache.hadoop.mapred.JobClient.access$700(JobClient.java:179)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:959)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:912)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:416)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1149)
        at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:912)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:886)
        at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:447)
        at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:138)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:144)
        at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
        at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1355)
        at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1139)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:945)
        at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:259)
        at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:216)
        at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:413)
        at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:756)
        at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:614)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:616)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Job Submission failed with exception 'java.io.IOException(cannot find dir = hdfs://hostname:9000/tmp in pathToPartitionInfo: [hdfs://hostname:9000/data1/])'
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask

Copyright Notice:Content Author:「knowledge.gatherer.007」,Reproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/25958041/hive-job-not-running

More about “Hive Job not running” related questions

Hive Job not running

Whenever I am running a hive query - select * from table. It is running fine. But when i m trying to run a MR job for query - select count(*) from table. It is failing with the following error - ...

Show Detail

How can I kill the running hive job using java?

I am using HiveStatement in jdbc to execute the hive jobs.I can get the running logs through the HiveStatement object ,and I could get the kill command in the logs. Now I want to kill the running ...

Show Detail

always Hive Job running in-process local Hadoop

When I set this property in hive-site.xml &lt;property&gt; &lt;name&gt;hive.exec.mode.local.auto&lt;/name&gt; &lt;value&gt;false&lt;/value&gt; &lt;/property&gt; Hive always

Show Detail

Get user running HIVE job?

I am running a HIVE job with my UDF extended class. I want to get the name of the user that submits my job, so i am using System.getProperty("user.name"); to get the username but the problem is ...

Show Detail

Hive job running forever on reduce phase

I am running a query on 60 GB dataset in Hive. When i fire a query 270 mappers will finish within 15 Mins, but when it comes to reducer state to complete 0.01% , its taking minimum 45 mins to 1hr. ...

Show Detail

How to change Tez job name when running query in HIVE

When I submit a Hive SQL using Tez like below: hive (default)&gt; select count(*) from simple_data; In Resource Manager UI the job name shows something like HIVE-9d1906a2-25dd-4a7c-9ea3-bf651036c...

Show Detail

error while running hive mapreduce job

when I trying to run hive commands which involve mapreduce, I am getting the following error, please help me solve this: hive (hiveclass)&gt; create table companies2 as select * from companies; WA...

Show Detail

Mapreduce job not launching when running hive query with where clause

I am using apache-hive-1.2.2 on Hadoop 2.6.0. When am running a hive query with where clause it is giving results immediately without launching any MapReduce job. I'm not sure what is happening. Ta...

Show Detail

HIve query stops in between when running MapReduce job

I am running a single node cluster on Ubuntu 16.04. Hadoop version is 2.7.2 and Hive 1.2.1. When I run hive queries which involve map-reduce job like partitioning or bucketing etc. It shows progre...

Show Detail

hive query in Job tracker

Hi we are running hive queries in CDH 4 environment to which we recently upgraded. One thing I notice is that earlier in CDH 3 we were able to track our queries in Job tracker. The link similar to "

Show Detail