Analyzing Hive Job Failures

When there is an issue with a Hive job, you can first start by analyzing the job’s logs and results. To analyze a job, use the Analyze tab of the QDS user interface (UI). Perform the following steps:

  1. Each Qubole job/command has a unique ID. Your can search a job using the command_id as depicted below.
../../_images/01_anal_hive_fail-cmd_id.png
  1. Any Hive or shell command job contains the logs in the bottom-right section of the UI. Generally, these logs show the number of MapReduce jobs it is going to start and each MapReduce job has its own link Application UI that opens a new browser tab and displays the job details.
../../_images/02a_anal_hive_fail-logs01.png ../../_images/02a_anal_hive_fail-logs02.png
  1. The Application UI page shows important details as mentioned below:
  • Job Status (Succeeded/Failed/Killed)
  • Total Mapper/Reducer tasks
  • Failed/Killed tasks
  • Counter Link: this table shows very useful parameters, such as S3_bytes_read and file_bytes_written. These counters are very useful for understanding the nature of a job. For example, the counters provide details about how much data is being read, how much data is being written on HDFS or S3, and so on.
../../_images/03_anal_hive_fail-job-tracker-url-pg.png