5 Essential Diagnostic Views to Fix Hive Queries

  • Runaway Resource Issues
  • Runaway Timing Issues
  • Spills that cause outages
  • Historical Query Performance (If the query is recurrent)
  • View of the Execution — Mappers, Reducers, Efficiency of Joins
  • View of Data — Which Tables (Facts Dimensions)
  • Efficiency of Yarn Containers
  • Execution Plan — (Logical and Physical plan)
  • Elapsed duration
  • Data Read from HDFS
  • Data Written to HDFS
  • VCore Utilized
  • Memory Utilized
  • Historical comparison of current run and past runs of the same queries
  • The time when the query was executed
  • Tables in question, joins thereof
  • Mapper and reducer performance, anomalous
  • Data layout on the physical file system, for partition strategy
  • Query plan for quick and easy decision-making.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store