View all newsletters
Receive our newsletter - data, insights and analysis delivered to you
  1. What Is
June 21, 2016updated 07 Feb 2017 3:51pm

What is Hive?

This open source project was originally created by Facebook.

By James Nunns

Hive is a data warehouse infrastructure that is built on top of Hadoop to provide data, querying, and analysis.

As the importance of big data has grown within organisations, so has the amount of
tools that have been made available to store, clean, and process it.

One of those tools is Apache Hive, a data warehouse infrastructure that is built on top of Hadoop to provide data summarisation, query, and analysis.

Originally developed by Facebook, Hive is now used and developed by the Apache community and by companies such as Amazon, which included it in Amazon Elastic MapReduce on Amazon Web Services.

But what jobs is it best suited to?

The main features of Hive are that it supports the analysis of large datasets stored in

Hadoop’s HDFS, as well as compatible file systems such as Amazon S3.

It uses a SQL-like language called HiveQL with schema on read and converts queries to MapReduce, Apache Tex, and Spark jobs.

Content from our partners
Unlocking growth through hybrid cloud: 5 key takeaways
How businesses can safeguard themselves on the cyber frontline
How hackers’ tactics are evolving in an increasingly complex landscape

The open source framework is best used for batch jobs over large sets of append-only data, and is not designed for OLTP workloads, nor does it offer real-time queries or row-level updates.


Websites in our network
Select and enter your corporate email address Tech Monitor's research, insight and analysis examines the frontiers of digital transformation to help tech leaders navigate the future. Our Changelog newsletter delivers our best work to your inbox every week.
  • CIO
  • CTO
  • CISO
  • CSO
  • CFO
  • CDO
  • CEO
  • Architect Founder
  • MD
  • Director
  • Manager
  • Other
Visit our privacy policy for more information about our services, how New Statesman Media Group may use, process and share your personal data, including information on your rights in respect of your personal data and how you can unsubscribe from future marketing communications. Our services are intended for corporate subscribers and you warrant that the email address submitted is your corporate email address.