Introducing The Newly Redesigned Apache HAWQ (incubating)

December 2, 2016

A Technical Primer for Apache HAWQ, The Hadoop Native SQL Database

Overview

Motivated by Hadoop's limitations for interactive analytical queries and by the performance advantages of MPP databases, Pivotal developed HAWQ, a SQL query database that combines the merits of Pivotal Greenplum with Hadoop distributed storage. Pivotal contributed the Pivotal HAWQ core to the Apache Software Foundation (ASF) in September 2015, and it is now officially an incubating project.

The Pivotal HAWQ team has developed the next generation of Apache HAWQ (incubating) to provide greater elasticity for a growing user base and to improve how HAWQ runs on public cloud, private cloud, or shared physical cluster environments. The new generation also lets administrators run HAWQ more efficiently and with better performance, while giving end users the SQL analytics capabilities they require. The result is the leading Hadoop Native SQL Database delivering actionable insights to more people in the enterprise.

This white paper is a technical primer for the latest generation of Apache HAWQ (incubating), covering enhancements such as improved query execution flow, greater elasticity, and improved resource management capabilities.

