Hortonworks Launches Data Platform 2.0, Built on Apache Hadoop 2 GA

Hortonworks, contributor to and provider of enterprise Apache Hadoop, has announced the general availability of Hortonworks Data Platform (HDP) 2.0. The platform delivers the YARN-based architecture of Hadoop 2, phase 2 of the Stinger initiative and includes the very latest innovations from the broader Hadoop ecosystem in a single integrated and tested platform appropriate for mainstream enterprise use.

As a commercial Apache Hadoop distribution, HDP could be an option for the use in cloud hosting environments. The platform has received quality assurance and certification for building private clouds based on Rackspace Private Cloud Software, for example.

As an enterprise Hadoop platform, HDP 2.0 includes the latest enterprise features delivered in Hadoop 2 and all the related Apache projects, many of which had significant GA community releases within the last few weeks. The key projects in HDP include:

  • Apache Hadoop 2.2.0
  • Apache Hive 0.12.0
  • Apache HCatalog
  • Apache Pig 0.12.0
  • Apache HBase 0.96
  • Apache Ambari 1.4.1
  • Apache ZooKeeper 3.4.5
  • Apache Oozie 4.0.0
  • Apache Sqoop 1.4.4
  • Apache Flume 1.4.0
  • Apache Mahout 0.8.0

With Hadoop 2, Apache Hadoop YARN serves as the Hadoop operating system, and takes Hadoop beyond simply a single-use data platform for batch processing to a multi-use platform that enables batch, interactive, online and stream processing. By acting as the primary resource manager and mediator of access to data stored in HDFS, YARN enables enterprises to store data in a single place and interact with it in multiple ways simultaneously and with consistent levels of service.

SQL Server

The Stinger Initiative was launched at the beginning of 2013 as a broad community-based effort to enhance the speed, scale and breadth of SQL semantics supported by Apache Hive. By including the recently released Hive 0.12 which is the culmination of phase 2 of the Stinger Initiative, HDP 2.0 would represent a significant step forward for Hive, the de-facto standard for SQL access in Hadoop today and the only SQL interface designed for queries that scale from gigabytes to petabytes. Microsoft has been a critical partner in the development of HDP 2.0 and has contributed more than 6,000 engineering hours across various Apache projects, as well as porting HDP 2.0 to Windows, which will be available next month.

“The YARN based architecture of HDP 2.0 delivers on our mission to enable the modern data architecture by providing an enterprise Hadoop platform that deeply integrates with existing and future data center technologies,” said Shaun Connolly, vice president of corporate strategy, Hortonworks. “Hortonworks remains committed to delivering a tested, stable, and 100-percent open source Hadoop distribution of the most recent Apache project releases. Our approach ensures that HDP always includes the most proven community-driven innovations that are driving the enterprise deployments shaping the data architectures of tomorrow.

“We are excited to see the next generation of Hadoop delivered with HDP 2.0,” said Eron Kelly, General Manager, SQL Server at Microsoft. “The new YARN architecture is designed to give the Windows community the flexibility to run multiple application types in Hadoop, all sharing a common resource management infrastructure. We are proud to have had the opportunity to contribute significantly to key Apache projects with Hortonworks such as Stinger and Hive to make this a reality. ”

Availability

The Hortonworks Data Platform 2.0 is available now for download from the Hortonworks website at http://hortonworks.com/products/hdp/. HDP 2.0 for Windows will be available in the coming weeks.