Hortonworks Advances Enterprise Hadoop Innovation with Delivery of Hortonworks Data Platform 2.1

Interactive SQL Query via Stinger Initiative and Addition of Data Governance, Security, Stream Processing and Search Capabilities Highlight Release

PALO ALTO, Calif., and AMSTERDAM, Netherlands, April 2, 2014 /PRNewswire/ -- (Hadoop Summit Europe) -- Hortonworks, the leading provider of enterprise Apache™ Hadoop®, today announced availability of Hortonworks Data Platform (HDP) 2.1, a major step forward for the industry's only completely open enterprise Hadoop platform.

Providing comprehensive Hadoop capabilities, HDP 2.1 delivers the enterprise functionality required for data management, data access, data governance, integration, security and operations, developed and delivered completely in the open. Incorporating the very latest community innovations across all Apache Hadoop projects, HDP 2.1 provides the foundational platform for organizations looking to incorporate Hadoop into a modern data architecture.

"Hortonworks remains committed to innovating and delivering a certified, stable, and completely open source enterprise Hadoop platform comprised of the most recent Apache project releases," said Shaun Connolly, vice president of corporate strategy, Hortonworks. "This HDP 2.1 release delivers a comprehensive set of enterprise Hadoop capabilities that span data management, data access, data governance, security, and operations, and our completely open source approach ensures our customers can confidently deploy a Hadoop platform that is not only built for the modern data architectures of tomorrow, but also deeply integrates with existing datacenter technologies."

Hortonworks Data Platform 2.1 release highlights include:

1. Interactive SQL Query in Hadoop

HDP 2.1 includes the delivery of Apache Hive 0.13, which completes the final phase of the Stinger Initiative, a broad community effort to deliver interactive SQL query at petabyte scale in Hadoop. Over 13 months, the initiative added more than 390,000 lines of code to Apache Hive, contributed by 145 developers from over 45 organizations including Microsoft, Teradata and SAP, and it stands as a testament to the power of community-driven open source.

With the release of Apache Hive 0.13, the Stinger Initiative has delivered on its promise of:

• Speed: 100X increase in SQL query performance
• Scale: interactive query at petabyte scale and across a wide range of complex queries and joins
• SQL: expanded range of SQL semantics for analytic applications running in Hadoop
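As a rough illustration of the interactive query workload Hive 0.13 targets, the sketch below submits a SQL query to HiveServer2 from Python using the PyHive client. The host, port, user and table name are hypothetical placeholders, and the snippet is a minimal sketch rather than an official Hortonworks example.

# Minimal sketch: run an interactive SQL query against Hive via HiveServer2.
# Assumes the PyHive client library is installed and HiveServer2 is reachable;
# the host, port, user and "web_logs" table are hypothetical placeholders.
from pyhive import hive

conn = hive.Connection(host="hiveserver2.example.com", port=10000,
                       username="analyst", database="default")
cur = conn.cursor()

# An aggregate query of the kind the Stinger Initiative aimed to make interactive.
cur.execute("""
    SELECT status, COUNT(*) AS hits
    FROM web_logs
    WHERE log_date = '2014-04-01'
    GROUP BY status
    ORDER BY hits DESC
""")

for status, hits in cur.fetchall():
    print(status, hits)

cur.close()
conn.close()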




2. New Enterprise Capabilities for Data Governance and Security

HDP 2.1 delivers on two key enterprise capabilities:

• Apache Falcon delivers a critical data processing framework for governing and orchestrating data flows in and around Hadoop
• Apache Knox extends perimeter security for Hadoop and integrates with existing frameworks such as LDAP and Active Directory to provide credential management for Hadoop
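To make the perimeter-security model concrete, the sketch below lists an HDFS directory through an Apache Knox gateway from Python with the requests library, authenticating with LDAP credentials that Knox validates at the cluster edge. The gateway host, topology name ("default") and credentials are assumptions made for illustration, not values mandated by HDP 2.1.

# Minimal sketch: reach WebHDFS through an Apache Knox gateway.
# Knox terminates the request at the cluster perimeter and validates the
# LDAP/Active Directory credentials before proxying it into the cluster.
# The gateway host, port, topology name and credentials are placeholders.
import requests

GATEWAY = "https://knox.example.com:8443/gateway/default"

resp = requests.get(
    GATEWAY + "/webhdfs/v1/tmp",
    params={"op": "LISTSTATUS"},
    auth=("ldap_user", "ldap_password"),  # checked by Knox against LDAP/AD
    verify=False,  # demo only; trust the gateway's certificate in practice
)
resp.raise_for_status()

for entry in resp.json()["FileStatuses"]["FileStatus"]:
    print(entry["type"], entry["pathSuffix"])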

3. New Processing Engines for Streaming and Search

HDP 2.1 includes two new processing engines:

• Apache Storm provides real-time event processing for analyzing sensor data and business activity monitoring streams
• Apache Solr extends Hadoop and HDP with a powerful foundation for advanced search applications, delivering high-performance indexing and sub-second search times over billions of documents
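To give a concrete sense of the search capability, the sketch below queries a Solr collection over its standard HTTP API from Python. The Solr host, port, collection and field names are hypothetical placeholders; a Storm topology, being a JVM program, is not shown here.

# Minimal sketch: query an Apache Solr collection over its standard HTTP API.
# The host, port, "documents" collection and "body" field are placeholders.
import requests

SOLR = "http://solr.example.com:8983/solr/documents"

resp = requests.get(SOLR + "/select", params={
    "q": "body:hadoop",  # full-text match on an indexed field
    "rows": 10,          # return the top 10 hits
    "wt": "json",        # request a JSON response
})
resp.raise_for_status()

results = resp.json()["response"]
print("matches:", results["numFound"])
for doc in results["docs"]:
    print(doc.get("id"))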

4. Enhanced Management and Operations Capabilities

HDP 2.1 includes the very latest version of Apache Ambari, which now supports the new data access engines and provides stack extensibility, rolling restarts and other significant operational improvements.
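Beyond its web UI, Ambari also exposes cluster state over a REST API, which is one way these operational capabilities surface to administrators and tooling. The sketch below lists a cluster's services from Python; the Ambari host, cluster name and credentials are placeholders for illustration.

# Minimal sketch: read cluster service state from the Apache Ambari REST API.
# The Ambari host, port, cluster name and credentials are placeholders.
import requests

AMBARI = "http://ambari.example.com:8080/api/v1"

resp = requests.get(
    AMBARI + "/clusters/mycluster/services",
    auth=("admin", "admin"),
    headers={"X-Requested-By": "ambari"},  # required by Ambari on write calls; harmless here
)
resp.raise_for_status()

for item in resp.json()["items"]:
    print(item["ServiceInfo"]["service_name"])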

HDP 2.1 is supported by a broad ecosystem of customers and partners, including the following strategic partners:

Microsoft

"Together with Hortonworks, we're contributing to Hadoop by making it faster, secure and more accessible to the enterprise. The availability of HDP 2.1 for Windows will enable enterprises to quickly and easily analyze and manage their Hadoop data to gain valuable business insights."– Eron Kelly, General Manager, SQL Server Product Marketing, Microsoft

Red Hat

"Red Hat believes a comprehensive open source community approach to the Hadoop platform can address the growing requirements of key enterprise stakeholders and their big data initiatives. Our strategic alliance with Hortonworks is focused on helping customers with efficiency and agility as they embark on big data projects. With today's delivery of Hortonworks HDP 2.1 platform, enterprises can benefit from modularity and cost effectiveness across the core of enterprise Hadoop." – Greg Kleiman, director of strategy, Storage and Big Data, Red Hat

Teradata

"Data lakes are fast becoming a key component in the most effective data architectures. Hortonworks HDP 2.1 enables extremely differentiated data lake deployments." – Scott Gnau, president, Teradata Labs

HDP 2.1 is the first enterprise Hadoop platform to include comprehensive innovation across Hadoop 2 and all the related Apache projects, many of which had significant GA community releases within the last few weeks. A complete list of HDP features and enhancements can be found at: http://hortonworks.com/products/hdp/

Availability

The HDP 2.1 Technical Preview is available now at http://hortonworks.com/products/hdp, and the full Linux and Microsoft Windows releases will be available by the end of April 2014.

Further Reading


Enterprise Hadoop: www.hortonworks.com/hadoop

Hadoop and a Modern Data Architecture: www.hortonworks.com/mda

About Hortonworks

Hortonworks develops, distributes and supports the only 100% open source Apache Hadoop data platform. Our team comprises the largest contingent of builders and architects within the Hadoop ecosystem who represent and lead the broader enterprise requirements within these communities.

The Hortonworks Data Platform provides an open platform that deeply integrates with existing IT investments and upon which enterprises can build and deploy Hadoop-based applications.

Hortonworks has deep relationships with the key strategic data center partners that enable our customers to unlock the broadest opportunities from Hadoop.

For more information, visit www.hortonworks.com

For Additional Information Contact:

Mike Haro
Hortonworks
(408) 438-8628
[email protected]

Keith Giannini or Anna Vaverka
MSLGROUP
[email protected] 
Tel: (415) 817-2500

Logo - http://photos.prnewswire.com/prnh/20140227/SF73721LOGO

SOURCE Hortonworks
