DataStax and Databricks Partner to Deliver up to 100X Faster Analytics on Fully Distributed, Highly Scalable Cassandra Database

SANTA CLARA, CA -- (Marketwired) -- 05/08/14 --

Industry-first integration of leading open-source technologies enables companies like Ooyala, Health Market Science, and Pearson Education to deliver highly personalized online customer experiences

By integrating Apache Spark and Apache Cassandra, lightning-fast analytics are now embedded into the transaction processing of the Distributed DBMS

Partnership will deliver open source code back to the Apache Spark and Apache Cassandra communities to ensure that developers always have the most cutting-edge technologies

DataStax, the company that delivers Apache Cassandra to the enterprise, today announced a partnership with Databricks, the company founded by the creators of Apache Spark. As the database industry's first partnership to integrate Spark and Cassandra, DataStax and Databricks will deliver significantly faster analytics to users of both open source technologies and enable today's most progressive businesses to deliver highly personalized online customer experiences.

Transactional Analytics Enable Dynamic Customer Experiences
Apache Cassandra is a fully distributed, highly scalable database that allows users to create online applications that are always on and can process large amounts of data in real time. Originally developed at UC Berkeley's AMPLab, Apache Spark is a processing engine that enables applications in Hadoop clusters to run up to 100X faster in memory, and even 10X faster when running on disk. It also provides SQL, streaming data, machine learning, and graph computation functionality out-of-the-box as first class citizens to simplify building end-to-end analytic workflows. Together, these technologies can significantly boost analytics performance in a transactional database and allow companies to act more quickly when serving customers' needs.

Through this partnership, DataStax and Databricks are driving the operational database industry toward a better approach that allows companies to ingest user data at a very fast rate, and then analyze the results within the same distributed database. Responsiveness to customer needs is critical for successful online businesses, and by decreasing their "time to insights", innovative companies such as video analytics provider Ooyala can create highly personalized experiences for their customers.

"The integration of Spark and Shark with Cassandra is enabling Ooyala to efficiently and effectively store, analyze and process every piece of data powering our industry leading video analytics platform," said Kelvin Chu, compute and data team lead, Ooyala. "With Cassandra as the data store and Spark for data crunching, these new analytic capabilities are making the processing of large data volumes a breeze. Spark on Cassandra is giving us the power to act on things in real-time, which means faster decisions and faster results for our ever-growing business."

Cassandra Community Helps Drive Spark Adoption
The Cassandra community is growing quickly, with global user meetups increasing 400 percent over the past year and Spark serving as a frequent topic of discussion. DataStax employees already contribute the majority of Apache Cassandra open source code, and by working closely with Databricks engineers, will now contribute to the Spark community as well. The partnership will help spread adoption of both technologies while creating greater cohesiveness among users.

"The Cassandra community has rapidly adopted Spark over the past year because it provides significantly faster analytics than Hadoop," said Martin Van Ryswyk, executive vice president, engineering, DataStax. "We look forward to working closely with Databricks to make the best Spark on Cassandra solution available to the Spark community."

"Spark and Cassandra form a natural bond by combining blazing-fast analytics with a high-performance transactional database," said Arsalan Tavakoli-Shiraji, head of business development, Databricks. "Additionally, all of Spark's benefits, including a unified platform that seamlessly integrates SQL, streaming data and advanced analytics, will be natively available to Cassandra users. This is further validation of Spark's emergence as a general Big Data processing engine with broader applications than just existing Hadoop clusters."

Learn More At Spark Summit on June 30
To learn more about how Spark and Cassandra deliver faster analytics in a transactional database system, users can attend Van Ryswyk's presentation at the Spark Summit, held June 30 through July 2 at The Westin St. Francis in San Francisco.

About DataStax
DataStax provides a massively scalable enterprise NoSQL platform to run mission-critical business applications for some of the world's most innovative and data-intensive enterprises. Powered by the open source Apache Cassandra™ database, DataStax delivers a fully distributed, continuously available platform that is faster to deploy and less expensive to maintain than other database platforms.

DataStax has more than 500 customers in 45 countries including leaders such as Netflix, Rackspace, Pearson Education, and Constant Contact, and spans verticals including web, financial services, telecommunications, logistics, and government. Based in Santa Clara, Calif., DataStax is backed by industry-leading investors including Lightspeed Venture Partners, Meritech Capital, and Crosslink Capital. For more information, visit DataStax.com or follow us @DataStax and @DataStaxEU.

About Databricks
Databricks was founded by the creators of Apache Spark and is using cutting-edge technology based on years of research to build next-generation software for analyzing and extracting value from Big Data. The company believes Big Data is a tremendous opportunity that is still largely untapped and is working to revolutionize what enterprises can do with it. Databricks is venture-backed by Andreessen Horowitz.

Media Contact:
Elisa Greene

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.
