Cloudera Announces New Apache Spark Training Course for Big Data Developers

Hands-On Course Prepares Developers to Write Sophisticated Parallel Applications for Faster Time-to-Insight and Stream Processing, Applied to a Wide Variety of Use Cases, Architectures, and Industries

PALO ALTO, CA -- (Marketwired) -- 07/16/14 -- Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced the first hands-on Apache Spark training course that will enrich developers' experience with this groundbreaking new processing engine. The three-day course, called Cloudera Developer Training for Apache Spark, will prepare developers and software engineers to build complete, unified applications that combine batch, streaming, and interactive analytics on all of their data. With Cloudera Developer Training for Apache Spark, data professionals can use this next-generation framework's speed, ease of use, and advanced analytics to enable faster business decisions and better user outcomes.

Spark is an open source data analytics framework, originally developed in the AMPLab at the University of California, Berkeley, that complements Hadoop as part of an enterprise data hub. Broadly embraced by the open source community, Big Data vendors, and data-intensive enterprises for its stream processing capabilities and its support for complex, iterative algorithms, Spark offers performance gains that enable applications to run on the data in a Hadoop cluster at speeds up to 100 times faster than traditional MapReduce programs. Cloudera was also the first company to offer commercial support for Spark as part of a Cloudera Enterprise subscription and recently announced a collaboration with Databricks, IBM, Intel, and MapR to broaden support for Spark as the standard data processing engine for the Hadoop ecosystem.

Through instructor-led discussions and interactive, hands-on exercises, participants will dive deep into the technical applications of Spark to understand how it relates to the rest of the Hadoop ecosystem and write sophisticated parallel applications. Developers will learn real-world best practices drawn from Cloudera's work with Spark on some of the largest clusters in development and production:

  • Using the Spark shell for interactive data analysis
  • The features of Spark's Resilient Distributed Datasets
  • How Spark runs on a cluster
  • Parallel programming with Spark
  • Writing Spark applications
  • Processing streaming data with Spark

"Spark offers clear benefits for realizing sophisticated analytics and is quickly becoming the future of data processing on Hadoop," said Sarah Sproehnle, vice president, Education Services, Cloudera. "With Spark, customers can realize immediate business advantages. For example, Spark Streaming enables businesses to process live data as it arrives in the enterprise data hub, rather than having to wait to batch-process it later. The fact that the same codebase can be used for streaming data and data-at-rest significantly reduces development time for Big Data applications, speeding up time-to-insight by several orders of magnitude and decreasing the need for expensive specialized systems. This is just one case where the benefits of Spark have a direct impact on a company's bottom line."

Cloudera offers a wide variety of courses to prepare developers to work with all aspects of Big Data. Cloudera Developer Training for Apache Spark offers developers a chance to experience the dramatic data processing improvements Spark delivers and build their expertise with one of the most relevant tools in an enterprise data hub. To learn more about this new course offering:

What developers are saying about Cloudera Developer Training for Apache Spark:
"The presentation format of all Cloudera training courses is always very clear and progressive. By building on previous concepts with each new course, Cloudera University makes Spark Developer Training an important step in a developer's learning path. The labs were extremely relevant to everyday Big Data challenges and went beyond the typical introductory exercises I have seen elsewhere. The classroom discussion led by the Cloudera instructor was invaluable, as each student came with different use cases and levels of knowledge. The course effectively reinforces the importance of learning Spark Streaming and the Lambda Architecture for combining batch and streaming workloads within a single environment. After seeing how other participants responded to the presentation format of Cloudera's Spark Developer Training course, I've actually changed the way I'm going to present the fundamental concepts in my book, Spark in Action."
-- Chris Fregly, author of Spark in Action

About Cloudera
Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Only Cloudera offers everything needed on a journey to an enterprise data hub, including software for business-critical data challenges such as storage, access, management, analysis, security and search. As the leading educator of Hadoop professionals, Cloudera has trained over 22,000 individuals worldwide. Over 1,000 partners and a seasoned professional services team help deliver faster time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production. www.cloudera.com

Connect with Cloudera
Read our blogs: http://www.cloudera.com/blog/ and http://vision.cloudera.com/
Follow Cloudera on Twitter: http://twitter.com/cloudera
Follow Cloudera University on Twitter: http://twitter.com/ClouderaU
Visit us on Facebook: http://www.facebook.com/cloudera

Cloudera, Cloudera's Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition and CDH are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.
