Apache Spark for Big Data Processing

October 2, 2015

Recorded at SpringOne2GX 2015
Presenters: Ludwine Probst & Ilayaperumal Gopinathan
Big Data Track

Today, we live in a world of Big Data. Hadoop and MapReduce dominate large-scale data processing. However, the MapReduce model shows its limits for several kinds of workloads, especially the highly iterative algorithms frequently encountered in machine learning. Spark is an in-memory data processing framework that, unlike Hadoop MapReduce, supports interactive and real-time analysis of large datasets. Spark also has a more flexible programming model and often performs better than Hadoop MapReduce, particularly on iterative workloads. In this talk, we give an overview of Spark and tour its ecosystem, in particular Spark Streaming and MLlib, with a concrete example. We also show how you can use Spark with Spring XD to take advantage of the strengths of each platform.

