このページは http://www.slideshare.net/adarshrp1/apache-spark-the-analytics-operating-system の内容を掲載しています。
This presentation was delivered by Adarsh Pannu at IBM’s Insight Conference in Nov 2015. For a re...
This presentation was delivered by Adarsh Pannu at IBM’s Insight Conference in Nov 2015. For a recording, visit: https://www.youtube.com/watch?v=Tbm7HIlmwJQ
The presentation provides an overview of Apache Spark, a general-purpose big data processing engine built around speed, ease of use and sophisticated analytics. It enumerates the benefits of incorporating Spark in the enterprise, including how it allows developers to write fully-featured distributed applications ranging from traditional data processing pipelines to complex machine learning. The presentation uses the Airline "On Time" data set to explore various components of the Spark stack.