このページは http://www.slideshare.net/QAware/adersberger-timeseriesanalysisspark160511233915 の内容を掲載しています。
Chronix Spark - a framework for time series processing with Apache Spark. Presentation by @adersb...
Chronix Spark - a framework for time series processing with Apache Spark. Presentation by @adersberger at the Apache Big Data Conference, North America, 2016, Vancouver BC.
A lot of data is best represented as time series: Operational data, financial data and even in general-purpose DWHs the dominant dimension is time. The area of time series databases is growing rapidly but the support in Spark to process and analyze time series data is still in the early stages. We present Chronix Spark which provides a mature TimeSeriesRDD implementation for fast retrieval and complex analysis of time series data. Chronix Spark is open source software and battle-proved at a big german car manufacturer and a german telco. We show how we‘ve used Chronix Spark in a real-life project and provide some benchmarks how it has outperformed common time series databases like OpenTSDB, KairosDB and InfluxDB. We lift the curtain and deep-dive into the internals how we‘ve achieved this.