このページは http://www.slideshare.net/QAware/adersberger-timeseriesanalysisspark160511233915 の内容を掲載しています。
Apache Big Data Conference 2016, Vancouver BC: Talk by Josef Adersberger (@adersberger, CTO at QA...
Apache Big Data Conference 2016, Vancouver BC: Talk by Josef Adersberger (@adersberger, CTO at QAware).
Abstract: A lot of data is best represented as time series: Operational data, financial data and even in general-purpose DWHs the dominant dimension is time. The area of time series databases is growing rapidly, but the support in Spark to process and analyze time series data is still in the early stages. We present Chronix Spark which provides a mature TimeSeriesRDD implementation for fast retrieval and complex analysis of time series data. Chronix Spark is open source software and battle-proved at a big German car manufacturer and a German telco company. We show how we have used Chronix Spark in a real-life project and provide some benchmarks how it has outperformed common time series databases like OpenTSDB, KairosDB and InfluxDB. We lift the curtain and deep-dive into the internals how we have achieved this.