Spark Timeseries Analysis
The task was to create a spark app that runs a timeseries algorithm on data.
Our main steps was:
- Getting the data from Cassandra.
- Cleaning the data.
- Building Apache Spark Streaming.
- Calculating the main values: number of likes, comments, reads, shares and the speed of each article.
- Building SVM model for performance prediction.
- Return the statistic results to the Cassandra.