This paper highlights how Stratio’s connector for Apache Spark implements the PrunedFilteredScan API instead of the TableScan API which effectively allows you to avoid scanning the entire collection.
When we first started using Spark, we were twenty people. Twenty Stratians. We took a risk and adopted Spark very early on, but with a lot of teamwork and a lot of mistakes, we managed to create the first pure Spark platform.
It’s been almost two months since we introduced Stratio Sparkta at Strata London 2015, showing a demo for real-time insights on twitter hashtags. During this time we added some new features to the real-time aggregation engine based on Spark Streaming.
We have done it again! Stratio is proud to announce a new realease of the Stratio Big Data product! Among the new features, we are proud to highlight: SparkSQL Connector for Crossdata, the possibility to add new services in installed nodes, multi user managment… Discover them all!
Last March we begun the first Stratio Challenge. After many deliberations, the wait is over and we are proud to announce that the Stratio Challenge winners are Marco Piva, Leonardo Biagioli, Fabio Fantoni and Andrea De Marco, from BitBang.