After the resounding success of the first article on recommender systems, Álvaro Santos is back with some further insight into creating a recommender system. Coming soon: A follow-up Meetup in Madrid to go even further into this exciting topic. Stay tuned!
In this post we will show how to use the different SQL contexts for data query on Spark. We will begin with Spark SQL and follow up with HiveContext. In addition to this, we will conduct queries on various NoSQL databases and analyze the advantages / disadvantages of using them.
We’re just a couple of days away from the Spanish general elections and Twitter is boiling up with campaign related messages. People want to have a say in what goes on in their country and they turn Twitter to express their opinions and feelings.
This paper highlights how Stratio’s connector for Apache Spark implements the PrunedFilteredScan API instead of the TableScan API which effectively allows you to avoid scanning the entire collection.
Stratio is proud to announce a new realease of the Stratio Big Data platform based on Spark! We have included new features such as: HDFS Connector for Crossdata, HDFS as an option of persistence technologies, UX refactor… Have a look to all the improvements!