Security is often a forgotten concern in Big Data environments. However, as these technologies are being embraced by companies with sensitive data (think, for example, about banks or insurance companies), security is a growing requirement.
Stratio has just added support for top-k queries to its Lucene-based implementation of Cassandra's secondary indexes. This implementation was originally designed to allow embedded full-text and multivariable search in Apache Cassandra.
Now that the Data Sources API has been released, we wanted to take advantage of its new features, so we have developed a Spark-MongoDB library. With this new connector we help the growing MongoDB community simplify its interaction with this data source via Spark.
In the following tutorial you will learn how to migrate data from MySQL to MongoDB. We will show you how to do it with Spark, step by step, from creating a configuration for the player RDD to the installation guide for the prerequisite components. Easy and intuitive!
Stratio is delighted to announce that it is officially a Certified Spark Distribution. This certification is very important to us because we firmly believe that the certification program provides many benefits to the Spark community.
Spark Infographic: Advantages, activity, evolution of Spark adoption and main headlines.
This paper was presented at the EuroSys 2013 conference and is available for download on the conference website. The paper presents BlinkDB, which, despite its name, is not a database but a query engine on top of Hive and Shark.
The second meetup of our Madrid Cassandra Users group took place on February 12th at Paradigma Tecnológico. The meetup consisted of two talks of 20 minutes each. Talk 1: Introduction to CQL3. Talk 2: A systems approach to the data life cycle.
We will remember 2013 as the year Stratio was born, the brainchild of Oscar Méndez, Nacho Cabrera, Julio Casal and a few others.