Product Archives

Product 7 November, 2016

Stratio Crossdata vs Presto

Nowadays, there are a lot of Big Data query engines available. Some companies struggle to choose which one to use. Benchmarks exist, but results can be contradictory and thus difficult to trust.

Product 24 October, 2016

Creating a Recommender System (Part I)

This two-article series explains how to design and implement a hybrid recommender system that works just like the ones used by Amazon or Ebay.

Product 19 October, 2016

The Developer’s Guide to Scala Implicit Values (Part II)

Conway’s Game of Life: You could hardly imagine a simpler set of rules to code on your computer and you wouldn’t expect any interesting result at all, but… behold the wonders of its hidden might!

Product 29 August, 2016

Continuous delivery in depth #1

In this first issue, we will follow how pipelines are being used at Stratio Big Data to achieve full lifecycle traceability, from the development team to a final productive environment.

Product 23 May, 2016

The Developer’s Guide to Scala Implicit Values (Part I)

Implicit parameters and conversions are powerful tools in Scala increasingly used to develop concise, versatile tools such as DSLs, APIs, libraries… When used correctly, they reduce the verbosity of Scala programs thus providing easy to read code.

Product 10 May, 2016

Benchmarking Machine learning prediction models

When surfing the internet, it is quite easy to find sites comparing the most popular Machine learning toolkits. These sites give you a lot of information about the strengths and weaknesses of the libraries, how they work and some examples to compare how easy it is to use these types of tools.

Product 13 April, 2016

Using Spark SQLContext, HiveContext & Spark Dataframes API with ElasticSearch, MongoDB & Cassandra

In this post we will show how to use the different SQL contexts for data query on Spark. We will begin with Spark SQL and follow up with HiveContext. In addition to this, we will conduct queries on various NoSQL databases and analyze the advantages / disadvantages of using them.

Product 18 February, 2016

How to aggregate Data in Real-Time with Stratio Sparta

When working with Big Data, it’s frequent to have the need to aggregate data in real-time, whether it comes from a specific service, such as social networks (Twitter, Facebook…) or even from more diverse sources, like a weather station.

Product 7 August, 2015

Variance in Scala (“Luke, he is your father too”)

When working with Big Data, sometimes it’s useful to remember that powerful products wouldn’t work properly without the tools that build them.

Product 27 July, 2015

Stratio’s Lucene-based index for Cassandra is now a plugin

Thanks to the changes proposed at CASSANDRA-8717, CASSANDRA-7575 and CASSANDRA-6480, Stratio is glad to present its Lucene-based implementation of Cassandra secondary indexes as a plugin that can be attached to the Apache distribution.

Product

Stratio Crossdata vs Presto

Creating a Recommender System (Part I)

The Developer’s Guide to Scala Implicit Values (Part II)

Continuous delivery in depth #1

The Developer’s Guide to Scala Implicit Values (Part I)

Benchmarking Machine learning prediction models

Using Spark SQLContext, HiveContext & Spark Dataframes API with ElasticSearch, MongoDB & Cassandra

How to aggregate Data in Real-Time with Stratio Sparta

Variance in Scala (“Luke, he is your father too”)

Stratio’s Lucene-based index for Cassandra is now a plugin

Product

Solutions

Use case

Partners

About us

Social