Spark SQL: Partitions and Sizes

Spark SQL: Partitions and Sizes

Data fetching parallelisation in Apache Spark through Spark SQL / JDBC.
PartitionColumn, lowerBound, upperBound, numPartitions and a lot of other parameters: how do they work?

Integration tests with jena-nosql and Cassandra

Integration tests with jena-nosql and Cassandra

Integration tests with jena-nosql and Cassandra In this post I will illustrate the integration tests infrastructure I used for the jena-nosql project on my GitHub account. The core of the project itself is not associated with a specific storage, so a set of integration tests that run towards a (mini)instance of a not-well-known target storage […]