From Cassandra to S3, with Spark

Apache Cassandra, a scalable and high-availability platform, is a good choice for high volume event management applications, such as large deployments of sensors. Applications include telematics data for large fleets, smart meter telemetry in electric, .. READ MORE

The Human Side of Code Reviews

Code Reviews are one of the most effective practices for keeping code quality high. They help catch bugs early, keep best practices for code style/quality, and share knowledge with co-workers. Since code reviews involve multiple .. READ MORE

Implementing a Google DataFlow Pipeline

Stream processing frameworks are quickly gaining traction as an efficient method to analyze, decorate, and direct high volume data.  The power of stream processing comes from the idea that we can perform calculations on data as .. READ MORE

Analyzing Kafka data streams with Spark

This blog describes a Spark Streaming application which consumes event data from a Kafka topic to provide continuous, near real-time processing and analysis of the event data stream. The demonstration application, written in Java 8 .. READ MORE