Big Data

Total 16 Posts

Data Extrapolation: Learning From Your Big Data

The first step in answering any Big Data-oriented question is to simply obtain the data.…
Read More


Jun 08,2017

MongoDB and Apache Spark - Getting started tutorial

MongoDB and Apache Spark are two popular Big Data technologies. In my previous post, I…
Read More


May 03,2017

Streaming With Scala: The Nuance of Real-Time Twitter Data

At Ippon Technologies USA, we're lucky enough to have "Coding Dojos" every 2-4 months. Every…
Read More


Mar 08,2017

Pokemon GO: A Big Data Learning Opportunity

Nick Peterson and Justin Risch have begun to study Big Data, Spark, Hadoop, and the…
Read More


Feb 16,2017

Why NiFi?

In this day and age we are living in, it is not a luxury to…
Read More


Jan 26,2017

Kafka Streams - Scaling up or down

Kafka Streams is a new component of the Kafka platform. It is a lightweight library…
Read More


Oct 07,2016

Spark - Calling Scala code from PySpark

In a previous post, I demonstrated how to consume a Kafka topic using Spark in…
Read More


Sep 12,2016

Apache Spark Datasets

With a Spark 2.0 release imminent, the previously experimental Datasets API will be a…
Read More


Jun 15,2016

Spark & Kafka - Achieving zero data-loss

Kafka and Spark Streaming are two technologies that fit well together. Both are distributed systems…
Read More


May 12,2016

A tour of Databricks Community Edition: a hosted Spark service

With the recent announcement of the Community Edition, it’s time to have a look…
Read More


Apr 13,2016

Kafka, Spark and Avro - Part 3 of 3, Producing and consuming Avro messages

This post is the third and last post in a series in which we learn…
Read More


Apr 06,2016

Testing strategy for Spark Streaming - Part 2 of 2

In a previous post, we’ve seen why it’s important to test your Spark…
Read More


Mar 30,2016

Kafka, Spark and Avro - Part 2 of 3, Consuming Kafka messages with Spark

This post is the second post in a series in which we will learn how…
Read More


Mar 23,2016

Kafka, Spark and Avro - Part 1 of 3, Kafka 101

This post is the first in a series of posts in which we will learn…
Read More


Mar 15,2016

Big Data and Spring Core roundup from Spring One 2GX

By Dennis Sharpe and Romain Lheritier. Spring One 2GX 2015 took place in Washington DC…
Read More


Sep 24,2015