Total 6 Posts

Data Streaming

Confluent & Twitter4j Tutorial

Reading a Real-Time stream of Tweets into Kafka Kafka is an amazing tool for processing…

Read More


Jun 07, 2019 6 min read

Justin Risch

tutorial

Transient Cluster on AWS

This post demonstrates a cost-effective and automated solution for running Spark-Jobs on the EMR cluster on a daily basis using CloudWatch, Lambda, EMR, S3, and SNS.…

Read More


Jun 03, 2019 6 min read

Sripriya Rajanna

Apache Spark

Basics of Apache Nifi: 2

On our previous video on the basics of Nifi, we covered a brief definition of…

Read More


Nov 15, 2017 1 min read

Malcolm Thirus

Big Data

Performance Tweaking Apache Spark

Apache Spark Streaming applications need to be monitored frequently to be certain that they are…

Read More


Jun 26, 2017 5 min read

Jeannine Stark

Data Streaming

Basics of Apache Nifi: 1

In our previous article on Nifi, we discussed the history, architecture, and features of Apache…

Read More


Apr 25, 2017 1 min read

Malcolm Thirus

Big Data

Why NiFi?

In this day and age we are living in, it is not a luxury to…

Read More


Jan 26, 2017 4 min read

Doug Mengistu

Big Data