Total 6 Posts

Data Streaming

Confluent & Twitter4j Tutorial

Reading a Real-Time Stream of Tweets into Kafka

Kafka is an amazing tool for processing…

Jun 07, 2019

Justin Risch

tutorial

Transient Cluster on AWS

This post demonstrates a cost-effective, automated solution for running Spark jobs daily on an EMR cluster using CloudWatch, Lambda, EMR, S3, and SNS.…

Basics of Apache Nifi: 2

In our previous video on the basics of NiFi, we covered a brief definition of…

Nov 15, 2017

Malcolm Thirus

Big Data

Performance Tweaking Apache Spark

Apache Spark Streaming applications need to be monitored frequently to be certain that they are…

Basics of Apache Nifi: 1

In our previous article on NiFi, we discussed the history, architecture, and features of Apache…

Apr 25, 2017

Malcolm Thirus

Big Data

Why NiFi?

In this day and age, it is not a luxury to…

Jan 26, 2017

Doug Mengistu

Big Data