Author image

Malcolm Thirus

2 posts LinkedIn
Dirty Data Dancer, Spark Specialist, and an advocator of awesome AWS Applications. My experience ranges from Data Warehousing to Data Management and Data Exploration.

Basics of Apache Nifi: 1

In our previous article on Nifi, we discussed the history, architecture, and features of Apache Nifi. This series will demonstrate the basics of how to create a dataflow within Nifi and the various ways to manipulate the data being ingested. Installation Instructions for downloading and starting Nifi can be found

Apache Spark Datasets

With a Spark 2.0 release imminent, the previously experimental Datasets API will be a core feature. Spark Datasets were introduced in the 1.6 release as a bridge between the Object Oriented type safety of RDDs and the speed and optimization of Dataframes utilizing Spark SQL. Databricks has stated