Total 51 Posts

Data

Saving and Analyzing Trending Topics on Twitter using AWS Athena, Lambda, and CDK

With more than 300 million active users, Twitter is still one of the more optimal…

Aug 11, 2020 5 min read

Theo LEBRUN

Twitter

Starting with AWS Glue and Querying S3 from Athena

Part one of three in a deep dive of ETL in AWS Glue. Learn how to create powerful low-code/no-code ETL processes from S3 to many data sources in AWS.…

Jul 28, 2020 8 min read

Sam Portillo

AWS

ABC's of DQM: Audit

This post is part of a series of posts on Data Quality Management. The focus…

Jun 26, 2020 8 min read

Dan Ferguson

Data

Exploring Snowsight: Snowflake's Replacement for SQL Worksheets

In June of 2020, Snowflake announced Snowsight: the upcoming replacement for SQL Worksheets and is…

Jun 26, 2020 5 min read

Sam Portillo

ETL

Apache Spark 3.0

Databricks recently announced the release of Apache Spark 3.0 [https://databricks.com/blog/2020/…

Jun 23, 2020 3 min read

Theo LEBRUN

Apache Spark

An Introduction to Data Quality Management (DQM)

What is Data Quality Management (DQM)? Data Quality Management (DQM) is a practice that aims…

Jun 19, 2020 4 min read

Pooja Krishnan

Data

Snowflake External Functions

Snowflake is a cloud-based data warehousing company. They specialize in provisioning on-demand compute and elastic…

Jun 16, 2020 7 min read

Dan Ferguson

Cloud

Build an event sourcing system on AWS using DynamoDB and CDK

Over the past few years, event sourcing has become a popular pattern used in modern…

May 05, 2020 5 min read

Theo LEBRUN

AWS

Solace and Healthcare, a HIPAA Compliant Message Bus

What is Solace? Solace are the makers of the Solace PubSub+ Platform [https://solace.com/…

Mar 18, 2020 9 min read

Dan Ferguson

Data

Audit your data with JaVers

As an IT consultant, the first requirement that comes to mind when you are working…

Feb 20, 2020 5 min read

Amine Ouali Alami, Pooja Krishnan

JHipster

Creating an event-driven jHipster Application with Solace

Today, the modern web-application is event driven. In order to be an event-driven application, you…

Dec 18, 2019 7 min read

Dan Ferguson

Data

Innovative Snowflake Features Part 2: Caching

In the previous blog in this series Innovative Snowflake Features Part 1: Architecture [https://blog.…

Aug 21, 2019 3 min read

Pooja Krishnan

Data

Innovative Snowflake Features Part 1: Architecture

Earlier this year, Ippon published an Introduction to Snowflake [https://blog.ippon.tech/introduction-to-snowflake/] post…

Aug 08, 2019 6 min read

Pooja Krishnan

Data

EBS vs Instance-Store for Cassandra on AWS

There seem to be endless options for deploying Cassandra clusters to Amazon Web Services. As…

Jun 13, 2019 5 min read

Aaron Throckmorton

AWS

Confluent & Twitter4j Tutorial

Reading a Real-Time stream of Tweets into Kafka. Kafka is an amazing tool for processing…

Jun 07, 2019 6 min read

Justin Risch

Data