Total 43 Posts

Data

A Beginner’s Guide to InfluxDB: A Time-Series Database

A time series database (TSDB) is specifically made for data that can be evaluated as…

Read More


Jun 29, 2021 4 min read

Ketki V Deshpande

Data

Data Hackathon Recap

Is the Holiday Spirit Contagious? During Ippon's first Data Hackathon in December 2020, the Data…

Read More


Feb 12, 2021 3 min read

Ramya Shetty

Data

Process CSVs from Amazon S3 using Apache Flink, JHipster, and Kubernetes

Apache Flink is one of the latest distributed Big Data frameworks with a goal of…

Read More


Feb 04, 2021 6 min read

Theo LEBRUN

Data Streaming

Use Stargate by DataStax to effortlessly store and query your data

Stargate is one of the latest shiny tools from DataStax that will act as a…

Read More


Jan 15, 2021 5 min read

Theo LEBRUN

Cassandra

Tips and Tricks for Manually Scaling a Global DynamoDB Table from an AWS Lambda

Objective Write an AWS Lambda that manually scales a global DynamoDB table Why? DynamoDB tables…

Read More


Dec 01, 2020 3 min read

Dennis Sharpe

AWS

ABC's of DQM: Control

This is the finale of a 3-part series introducing a Data Quality Management (DQM) framework…

Read More


Aug 31, 2020 8 min read

Dan Ferguson

Data

The ABCs of DQM: Balance

This blog is a part of a series of posts on Data Quality Management. The…

Read More


Aug 18, 2020 5 min read

Pooja Krishnan

Data

Saving and Analyzing Trending Topics on Twitter using AWS Athena, Lambda, and CDK

With more than 300 million active users, Twitter is still one of the more optimal…

Read More


Aug 11, 2020 5 min read

Theo LEBRUN

Twitter

Starting with AWS Glue and Querying S3 from Athena

Part one of three in a deep dive of ETL in AWS Glue. Learn how to create powerful low-code/no-code ETL processes from S3 to many data sources in AWS.…

Read More


Jul 28, 2020 8 min read

Sam Portillo

AWS

ABC's of DQM: Audit

This post is part of a series of posts on Data Quality Management.  The focus…

Read More


Jun 26, 2020 8 min read

Dan Ferguson

Data

Exploring Snowsight: Snowflake's replacement for SQL Worksheets

In June of 2020, Snowflake announced Snowsight: the upcoming replacement for SQL Worksheets and is…

Read More


Jun 26, 2020 5 min read

Sam Portillo

ETL

Apache Spark 3.0

Databricks recently announced the release of Apache Spark 3.0 with their Databricks Runtime 7.…

Read More


Jun 23, 2020 3 min read

Theo LEBRUN

Apache Spark

An Introduction to Data Quality Management (DQM)

What is Data Quality Management (DQM)? Data Quality Management (DQM) is a practice that aims…

Read More


Jun 19, 2020 4 min read

Pooja Krishnan

Data

Snowflake External Functions

Snowflake is a cloud-based data warehousing company.  They specialize in provisioning on-demand compute and elastic…

Read More


Jun 16, 2020 7 min read

Dan Ferguson

Cloud

Build an event sourcing system on AWS using DynamoDB and CDK

Over the past few years, event sourcing has become a popular pattern used in modern…

Read More


May 05, 2020 5 min read

Theo LEBRUN

AWS