Data Pipelines

multi-stage Pipelines

You can manage large transformations by chaining multiple pipeline stages. Each stage can remain simple, effective, and maintainable while achieving a complicated goal. 


Big Data integration

There isn't a big data buzzword we haven't used in production. We can help you unlock the data that seems trapped in that 10 Billion row Hive table.  We know how to move data between Big Data appliances, and into your warehouse. 

Summarize Data

We don't just want to move your data around, we want to extract new meaning from it along the way. Combined with our Big Data expertise, you can summarize billions of data points a day so you can get answers to your questions.

Data Warehouse

pick the right appliance

There are a lot of appliances that will allow you to quickly query your data. We can help figure out which best meets your needs.  We've used PrestoDB, Spark, and AWS Redshift in production.

scalability and uptime

We have run businesses, so we know first-hand that scalability and uptime are important design factors. If your business is going to depend on a data warehouse, it has to be easy to standup and easy to maintain.

schema design

There are a lot of ways to organize your data. We can help you determine the optimal table structure and layout. We want your team to have the right data, in the right place so they can generate insights.

Business Intelligence

evaluate Bi tools

We have either used in production, or evaluated all the top BI tools on the market. Although we're huge fans of Looker, we'll help you pick the one that works for you.

integrate with your warehouse

We help you put all the pieces together, connecting your data warehouse to the BI tool that you have chosen. Depending on the combination of tools/appliances some assembly might be required.


Train your team

It's not realistic that your team will magically know how to use the new tools you provide them. We can champion your solution and train everyone in the organization to be experts.