PinnedPublished inTowards Data ScienceData Modeling Techniques for the Post-Modern Data StackA set of generic techniques and principles to design a robust, cost-efficient, and scalable data model for your post-modern data stack.Jul 203Jul 203
PinnedPublished inTowards Data ScienceNavigating Your Data Platform’s Growing Pains: A Path from Data Mess to Data MeshA set of strategies and guiding principles to effectively scale your data platform while maximizing its business impact.Apr 31Apr 31
PinnedPublished inZendesk Engineeringdbt at Zendesk — Part I: Setting foundations for scalabilityAn overview of ED&A’s data transformation patterns and our path towards a dbt-based framework.May 23, 2023May 23, 2023
PinnedPublished inTowards Data ScienceWriting design docs for data pipelinesExploring the what, why, and how of design docs for data components — and why they matter.May 22, 20231May 22, 20231
PinnedPublished inTowards Data ScienceBuilding an end-to-end open-source modern data platformA detailed guide to help you navigate the modern data stack and build your own platform using open-source technologies.Dec 28, 202110Dec 28, 202110
Published inThe StartupYour twenties are meant for fun after all — or are they?Reflecting on turning 30, the messy journey toward becoming the person you’re meant to be, and the key learnings along the way.Oct 151Oct 151
Published inTowards Data ScienceA Simple (Yet Effective) Approach to Implementing Unit Tests for dbt ModelsUnit testing dbt models has always been one of the most critical missing pieces of the dbt ecosystem. This article proposes a new unit…Aug 18, 20232Aug 18, 20232
Published inTowards Data ScienceYour data catalog shouldn’t be just one more UIAn in-depth look into how an API-focused data catalog can help you ensure the success of your data platform via combining metadata types.Nov 16, 20223Nov 16, 20223
Published inTowards Data SciencePerforming Data Validation at Scale with Soda CoreAn in-depth look into the past and present of data validation, and how you can leverage today’s tools to ensure data quality at scale.May 26, 2022May 26, 2022
Thanks for sharing this - the architecture is really impressive and it's great to see Livy…I experimented with a similar design a couple of years ago but one problem I had with Livy was the fact that it doesn't support concurrent…Aug 31, 2021Aug 31, 2021