PinnedMahdi KarabibeninTowards Data ScienceNavigating Your Data Platform’s Growing Pains: A Path from Data Mess to Data MeshA set of strategies and guiding principles to effectively scale your data platform while maximizing its business impact.·10 min read·Apr 3, 2024--1--1
PinnedMahdi KarabibeninTowards Data ScienceA Simple (Yet Effective) Approach to Implementing Unit Tests for dbt ModelsUnit testing dbt models has always been one of the most critical missing pieces of the dbt ecosystem. This article proposes a new unit…9 min read·Aug 18, 2023--2--2
PinnedMahdi KarabibeninZendesk Engineeringdbt at Zendesk — Part I: Setting foundations for scalabilityAn overview of ED&A’s data transformation patterns and our path towards a dbt-based framework.10 min read·May 23, 2023----
PinnedMahdi KarabibeninTowards Data ScienceWriting design docs for data pipelinesExploring the what, why, and how of design docs for data components — and why they matter.8 min read·May 22, 2023--1--1
PinnedMahdi KarabibeninTowards Data ScienceBuilding an end-to-end open-source modern data platformA detailed guide to help you navigate the modern data stack and build your own platform using open-source technologies.17 min read·Dec 28, 2021--10--10
Mahdi KarabibeninTowards Data ScienceYour data catalog shouldn’t be just one more UIAn in-depth look into how an API-focused data catalog can help you ensure the success of your data platform via combining metadata types.8 min read·Nov 16, 2022--3--3
Mahdi KarabibeninTowards Data SciencePerforming Data Validation at Scale with Soda CoreAn in-depth look into the past and present of data validation, and how you can leverage today’s tools to ensure data quality at scale.10 min read·May 26, 2022----
Mahdi KarabibenThanks for sharing this - the architecture is really impressive and it's great to see Livy…I experimented with a similar design a couple of years ago but one problem I had with Livy was the fact that it doesn't support concurrent…1 min read·Aug 31, 2021----
Mahdi KarabibeninTowards Data ScienceFully Automating Your ML Pipelines With the AWS CI/CD ToolsA guide to building an automated MLOps pipeline by leveraging the trusted DevOps toolset.7 min read·Jul 18, 2021--1--1
Mahdi KarabibeninTowards Data ScienceRunning Apache Superset at ScaleA set of recommendations and starting points to efficiently run Superset at scale4 min read·Mar 22, 2021--1--1