InTDS ArchivebyDave MelilloBuilding a Data Platform in 2024How to build a modern, scalable data platform to power your analytics and data science projects (updated)Feb 5, 202446Feb 5, 202446
Stefentaime“Real-Time Data Processing and Analytics with Docker, MySQL, Redpanda, MinIO, and Apache Spark…In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO…Mar 27, 20232Mar 27, 20232
InGeek CulturebySven BalnojanHigh-Performance Data Teams Don’t Care About Data QualityHigh-performance data teams focus on the same four metrics high-performance software teams focus on. But they understand how these metrics…Feb 11, 20234Feb 11, 20234
ZEKERİYYA DEMİRCİApache Spark Structured Streaming Via Docker ComposeBuilding a data pipeline could be challenging especially when you have to take into account portability, flexibility, scalability etc. To…Apr 21, 20222Apr 21, 20222
Gaurav ThalpatiDesign Document for Data PlatformsA practical guide for creating an effective design documentSep 9, 20222Sep 9, 20222
InThe PayPal Technology BlogbyJean-Georges PerrinThe next generation of Data Platforms is the Data MeshHere’s why.Aug 3, 202216Aug 3, 202216
Steve JonesData Warehouses are a legacy of a process driven worldWe don’t build applications like we did in 1995, or even 2015, so why are we still using 90s thinking for data?Apr 26, 20223Apr 26, 20223
InFalkonbyAakash KambujThe 7 Traits of a Modern Metrics StackExploring the attributes necessary to bring operational rigor and visibility to key business metricsApr 28, 2021Apr 28, 2021
InUnboxing PhotoboxbyStefano SolimitoPhotobox New Data PlatformBuilding a self-service even driven data platform in 2022Feb 10, 20221Feb 10, 20221
InThe KickStarterbyRyan GrossData Architects: We live in a distributed world. Design as such.A guide to use graph structures to mix data types, computation DAGs, compute connectivity, and real-world actions to model distributed…Feb 10, 20221Feb 10, 20221
Hülya Pamukçu CrowellA Meta-architecture for Data MeshOne of the essential requirements for successful data-driven decisions is speed. While there are many ways to improve the delivery of data…Jan 1, 2022Jan 1, 2022
InTDS ArchivebyFurcy PinModern Data Stack: which place for Spark ?One year ago, some were already predicting that dbt will one day become bigger than Spark, and the year 2021 proved them right: dbt has…Jan 25, 202215Jan 25, 202215
InTDS ArchivebyPrukalpa5 “Meta” Takeaways from the Gartner Data and Analytics Summit 2021What you should know about active metadata and how you can use it to build a forward-looking data stackJul 31, 2021Jul 31, 2021
InLevel Up CodingbyDoug FooThe *Mythical Modern Data StackMy search failed, just like my search for the ultimate pancakeJan 10, 20221Jan 10, 20221
InTDS ArchivebyAakash NandIntegrating Trino and Apache RangerUnderstanding how to configure Apache Ranger and Trino for data security.Sep 27, 20217Sep 27, 20217
InEngineers HavenbyMo AliI Would Choose AWS CDK Over TerraformAWS vs TerraformJan 28, 202415Jan 28, 202415
InDawn CapitalbyDawn CapitalSaaS Foundations: Building scale-up data infrastructureBy Shamillah Bankiya and David ArndtDec 7, 2021Dec 7, 2021
InTDS ArchivebyBarr MosesWhat’s in Store for the Future of the Modern Data Stack?Bob Muglia, the former CEO of Snowflake, discusses what’s next for the tooling and technologies powering data analytics and engineeringDec 2, 20213Dec 2, 20213