I recently came across a LinkedIn post where someone used two CTEs to pull DISTINCT merchant_ids from the same table—split by year—and then joined everything back to the orders table. It worked! But it got me thinking. Sometimes, the real challenge i...
Artificial Intelligence (AI) and Machine Learning (ML) are transforming businesses across industries. Google Cloud offers a powerful suite of AI/ML tools, making it easier than ever to leverage these technologies. In this blog post, we'll break down ...
Data has become a vital asset for organizations and businesses across the globe. With the increasing use of data analytics, companies rely heavily on the insights provided by the data to make informed decisions. However, the value of data is only rea...
Data quality assurance stands as a crucial pillar upon which the efficacy and reliability of AI models rest. As AI continues to revolutionize industries from healthcare, autonomous vehicles, and smart home automation to finance, the integrity of the data...
Throughout 2024, Collate made incredible progress bringing new capabilities to our customers and to the OpenMetadata open source community. We’ve shipped new features and improvements to accelerate AI automation for data discovery, observability, and...
Preface📖 In part 1, we explained what a data contract is, why we need them, and what a typical one contains. In this blog, we dive into a demo to explore how they actually work so we can process data more safely, quickly and effectively. Goal🎯 The data...
In the modern data-centric landscape, data science has become one of the most in-demand professions. Organizations across various sectors are utilizing data to make smarter decisions, streamline operations, and stay ahead of the competition. Conseque...
In today’s fast-paced business environment, the integration of data science into supply chain management has proven to be transformative. Organizations across various industries are harnessing data-driven insights to enhance efficiency, reduce costs,...
Preface 📚 I’ve actually written a post on data contracts before, so have a quick scan here if you want to see a project I created on them using Python, AWS S3 and libraries like Selenium and Soda. What is a contract? 🤔 Let’s first talk abou...
In the world of data engineering, ensuring data quality is paramount. From business analysts relying on dashboards to C-level executives making strategic decisions, and data scientists training machine learning models — everyone depends on the qualit...