IT'S NOT ABOUT THE CELL

August 16, 2025
8 min read

Introduction to practical Query folding

One of the most powerful capabilities of Power Query and the M Language is Query Folding (also referred to as query delegation, and predicate push-down). Query Folding allows the Power Query Mashup Engine to push the transformations expressed in an M (mashup) query to the data source, in the data source’s query language, resulting in more efficient data processing.

October 25, 2024
5 min read

That's So Meta

I don't know about you, but my goodness, wouldn't it be great to press Close & Apply on Power Query and instantly have all of your columns documented in your model based on metadata definitions in your query? And shoot, why stop there? Let's just leave the column titles exactly as they are from the original source system for easier tracking with our lineage but update to friendlier display names while in the semantic model. It almost sounds too good to be true!

July 10, 2024
7 min read

Cleaning the (Staging) Lakeside

Dirty lakes need cleaning, simple as that - and it's no different when it comes to Dataflow Gen2's staging feature. You might be wondering, "What's the big deal?" Well, let's dive into and dissect the backend staging setup of dataflows in the Lakehouse. Close your eyes and imagine countless system tables, each tagged with lengthy identifiers that intertwine with your actual query names—think some-random-crazystring_Address_002E.parquet. And within these auxiliary tables? YOUR DATA alongside generic column titles like Column1, Column2, Column3 (metadata too! don't worry your column names aren't lost!). But every single query leveraging staging, is using this backend implementation approach - ultimately contributing to data sprawl in my opinion.

April 29, 2024
4 min read

From Pipelines to Table: The Outputs We Ingest

If it’s important, you should probably be logging it. With KQL Databases and the newly introduced Semantic model refresh activity within Data pipelines, its just way too easy (2EZ) now. Before we dive in, let’s look at a few prerequisite items needed for this article:

October 17, 2023
7 min read

CHANGE (IN THE HOUSE OF LAKES)

Data Factory in Microsoft Fabric is an AMAZING tool that allows us to combine the flexibility of Data pipelines and ease of a Dataflow Gen2 to create some nifty solutions with a tiny smidge of code. One of the most common use cases is the ability to load new data into a destination for a select period (incrementally). Below is a high-level diagram that we’ll start breaking down to copy data from a SQL Database into a Lakehouse.

June 30, 2023
3 min read

It's a LongType() to the top (If you want to rock and roll)

I’ll always be the first to admit that the things that interest me may be of little (technical) interest to you. However, I figure “Hey, why not just start documenting my random gibberish findings?!” and well, here we are with a short but sweet article on Dataflows Gen2 in Microsoft Fabric.

April 7, 2023
3 min read

5 Tips for Learning Data Types in Power Query Formula Language

So I ran an experiment: what happens when you ask an AI to write a blog post about Power Query data types? The result was... interesting. Below you'll find five tips generated by Bing's GPT, followed by my commentary on what the robot got right, what it got hilariously wrong (spoiler: "as number" is not a thing), and what this means for content creation now that our robot overlords have arrived. Consider this equal parts tutorial, case study, and existential crisis. #IDK the future is weird.

January 27, 2023
7 min read

The (Almost) Definitive Guide to Query Folding

There's only one rule. Don't break the fold! Before we start enforcing all these rules, we should start with something like a solid foundation of what does and doesn't fold in Power Query and their SQL equivalents. Now, if you found this page and we're like "what in the world is query folding?" read this incredible article, from this amazing author.

January 26, 2023
7 min read

Yo, Listen!

There is a bombardment of information constantly vying for our attention - mobile phones, smart TVs, that 1996 Space Jam website and of course endless cat videos, which is why I believe the advances in low-code automation have been an incredible benefit for sharing (the most relevant) information with others.

January 2, 2018
6 min read

I Always Feel Like SUMPRODUCT()'s Watching Me

As the war wages on between #TEAMVLOOKUP and #TEAMINDEXMATCH, a new challenger has arisen. #SUMPRODUCT, a function often associated with math and trig, has now taken on a new purpose - Boolean logic. But can it compete? Or, will it continue to lurk in the shadows, only to be utilized by Power Users...