Medallion Architecture

Every business system the company runs on, consolidated into one governed architecture on Microsoft Fabric.

Microsoft Fabric·PySpark·Delta Lake·Power BI·DAX·T-SQL·REST API·SFTP·Azure DevOps

The story in four beats

The mess

Six systems, no shared language

Six VMS platforms, each with its own data shape and change semantics. Reporting ran on email and spreadsheets.

The move

One architecture on Fabric

Build a Medallion architecture and a governed semantic model. One source of truth.

The build

Bronze → Silver → Gold

Six vendor schemas, canonicalized into one governed star schema. Each source ingested on its own terms.

The result

99.5%+, and the ritual's gone

Reconciled across four years of history. 30+ reports on one governed model. Still scaling.

The Problem

I joined a healthcare staffing MSP as its founding data hire. The company was two years in. Small enough that everything still ran on email and spreadsheets, growing fast enough that the approach was about to stop working. There was no BI. No dashboards, no models, no single number anyone agreed on. When a leader needed a figure (margin on a placement, headcount on a contract, invoice status for a client) they emailed whoever they thought might know it, and waited.

I built Power BI from scratch. The first reports went out with proper models, each one shaped to the stakeholder who asked for it. Finance, operations, account management, sales. They all had legitimate reasons to define the same metric slightly differently, and every report was correct in its own context. By the time the catalog hit ten reports, the problem surfaced. Leadership read two dashboards side by side and saw different numbers for the same thing. Every report was right. The system was incoherent.

Figure. Each report tuned to its stakeholder. No shared vocabulary underneath.

Fixing it wasn't a technical decision I could make alone. I spent weeks consulting with program management, finance, leadership, and the sales team to understand what each definition meant to them, why it mattered, and where the real boundaries were. We consolidated those conversations into a single set of canonical definitions, approved by leadership, with any report-specific logic explicitly flagged so nobody had to guess which version of a number they were looking at.

What made that enforceable was a governed semantic model sitting on top of a Medallion architecture on Microsoft Fabric. The Medallion ingests and conforms data from multiple vendor systems into one canonical model. The semantic model governs it. Every measure, every calculation, every business definition lives in that model once. When someone downstream needs a new report, they import the model and build visuals. They don't write custom logic. They don't redefine margin. Data drift became structurally impossible, and reporting got dramatically faster. The catalog kept growing as the company scaled, but every new report shipped on the same foundation as the last one.

The Challenge

A VMS (Vendor Management System) is the central platform that healthcare staffing runs on. It manages the full lifecycle of a placement: job orders from facilities, candidate submissions from agencies, credentialing, shift scheduling, timecards, invoicing, billing, rate negotiations, worker compliance. Every transaction between a hospital and the staffing agencies filling its roles flows through a VMS. They are large, complex systems, and most companies in this space don't run one. They run several.

The business currently runs on four of them, with another two being onboarded. None of them were designed to be unified, and the differences go far deeper than schema.

Delivery channels have nothing in common. One source exposes a REST API with change timestamps. Another only returns currently-active records, so if something is deleted upstream it silently disappears with no tombstone. A third ships daily SFTP CSVs with no incremental markers. A fourth is a live database shortcut. Each one requires a completely different ingestion pattern.
Change semantics diverge. Some APIs tell you what changed. Others make you figure it out by diffing against yesterday. One vendor periodically freezes entire tables for months without notice. A pipeline that assumes they all behave the same will quietly lose data.
Business concepts don't align. A "submission" in one system has five stages. In another it has three, with different names. The lifecycle of an order, the definition of an active worker, the point at which a timecard is considered approved, all different.
Timecard and invoicing workflows vary. Which timekeeping solution backs the data, how approvals flow, what level of line-item detail is exposed on an invoice. Some systems give you everything. Others give you a summary with no way to drill in.
Entity depth is inconsistent. Some systems expose rich detail on agencies, clients, and facilities. Others give you an ID and a name. The canonical model has to hold all of them without fabricating detail that doesn't exist in the source, and without dropping detail that does.
Foreign key assumptions break. One system's assignment records overwhelmingly skip the standard position key because workers were onboarded outside the normal workflow. Joining naively sent thousands of shifts to "Unknown Facility" in the gold layer. The fix was understanding the denormalized structure, not fixing the join.

The model had to be built to the lowest common denominator where the systems diverge, and enriched where individual systems offered more depth.

The Architecture

Figure 1. Six sources, three layers. Two more onboarding into the same architecture.

Three layers, each one solving a problem the layer above it doesn't have to know about.

Bronze is the raw landing zone. Source column casing preserved as received, one schema per VMS, append-only wherever the source's change semantics allow it. The ingestion pattern is matched to the source, not imposed on it. An overlap window on the API that publishes change timestamps. An incremental MERGE on the system with reliable watermarks. An append with file-date tiebreaker for the SFTP feed. And for the VMS that silently drops deleted records, a three-clause soft-delete MERGE that refreshes the active rows, inserts genuine new ones, and audit-trails the disappearances rather than treating absence as a non-event.

Silver is where the vocabularies become one. Per-source transforms map each vendor's column language onto a canonical schema, and a master-append step unions them into unified facts and dimensions with deterministic surrogate keys. The work is conformance, not transport.

Gold is a Fabric Warehouse. Conformed facts joined to shared dimensions, with every VMS's native status codes reconciled against one canonical model. The materialization runs as a transactional TRUNCATE+INSERT inside a stored procedure rather than DROP+CREATE, because Direct Lake reads the underlying parquet live and a drop opens a "table not found" window to anyone querying at the wrong moment.

A single governed semantic model sits on top. All measures, all business logic, defined once. Every report draws from this one model. Row-Level Security handles departmental scoping without duplicating dashboards.

What Broke, and What I Took From It

An "incremental" upsert that was secretly an overwrite. A backwards-compatibility alias in a shared utility was delegating to mode("overwrite") instead of performing a real Delta MERGE. The pipeline was filtering source data to a recent window and writing that window back as the entire bronze table. The damage hadn't surfaced because the function had rarely fired with non-empty deltas. A coincidence dressed as correctness. I rewrote it as a real MERGE backed by a behavioral test that proves the four properties that matter: initial write, update in place, preserve untouched rows, insert new. The test that had let the bug ship asserted the function worked. The new one asserts the function fails the right way when it's wrong.

A source pinned to a stale filename. One of the SFTP ingestions had a hardcoded date in its Copy activity, so the pipeline had spent five weeks re-ingesting the same file while new daily drops landed unread. Bronze looked healthy because the row count never decreased. The first run after the fix produced thousands of rows confirming the silent gap. Lesson taken: append-only pipelines need a freshness check, not just a row count.

A Delta schema-evolution edge case on the soft-delete MERGE. The whenNotMatchedBySourceUpdate clause referenced soft-delete columns that didn't exist on legacy tables created by the old overwrite pattern, and Delta's autoMerge doesn't help that specific clause. It resolves SET columns against the pre-evolution schema. The fix was an explicit ALTER TABLE ahead of the MERGE with a COALESCE guard so the first run after migration was well-defined. The deeper lesson was about trusting platform-level "automatic" guarantees. They cover the common path, and the senior engineer's job is knowing which uncommon path they don't cover.

A cross-layer deployment deadlock from a single PR. A project-wide rename to snake_case in one commit meant the gold procedure referenced silver columns that didn't exist yet, the silver notebooks couldn't run until the procedure compiled, and the warehouse's XMLA model definition still carried the old casing. Recovery took three sequenced commits in a specific order. The rule that came out of it (schema changes in Fabric have a required deployment order across bronze, silver, gold, and the semantic model) is now documented and enforced before any cross-layer change ships.

Proving It Was Right

A new pipeline replacing reports that executives already trusted doesn't get the benefit of the doubt. It has to earn the cutover.

I built a reconciliation harness that compared the new gold output against the legacy figures, per source, per grain, per year. The comparison had to apply the same filters the legacy reports applied (they weren't naive snapshots) or the differences would be artifacts of methodology rather than logic. Every residual got triaged into a living log with a status legend: within tolerance, material gap, characterized cause, open question. Nothing shipped while a material gap was unresolved.

The final number was a 99.5%+ match across all four sources and four years of order, timecard, and invoice history. The remaining residuals were fully characterized as refresh-cadence drift between live API ingestion and the legacy CSV exports. A property of when the data was sampled, not a defect in how it was modeled.

Figure 2. The harness compares every fact-grain in the new pipeline against the legacy production data, per source, per year. All active cells fully characterized. N/A cells are sources not yet onboarded.

Results & Impact

99.5%+ reconciliation match across all sources and four years of order, timecard, and invoice history. Residuals fully characterized as refresh-cadence drift, not logic gaps.
30+ reports on one governed semantic model with Row-Level Security, ending the "this doesn't match that" disputes between dashboards.
Financial reporting runs end-to-end from live API and SFTP ingestion through bronze, silver, gold, and into the dashboards leadership opens on Monday morning. No manual assembly.
Operational standards in place. Behavioral tests for MERGE patterns. Documented deployment ordering for cross-layer schema changes. A living reconciliation log with status legend.
Git-backed PR workflow replaced publish-to-production. Every change goes through Azure DevOps with code review before it touches the production workspace. The same instinct that centralized the data model centralized the deployment process.
New source onboarding is repeatable. Two additional VMS integrations are being onboarded into the existing pattern without changes to silver or gold. The architecture made that a process, not a project.
Self-serve reporting. Downstream analysts import the governed model and build visuals. They don't write queries, define measures, or make modeling decisions. That's already done. It removed the founding data hire as a bottleneck on every new report.

What's Underway

Two additional VMS integrations are being onboarded into the existing bronze-silver-gold pattern. The architecture absorbs them without changes to the silver conformance layer or the gold schema.

The bigger expansion is the CRM. This is a different class of source entirely. It holds the business development pipeline (opportunities, sales stages, conversations), client and supplier master data, divisional ownership, team assignments, and implementation phases. Integrating it means the lakehouse will connect operational workforce data to the commercial relationships that drive it. Reporting that currently requires someone to cross-reference a VMS dashboard with a CRM screen will resolve to a single query against the governed model.

What I'd Do Differently

I'd version the semantic model independently from the lakehouse. Right now a column rename in silver can break a DAX measure in the semantic model, and you only find out when the dataset refresh fails. If the semantic model had its own contract tests against the gold schema, those breaks would surface in the PR, not in production at 6 AM.

I'd also push harder on data contracts earlier. The split notebooks in silver do the conformance work, but the "contract" between bronze and silver is implicit. It lives in the notebook logic, not in a schema definition that a pipeline can validate before it runs. That means a source-side schema change (a renamed column, a new enum value, a field that quietly goes nullable) passes bronze without complaint and blows up in silver. A lightweight contract layer, even just a JSON schema check at the bronze-to-silver boundary, would have caught several of the issues described in the "What Broke" section before they reached the transform step.

The Stack

Microsoft Fabric (Lakehouses, Warehouses, Data Pipelines, Direct Lake, Fabric-Git integration), PySpark, Spark SQL, Delta Lake, Python, T-SQL, Power BI, DAX, Azure DevOps with Git-based PR workflows, REST API integration, SFTP ingestion.