Lineage-driven blast-radius review
DraftChange a key table and you can’t see the downstream blast radius — a known unknown.
Column-level lineage makes the before/after impact analysis tractable — one AI-assisted pass.
Review pipelines you’re only adjacent to, with confidence — not chasing every unknown.
- dbt Fusion
- column lineage
- auto-tests
- AI review
The problem
Simple, well-documented pipelines aren’t scary to review. Changes to high-leverage tables with known unknowns are. A logic change’s impact across the downstream population is unknown: did the code quietly assume a scenario that never happens, when it’s actually real? Is the field or row being added or removed relevant to some other downstream calculation — and did that calculation account for it? The pain runs the whole value chain. Developers ship with less confidence and spend more time tracing paths, still leaving unknowns. Reviewers face the same choice — investigate, or accept the risk. Bad code slips through wherever safeguards are thin. And the strategic cost is bigger: low trust in AI-generated code plus long reviews means AI development isn’t even allowed in some circles.
The build
Detailed before/after analysis of how the shape of downstream features changes isn’t widely done at a fine level — and table-level lineage can’t do it, because one table can feed thousands of fields. Column-level lineage (a dbt Fusion–style engine) makes the targeted query feasible: analyze just the impacted paths. Then bring all the pieces — lineage, encoded standards, automatic tests, AI review — together in one PR workflow. Third-party tools can be parts of it; the value is having everything in one place to maximize review throughput.
Why it's not built yet
A Fusion-style column-lineage engine isn’t readily accessible yet. And this kind of work always reads as a nice-to-have, so it drops down the backlog when there’s no clear path and the development effort isn’t scoped.
Where it comes from
The real goal was empowering a team to review code for pipelines they’re only adjacent to. Standards, process, AI support, and automatic tests all help gauge expected results versus actual — and at least ensure due diligence was done. On fast-iterating, complex codebases the gain compounds; hard to put a clean number on it, but it could be 2x or more.