Why does your test suite lie to you when the system contains an LLM?
If you have spent twenty years writing tests for data pipelines, you carry a mental model of what a passing test means. The input is fixed. The function is deterministic. The output is predictable. You assert that the output equals what you expect, the test passes or fails, and a