The Answer It Expected: When an AI Analysis Reports Conclusions Before Reading Its Own Results
We audited a four-lens retrieval system for a subtle paraphrase-preference bias — and caught the LLM in our own analysis loop writing the conclusion it expected before reading the results, three times. The discipline that fixed it, and four confounds that each shrank our effect.