Flag It, Don't Fix It: Instrument Failure in LLM Scoring
When an LLM produces a structurally suspicious score, such as a confident zero where a real signal likely exists, the right response is to preserve the original value and mark the observation as unreliable.
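A minimal sketch of this idea in Python may help make it concrete. Everything named here is an assumption for illustration, not something the text specifies: the `ScoredObservation` record, the `flag_if_suspicious` helper, the `prior_signal` input (standing in for whatever independent evidence suggests a real signal exists), and the 0.5 threshold. The only point it demonstrates is that the score field is never rewritten; the flag travels alongside it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ScoredObservation:
    """One LLM-produced score plus a reliability marker (hypothetical schema)."""
    item_id: str
    score: float                   # original value from the LLM, never overwritten
    suspect: bool = False          # True when the score looks like instrument failure
    reason: Optional[str] = None   # short explanation of why it was flagged


def flag_if_suspicious(item_id: str, score: float, prior_signal: float,
                       signal_threshold: float = 0.5) -> ScoredObservation:
    """Return the observation with its score untouched, flagged if suspicious.

    Assumed rule: an exact zero is suspicious when independent evidence
    (prior_signal) meets or exceeds signal_threshold.
    """
    if score == 0.0 and prior_signal >= signal_threshold:
        return ScoredObservation(item_id, score, True,
                                 "zero score despite prior signal")
    return ScoredObservation(item_id, score)


observations = [
    flag_if_suspicious("a", 0.0, prior_signal=0.9),  # zero with strong prior: flagged
    flag_if_suspicious("b", 0.0, prior_signal=0.1),  # zero with weak prior: plausible
    flag_if_suspicious("c", 0.7, prior_signal=0.9),  # nonzero score: not flagged
]

# Downstream analysis can exclude flagged rows without losing the raw values.
reliable = [o for o in observations if not o.suspect]
```

Because the record is frozen and the flag is a separate field, a later reviewer can still see what the model actually returned and decide whether to rescore, exclude, or keep the observation.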