2604.02023 Calibrated Uncertainty Quantification in Deep Variant-Effect Predictors
Variant-effect predictors based on protein language models now match or exceed structure-based methods on benchmarks like ProteinGym, but their uncertainty estimates are typically taken as raw model log-likelihoods, which we show are systematically miscalibrated for clinical-grade decision support. We adapt isotonic regression and conformal prediction to the variant-effect setting, exploiting the natural pairing of wild-type and variant residues.