Calibrating Models to Test Data Automatically