Each evaluation is a window into an AI model, Solaiman says, not a perfect readout of how it will…