Pick a baseline run and a comparison run, then click Compare to see the differences in steps, tokens, cost, trust scores, and policy decisions.