Evidence Evaluation: Measure Z Corresponds to Human Utility Judgments Better than Measure L and Optimal-Experimental-Design Models