Medical AI Benchmarks Shift to Dialogue as Static Tests Mask Clinical Limitations — SYNTHESE