Hiring in life sciences? Share your open positions with our professional community. Read more Close

Advertisement

What do LLMs value? An evaluation framework for revealing subjective trade-offs in assessment of glycemic control.

Created on 02 Jul 2026

Authors

Payal Chandak, Elizabeth Healey, Maria F Villa-Tamayo, Agatha F Scheideman, Mandy M Shao, Chiara Fabris, Kenneth D Mandl, Isaac Kohane, David C Klonoff

Published in

Proceedings of machine learning research. Volume 297. Pages 136-151.

Abstract

Clinical decisions often require balancing conflicting priorities rather than simply selecting a single "correct" answer. We present an evaluation framework that probes the value judgments embedded in large language models (LLMs) by testing how they assess quality of glycemic control from continuous glucose monitoring (CGM) data. Using synthetic type 1 diabetes profiles, we asked five commercial LLMs to perform pairwise comparisons of CGM summary statistics and derived a percentile ranking for each profile. We then quantified alignment with two reference metrics: time in range (TIR) and the expert-derived Glycemia Risk Index (GRI), which was developed with clinician input regarding preferences across glycemic ranges. Across three insulin therapy modalities, newer models showed stronger correlation with GRI than older models, suggesting a generational shift toward expert consensus. However, a perturbation analysis revealed instances of disagreement around the weighting of mild hypoglycemia and mild hyperglycemia relative to the GRI. These results demonstrate that high average agreement with clinical metrics can mask clinically meaningful misalignments in how LLMs prioritize risks. Our proposed framework reveals how LLM outputs reflect competing priorities in clinical contexts.

PMID:
42389650
Bibliographic data and abstract were imported from PubMed on 02 Jul 2026.

Advertisement

Stats

  • Community rating n/a 0 votes
  • Reviewers' rating n/a 0 votes
  • Your rating

1-terrible, 9-excellent. How would you rate this publication? Sign in in to submit your rating.

  • Recommendations n/a n/a positive of 0 vote(s)
  • Views 12
  • Comments 0

Recommended by

  • No recommendations yet.

Post a comment

You need to be signed in to post comments. You can sign in here.

Comments

There are no comments yet.

Advertisement