Published online by Cambridge University Press: 11 April 2025
Objectives/Goals: This Weill Cornell Clinical and Translational Science Collaborative (CTSC) project evaluates whether large language models (LLMs) can generate accurate summaries of translational science benefits using the Translational Science Benefits Model (TSBM) framework, aiming to identify optimal LLMs and prompting strategies via expert review.

Methods/Study Population: We are using prompt engineering to guide multiple LLMs in generating one-page impact profiles structured around the TSBM framework. Candidate LLMs will be selected using benchmarks, with a focus on models that excel at information extraction; leading models (e.g., Llama 3.2, ChatGPT 4.0, Gemini 1.5 Pro, and Claude) and other high-performing models will be considered, and initial work has used Gemini 1.5 Pro. The models draw on data from CTSC-supported projects in WebCAMP, our local instantiation of a translational research activity tracking system used by >20 CTSA hubs, as well as manuscripts identified in the Overton database as cited in policy documents. Human experts will evaluate the quality and accuracy of the LLM-generated profiles.

Results/Anticipated Results: Preliminary results with Gemini 1.5 Pro indicate that LLMs can generate coherent and informative impact profiles spanning diverse areas within the TSBM. Face validity appears satisfactory, suggesting the outputs align with expectations. We anticipate that further exploration with other LLMs, together with expert validation, will reveal strengths and weaknesses of the LLM approach, including the potential for inaccuracies (“hallucinations”), and will inform further refinement of models and prompting strategies. Analysis of manuscripts cited in policy documents will provide insight into communicating policy-relevant benefits effectively, and benchmark comparisons will identify the optimal LLMs for this use case.

Discussion/Significance of Impact: This project demonstrates the potential of LLMs to streamline and enhance impact reporting in translational science, enabling broader dissemination of research outcomes and promoting better understanding among stakeholders. Future work will integrate LLM-based reporting into research infrastructure.
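The prompting approach described in the Methods can be sketched in code. The snippet below is a minimal illustration only: it assumes the google-generativeai Python SDK and the Gemini 1.5 Pro model mentioned in the abstract, while the TSBM domain list as rendered here, the prompt wording, the load of source text, and the example record are illustrative placeholders rather than the project's actual prompts or data.

```python
# Minimal sketch of TSBM-structured prompting with Gemini 1.5 Pro.
# Assumptions: google-generativeai SDK installed; prompt wording and
# the sample record below are illustrative, not the project's own.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # assumed: caller supplies a valid API key

# The four TSBM benefit domains, abbreviated here for the prompt.
TSBM_DOMAINS = [
    "Clinical and Medical",
    "Community and Public Health",
    "Economic",
    "Policy and Legislative",
]

PROMPT_TEMPLATE = """You are summarizing the impact of a translational research project.
Using only the source material below, write a one-page impact profile organized by the
Translational Science Benefits Model (TSBM) domains: {domains}.
For each domain, cite specific evidence from the source; if none exists, state
"No documented benefit" rather than inferring one.

Source material:
{source_text}
"""

def generate_impact_profile(source_text: str) -> str:
    """Generate a TSBM-structured impact profile from project source text (illustrative)."""
    model = genai.GenerativeModel("gemini-1.5-pro")
    prompt = PROMPT_TEMPLATE.format(
        domains=", ".join(TSBM_DOMAINS),
        source_text=source_text,
    )
    response = model.generate_content(prompt)
    return response.text

if __name__ == "__main__":
    # Hypothetical record; in practice the text would come from WebCAMP or Overton exports.
    record = "Project X developed a screening tool later adopted by two county health departments."
    print(generate_impact_profile(record))
```

The explicit instruction to report "No documented benefit" when evidence is absent reflects one simple prompting tactic for limiting the hallucinations the abstract anticipates; the project's actual prompting strategies are being developed and evaluated separately.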