No CrossRef data available.
Published online by Cambridge University Press: 11 April 2025
Objectives/Goals: Idiopathic inflammatory myopathies (IIMs) are autoimmune diseases influenced by genetic and environmental factors. This study aims to explore infection patterns preceding IIM onset by applying temporal data mining and machine learning to deidentified patient records and corroborate results from molecular analysis. Methods/Study Population: The dataset used in this work was extracted from TriNetX with a focus on patients who have IIM. Risks for developing the outcomes were assessed using case–control cohorts. For each participant, information was extracted about diagnosis code, date of infection, and study visit in which the infection was reported. This data were then temporally encoded and used to generate sequence files for each of the outcomes. Unsupervised temporal machine learning was then preformed on these files to detect frequent subsequences of infections. Python library scikit-learn was used to perform the unsupervised machine learning with k-means clustering. Results/Anticipated Results: The results of this study identify infections associated with the onset of IIM by analyzing temporal infection patterns. Frequent sequences of infections uncovered, with specific patterns linked to different cohorts, offer insights into the etiology of IIM. Common and cohort-specific infection sequences will help validate existing research and provide new avenues for exploring the disease mechanisms. The findings will highlight significant infection patterns, which will inform our understanding of IIM onset across various patient populations. Discussion/Significance of Impact: The results will provide key insights into pre-symptomatic infection sequences related to IIM onset, enhancing understanding of its etiology and pathogenesis. These findings may aid in developing more precise screening methods for early detection and confirm previous results from analyzing immune signatures of infections in IIM.