Search

Pointing Stabilization of a 1Hz High Power Laser via Machine Learning
Alessio Amodio, Dan Wang, Curtis Berger, Hai-En Tsai, Samuel K Barber, Jeroen van Tilborg, Alexander Picksley, Zakchary Eisentraut, Neel Rajeshbhai Vora, Mahek Logantha, Qing Ji, Qiang Du, Russell Wilcox, Anthony Gonsalves
Journal:

High Power Laser Science and Engineering / Accepted manuscript

Published online by Cambridge University Press:

25 April 2025, pp. 1-10
- Article
- - You have access
  - Open access
- PDF
- Export citation

Reflective error: a metric for assessing predictive performance at extreme events
Robert Edwin Rouse, Henry Moss, Scott Hosking, Allan McRobie, Emily Shuckburgh
Journal:

Environmental Data Science / Volume 4 / 2025

Published online by Cambridge University Press:

24 April 2025, e26
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
When using machine learning to model environmental systems, it is often a model’s ability to predict extreme behaviors that yields the highest practical value to policy makers. However, most existing error metrics used to evaluate the performance of environmental machine learning models weigh error equally across test data. Thus, routine performance is prioritized over a model’s ability to robustly quantify extreme behaviors. In this work, we present a new error metric, termed Reflective Error, which quantifies the degree at which our model error is distributed around our extremes, in contrast to existing model evaluation methods that aggregate error over all events. The suitability of our proposed metric is demonstrated on a real-world hydrological modeling problem, where extreme values are of particular concern.

Integrating weight and imaging features: A machine learning framework for larval instar identification in Mythimna separata (Walker)
Xiao Feng, Jingyu Wang, Yunliang Ji, Sohail Abbas, Cong Zhang, Jamin Ali, Adil Tonga, Rizhao Chen, Qiyun Li
Journal:

Bulletin of Entomological Research , First View

Published online by Cambridge University Press:

23 April 2025, pp. 1-10
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
The oriental armyworm, Mythimna separata (Walker), is a highly migratory pest known for its sudden larval outbreaks, which result in severe crop losses. These unpredictable surges pose significant challenges for timely and accurate monitoring, as conventional methods are labour-intensive and prone to errors. To address these limitations, this study investigates the use of machine learning for automated and precise identification of M. separata larval instars. A total of 1577 larval images representing different instar were analysed for geometric, colour, and texture features. Additionally, larval weight was predicted using 13 regression models. Instar identification was conducted using Support Vector Classifier (SVC), Random Forest, and Multi-Layer Perceptron. Key feature contributing to classification accuracy were subsequently identified through permutation feature importance analysis. The results demonstrated the potential of machine learning for automating instar identification with high efficiency and accuracy. Predicted larval weight emerged as a key feature, significantly enhancing the performance of all identification models. Among the tested approaches, BaggingRegressor exhibited the best performance for larval weight prediction (R2 = 98.20%, RMSE = 0.2313), while SVC achieved the highest instar identification accuracy (94%). Overall, the integration of larval weight with other image-derived features proved to be a highly effective strategy. This study demonstrates the efficacy of machine learning in enhancing pest monitoring systems by providing a scalable and reliable framework for precise pest management. The proposed methodology significantly improves larval instar identification accuracy and efficiency, offering actionable insights for implementing targeted biological and chemical control strategies.

Multivariate time-series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990-2018)
Adriano Matousek, Tiffany H Leung, Herbert Pang
Journal:

Public Health Nutrition / Accepted manuscript

Published online by Cambridge University Press:

21 April 2025, pp. 1-30
- Article
- - You have access
  - Open access
- PDF
- Export citation
Objective:
Understanding country-level nutrition intake is crucial to global nutritional policies that aim to reduce disparities and relevant disease burdens. Still, there are limited numbers of studies using clustering techniques to analyse the recent Global Dietary Database. This study aims to extend an existing multivariate time-series clustering algorithm to allow for greater customisability and to provide the first cluster analysis of the Global Dietary Database to explore temporal trends in country-level nutrition profiles (1990-2018).
Design:
Trends in sugar-sweetened beverage intake and nutritional deficiency were explored using the newly developed program ‘MTSclust’. Time-series clustering algorithms are different from simple clustering approaches in their ability to appreciate temporal elements.
Setting:
Nutritional and demographical data from 176 countries were analysed from the Global Dietary Database.
Participants:
Population representative samples of the 176 in the Global Dietary Database.
Results:
In a 3-class test specific to the domain, the MTSclust program achieved a mean accuracy of 71.5% (Adjusted Rand Index [ARI]=0.381) while the mean accuracy of a popular algorithm, DTWclust, was 58% (ARI=0.224). The clustering of nutritional deficiency and sugar-sweetened beverage intake identified several common trends among countries and found that these did not change by demographics. Multivariate time-series clustering demonstrated a global convergence towards a Western diet.
Conclusion:
While global nutrition trends are associated with geography, demographic variables such as sex and age, are less influential to the trends of certain nutrition intake. The literature could be further supplemented by applying outcome-guided methods to explore how these trends link to disease burdens.

5 - Data and Methods for Modeling Climate-Related Migration
Kelsea Best, The Ohio State University, Kayly Ober, United States Institute of Peace, Robert A. McLeman, Wilfrid Laurier University
Book:

Migration and Displacement in a Changing Climate

Published online:

10 April 2025

Print publication:

17 April 2025, pp 162-188
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

In this chapter, we review approaches to model climate-related migration including the multiple goals of modeling efforts and why modeling climate-related migration is of interest to researchers, commonly used sources of climate and migration data and data-related challenges, and various modeling methods used. The chapter is not meant to be an exhaustive inventory of approaches to modeling climate-related migration, but rather is intended to present the reader with an overview of the most common approaches and possible pitfalls associated with those approaches. We end the chapter with a discussion of some of the future directions and opportunities for data and modeling of climate-related migration.

Leveraging initial conditions memory for modelling Rayleigh–Taylor turbulence
Sébastien Thévenin, Benoît-Joseph Gréa, Gilles Kluth, Balasubramanya T. Nadiga
Journal:

Journal of Fluid Mechanics / Volume 1009 / 25 April 2025

Published online by Cambridge University Press:

14 April 2025, A17
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
In this study, we tackle the challenge of inferring the initial conditions of a Rayleigh–Taylor mixing zone for modelling purposes by analysing zero-dimensional (0-D) turbulent quantities measured at an unspecified time. This approach assesses the extent to which 0-D observations retain the memory of the flow, evaluating their effectiveness in determining initial conditions and, consequently, in predicting the flow’s evolution. To this end, we generated a comprehensive dataset of direct numerical simulations, focusing on miscible fluids with low density contrasts. The initial interface deformations in these simulations are characterised by an annular spectrum parametrised by four non-dimensional numbers. To study the sensitivity of 0-D turbulent quantities to initial perturbation distributions, we developed a surrogate model using a physics-informed neural network (PINN). This model enables computation of the Sobol indices for the turbulent quantities, disentangling the effects of the initial parameters on the growth of the mixing layer. Within a Bayesian framework, we employ a Markov chain Monte Carlo (MCMC) method to determine the posterior distributions of initial conditions and time, given various state variables. This analysis sheds light on inertial and diffusive trajectories, as well as the progressive loss of initial conditions memory during the transition to turbulence. Furthermore, it identifies which turbulent quantities serve as better predictors of Rayleigh–Taylor mixing zone dynamics by more effectively retaining the memory of the flow. By inferring initial conditions and forward propagating the maximum a posteriori (MAP) estimate, we propose a strategy for modelling the Rayleigh–Taylor transition to turbulence.

Predicting experiences of paranoia and auditory verbal hallucinations in daily life with ambulatory sensor data – A feasibility study
Felix Strakeljahn, Tania M. Lincoln, Björn Schlier
Journal:

Psychological Medicine / Volume 55 / 2025

Published online by Cambridge University Press:

11 April 2025, e114
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
Prediction models that can detect the onset of psychotic experiences are a key component of developing Just-In-Time Adaptive Interventions (JITAI). Building these models on passively collectable data could substantially reduce user burden. In this study, we developed prediction models to detect experiences of auditory verbal hallucinations (AVH) and paranoia using ambulatory sensor data and assessed their stability over 12 weeks.
Methods
Fourteen individuals diagnosed with a schizophrenia-spectrum disorder participated in a 12-day Ecological Momentary Assessment (EMA) study. They wore ambulatory sensors measuring autonomic arousal (i.e., electrodermal activity, heart rate variability) and completed questionnaires assessing the intensity/distress of AVHs and paranoia once every hour. After 12 weeks, participants repeated the EMA for four days for a follow-up assessment. We calculated prediction models to detect AVHs, paranoia, and AVH-/paranoia-related distress using random forests within nested cross-validation. Calculated prediction models were applied to the follow-up data to assess the stability of prediction models.
Results
Prediction models calculated with physiological data achieved high accuracy both for AVH (81%) and paranoia (69%–75%). Accuracy increased by providing models with baseline information about psychotic symptom levels (AVH: 86%; paranoia: 80%–85%). During the follow-up EMA accuracy dropped slightly throughout all models but remained high (73%–84%).
Conclusions
Relying solely on physiological data to detect psychotic symptoms achieved substantial accuracy that remained sufficiently stable over 12 weeks. Experiences of AVHs can be predicted with higher accuracy and long-term stability than paranoia. The findings tentatively suggest that psychophysiology-based prediction models could be used to develop and enhance JITAIs for psychosis.

Validity and accuracy of artificial intelligence-based dietary intake assessment methods: a systematic review
Sebastián Cofre, Camila Sanchez, Gladys Quezada-Figueroa, Xaviera A. López-Cortés
Journal:

British Journal of Nutrition , First View

Published online by Cambridge University Press:

10 April 2025, pp. 1-13
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
One of the most significant challenges in research related to nutritional epidemiology is the achievement of high accuracy and validity of dietary data to establish an adequate link between dietary exposure and health outcomes. Recently, the emergence of artificial intelligence (AI) in various fields has filled this gap with advanced statistical models and techniques for nutrient and food analysis. We aimed to systematically review available evidence regarding the validity and accuracy of AI-based dietary intake assessment methods (AI-DIA). In accordance with PRISMA guidelines, an exhaustive search of the EMBASE, PubMed, Scopus and Web of Science databases was conducted to identify relevant publications from their inception to 1 December 2024. Thirteen studies that met the inclusion criteria were included in this analysis. Of the studies identified, 61·5 % were conducted in preclinical settings. Likewise, 46·2 % used AI techniques based on deep learning and 15·3 % on machine learning. Correlation coefficients of over 0·7 were reported in six articles concerning the estimation of calories between the AI and traditional assessment methods. Similarly, six studies obtained a correlation above 0·7 for macronutrients. In the case of micronutrients, four studies achieved the correlation mentioned above. A moderate risk of bias was observed in 61·5 % (n 8) of the articles analysed, with confounding bias being the most frequently observed. AI-DIA methods are promising, reliable and valid alternatives for nutrient and food estimations. However, more research comparing different populations is needed, as well as larger sample sizes, to ensure the validity of the experimental designs.

Data publishing in mechanics and dynamics: challenges, guidelines, and examples from engineering design
Henrik Ebel, Jan van Delden, Timo Lüddecke, Aditya Borse, Rutwik Gulakala, Marcus Stoffel, Manish Yadav, Merten Stender, Leon Schindler, Kristin Miriam de Payrebrune, Maximilian Raff, C. David Remy, Benedict Röder, Rohit Raj, Tobias Rentschler, Alexander Tismer, Stefan Riedelbauch, Peter Eberhard
Journal:

Data-Centric Engineering / Volume 6 / 2025

Published online by Cambridge University Press:

08 April 2025, e23
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Data-based methods have gained increasing importance in engineering. Success stories are prevalent in areas such as data-driven modeling, control, and automation, as well as surrogate modeling for accelerated simulation. Beyond engineering, generative and large-language models are increasingly helping with tasks that, previously, were solely associated with creative human processes. Thus, it seems timely to seek artificial-intelligence-support for engineering design tasks to automate, help with, or accelerate purpose-built designs of engineering systems for instance in mechanics and dynamics, where design so far requires a lot of specialized knowledge. Compared with established, predominantly first-principles-based methods, the datasets used for training, validation, and test become an almost inherent part of the overall methodology. Thus, data publishing becomes just as important in (data-driven) engineering science as appropriate descriptions of conventional methodology in publications in the past. However, in mechanics and dynamics, quite widely, still traditional publishing practices are prevalent that largely do not yet take into account the rising role of data as much as that may already be the case in pure data-scientific research. This article analyzes the value and challenges of data publishing in mechanics and dynamics, in particular regarding engineering design tasks, showing that the latter raise also challenges and considerations not typical in fields where data-driven methods have been booming originally. Researchers currently find barely any guidance to overcome these challenges. Thus, ways to deal with these challenges are discussed and a set of examples from across different design problems shows how data publishing can be put into practice.

5 - Anti-discrimination
from Part II - Legal Controls
Yee-Fui Ng, Monash University, Victoria
Book:

Combatting the Code

Published online:

26 March 2025

Print publication:

03 April 2025, pp 74-95
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter analyzes challenges to AI decision-making based on anti-discrimination in the US, the UK, and Australia. Machine learning algorithms can be trained on datasets that contain human bias, thus resulting in predictions that are tainted with unfair discrimination. Anti-discrimination claims involve challenging the inputs of decision-making, such as the data or source code, and arguing that the flawed algorithm or data inputted into the AI system leads to discriminatory outcomes.

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)
Nur Hani Zainal, Regina Eckhardt, Gavin N. Rackoff, Ellen E. Fitzsimmons-Craft, Elsa Rojas-Ashe, Craig Barr Taylor, Burkhardt Funk, Daniel Eisenberg, Denise E. Wilfley, Michelle G. Newman
Journal:

Psychological Medicine / Volume 55 / 2025

Published online by Cambridge University Press:

02 April 2025, e106
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
As the use of guided digitally-delivered cognitive-behavioral therapy (GdCBT) grows, pragmatic analytic tools are needed to evaluate coaches’ implementation fidelity.
Aims
We evaluated how natural language processing (NLP) and machine learning (ML) methods might automate the monitoring of coaches’ implementation fidelity to GdCBT delivered as part of a randomized controlled trial.
Method
Coaches served as guides to 6-month GdCBT with 3,381 assigned users with or at risk for anxiety, depression, or eating disorders. CBT-trained and supervised human coders used a rubric to rate the implementation fidelity of 13,529 coach-to-user messages. NLP methods abstracted data from text-based coach-to-user messages, and 11 ML models predicting coach implementation fidelity were evaluated.
Results
Inter-rater agreement by human coders was excellent (intra-class correlation coefficient = .980–.992). Coaches achieved behavioral targets at the start of the GdCBT and maintained strong fidelity throughout most subsequent messages. Coaches also avoided prohibited actions (e.g. reinforcing users’ avoidance). Sentiment analyses generally indicated a higher frequency of coach-delivered positive than negative sentiment words and predicted coach implementation fidelity with acceptable performance metrics (e.g. area under the receiver operating characteristic curve [AUC] = 74.48%). The final best-performing ML algorithms that included a more comprehensive set of NLP features performed well (e.g. AUC = 76.06%).
Conclusions
NLP and ML tools could help clinical supervisors automate monitoring of coaches’ implementation fidelity to GdCBT. These tools could maximize allocation of scarce resources by reducing the personnel time needed to measure fidelity, potentially freeing up more time for high-quality clinical care.

NLP verification: towards a general methodology for certifying robustness
Marco Casadio, Tanvi Dinkar, Ekaterina Komendantskaya, Luca Arnaboldi, Matthew L. Daggitt, Omri Isac, Guy Katz, Verena Rieser, Oliver Lemon
Journal:

European Journal of Applied Mathematics , First View

Published online by Cambridge University Press:

02 April 2025, pp. 1-58
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Machine learning has exhibited substantial success in the field of natural language processing (NLP). For example, large language models have empirically proven to be capable of producing text of high complexity and cohesion. However, at the same time, they are prone to inaccuracies and hallucinations. As these systems are increasingly integrated into real-world applications, ensuring their safety and reliability becomes a primary concern. There are safety critical contexts where such models must be robust to variability or attack and give guarantees over their output. Computer vision had pioneered the use of formal verification of neural networks for such scenarios and developed common verification standards and pipelines, leveraging precise formal reasoning about geometric properties of data manifolds. In contrast, NLP verification methods have only recently appeared in the literature. While presenting sophisticated algorithms in their own right, these papers have not yet crystallised into a common methodology. They are often light on the pragmatical issues of NLP verification, and the area remains fragmented. In this paper, we attempt to distil and evaluate general components of an NLP verification pipeline that emerges from the progress in the field to date. Our contributions are twofold. First, we propose a general methodology to analyse the effect of the embedding gap – a problem that refers to the discrepancy between verification of geometric subspaces, and the semantic meaning of sentences which the geometric subspaces are supposed to represent. We propose a number of practical NLP methods that can help to quantify the effects of the embedding gap. Second, we give a general method for training and verification of neural networks that leverages a more precise geometric estimation of semantic similarity of sentences in the embedding space and helps to overcome the effects of the embedding gap in practice.

Machine-learning-based pressure reconstruction with moving boundaries
Hongping Wang, Fan Wu, Yi Liu, Xinyi He, Shuyi Feng, Shizhao Wang
Journal:

Journal of Fluid Mechanics / Volume 1008 / 10 April 2025

Published online by Cambridge University Press:

02 April 2025, A21
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
The greatest challenge in pressure reconstruction from the measured velocity fields is that the error of material acceleration is significantly contaminated due to error propagation. Particularly for flows with moving boundaries, accurate boundary velocities are difficult to obtain due to error propagation, and a complex boundary processing technique is needed to treat the moving boundaries. The present work proposes a machine-learning-based method to determine the pressure for incompressible flows with moving boundaries. The proposed network consists of two neural networks: one network, named the boundary network, is used to track the Lagrangian boundary points; the other physics-informed neural network, named the flow network, is adopted to approximate the flow fields. These two networks are coupled by imposing boundary conditions. We further propose a new dynamic weight strategy for the loss terms to guarantee convergence and stability. The performance of the proposed method is validated by two examples: the flow over an oscillating cylinder and the flow around a swimming fish. The proposed method can accurately determine the pressure fields and boundary motion from synthetic particle image velocimetry (PIV) flow fields. Moreover, this method can also predict the boundary and pressure at a given instant without supervised data. Finally, this method was applied to reconstruct the pressure from the two-dimensional and three-dimensional PIV velocities of the left ventricle. All of the results indicate that the proposed method can accurately reconstruct the pressure fields for flows with moving boundaries and is a novel method for surface pressure estimation.

Using machine learning to identify features associated with different types of self-injurious behaviors in autistic youth
Ligia Antezana, Caitlin M. Conner, Safaa Eldeeb, Samuel Turecki, Matthew Siegel, Helmet T. Karim, Carla A. Mazefsky
Journal:

Psychological Medicine / Volume 55 / 2025

Published online by Cambridge University Press:

31 March 2025, e98
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Background
Self-injurious behaviors (SIB) are common in autistic people. SIB is mainly studied as a broad category, rather than by specific SIB types. We aimed to determine associations of distinct SIB types with common psychiatric, emotional, medical, and socio-demographic factors.
Methods
Participants included 323 autistic youth (~50% non−/minimally-speaking) with high-confidence autism diagnoses ages 4–21 years. Data were collected by the Autism Inpatient Collection during admission to a specialized psychiatric inpatient unit (www.sfari.org/resource/autism-inpatient-collection/). Caregivers completed questionnaires about their child, including SIB type and severity. The youth completed assessments with clinicians. Elastic net regressions identified associations between SIB types and factors.
Results
No single factor relates to all SIB types. SIB types have unique sets of associations. Consistent with previous work, more repetitive motor movements and lower adaptive skills are associated with most types of SIB; female sex is associated with hair/skin pulling and self-rubbing/scratching. More attention-deficit/hyperactivity disorder symptoms are associated with self-rubbing/scratching, skin picking, hair/skin pulling, and inserts finger/object. Inserts finger/object has the most medical condition associations. Self-hitting against surface/object has the most emotion dysregulation associations.
Conclusions
Specific SIB types have unique sets of associations. Future work can develop clinical likelihood scores for specific SIB types in inpatient settings, which can be tested with large community samples. Current approaches for SIB focus on the behavior functions, but there is an opportunity to further develop interventions by considering the specific SIB type in assessment and treatment. Identifying factors associated with specific SIB types may aid with screening, prevention, and treatment of these often-impairing behaviors.

How to beat a Bayesian adversary
Part of
- Probabilistic methods, simulation and stochastic differential equations
Zihan Ding, Kexin Jin, Jonas Latz, Chenguang Liu
Journal:

European Journal of Applied Mathematics , First View

Published online by Cambridge University Press:

28 March 2025, pp. 1-23
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Deep neural networks and other modern machine learning models are often susceptible to adversarial attacks. Indeed, an adversary may often be able to change a model’s prediction through a small, directed perturbation of the model’s input – an issue in safety-critical applications. Adversarially robust machine learning is usually based on a minmax optimisation problem that minimises the machine learning loss under maximisation-based adversarial attacks. In this work, we study adversaries that determine their attack using a Bayesian statistical approach rather than maximisation. The resulting Bayesian adversarial robustness problem is a relaxation of the usual minmax problem. To solve this problem, we propose Abram – a continuous-time particle system that shall approximate the gradient flow corresponding to the underlying learning problem. We show that Abram approximates a McKean–Vlasov process and justify the use of Abram by giving assumptions under which the McKean–Vlasov process finds the minimiser of the Bayesian adversarial robustness problem. We discuss two ways to discretise Abram and show its suitability in benchmark adversarial deep learning experiments.

News Cycles and Satisfaction With Democracy: How the Pandemic Short-Circuited Media Polarization
Omar Hammoud-Gallego, Roberto S. Foa, Xavier Romero-Vidal
Journal:

British Journal of Political Science / Volume 55 / 2025

Published online by Cambridge University Press:

26 March 2025, e49
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
During the coronavirus pandemic in the United Kingdom, media outlets shifted their focus from divisive political issues to more neutral topics like lifestyle, sports, and entertainment. This study explores how this change in media content relates to partisan divides in satisfaction with democracy. Using data from a representative survey of 201,144 individuals, we linked respondents’ perceptions of democratic performance to their daily media exposure. We did so by analysing 1.5 million tweets from British newspapers using a topic modelling algorithm to identify shifts in topic salience and sentiment using sentiment analysis. Our findings reveal a decline in partisan media exposure during the pandemic, associated with increased satisfaction with democracy at both individual and collective levels, and a narrowing of cross-party divides. These results contribute to discussions on affective polarization, the winner-loser gap in democratic evaluation, and media framing effects, highlighting the potential influence of depoliticized news coverage on democratic attitudes.

Validation of large language models (Llama 3 and ChatGPT-4o mini) for title and abstract screening in biomedical systematic reviews
Adriana López-Pineda, Rauf Nouni-García, Álvaro Carbonell-Soliva, Vicente F Gil-Guillén, Concepción Carratalá-Munuera, Fernando Borrás
Journal:

Research Synthesis Methods ,

Published online by Cambridge University Press:

24 March 2025, pp. 1-11
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
With the increasing volume of scientific literature, there is a need to streamline the screening process for titles and abstracts in systematic reviews, reduce the workload for reviewers, and minimize errors. This study validated artificial intelligence (AI) tools, specifically Llama 3 70B via Groq’s application programming interface (API) and ChatGPT-4o mini via OpenAI’s API, for automating this process in biomedical research. It compared these AI tools with human reviewers using 1,081 articles after duplicate removal. Each AI model was tested in three configurations to assess sensitivity, specificity, predictive values, and likelihood ratios. The Llama 3 model’s LLA_2 configuration achieved 77.5% sensitivity and 91.4% specificity, with 90.2% accuracy, a positive predictive value (PPV) of 44.3%, and a negative predictive value (NPV) of 97.9%. The ChatGPT-4o mini model’s CHAT_2 configuration showed 56.2% sensitivity, 95.1% specificity, 92.0% accuracy, a PPV of 50.6%, and an NPV of 96.1%. Both models demonstrated strong specificity, with CHAT_2 having higher overall accuracy. Despite these promising results, manual validation remains necessary to address false positives and negatives, ensuring that no important studies are overlooked. This study suggests that AI can significantly enhance efficiency and accuracy in systematic reviews, potentially revolutionizing not only biomedical research but also other fields requiring extensive literature reviews.

Exploring the Impact of China’s Retaliatory Tariffs on US Soybean Exports with Machine Learning Techniques
Anastasia W. Thayer, Pengyan Sun, Hernan A. Tejeda, Man-Keun Kim
Journal:

Journal of Agricultural and Applied Economics , FirstView

Published online by Cambridge University Press:

24 March 2025, pp. 1-19
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
The 2018/2019 trade conflict between the United States and China impacted a broad array of agricultural products, including soybeans. Previous trade studies using gravity models fail to account for trends and complex seasonal patterns observed in the data. This study uses a machine learning (ML) approach to estimate losses in soybean export value and volume from the trade war. We find that models using ML techniques outperform traditional models and estimate losses in the value of soybean exports of $10.16 billion/year. The ML models fit the complex export trade data series well, highlighting the importance of utilizing improved modeling approaches.

Factors associated with carditis adverse events following SARS-COV-2-19 vaccination
Kyung Hyun Min, Jun Hyeob Kim, Jin Yeon Gil, Jun Hyuk Park, Ji Min Han, Kyung Eun Lee
Journal:

Epidemiology & Infection / Volume 153 / 2025

Published online by Cambridge University Press:

19 March 2025, e51
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
The study aimed to delve into the incidence and risk factors associated with myocarditis and pericarditis following SARS-COV-2-19 vaccination, addressing a notable gap in understanding the safety profile of vaccinations. Through meticulous data selection from the National Health Insurance System (NHIS) database of Korea, the researchers employed both a case-crossover study and a nested case-control design to analyze temporal patterns and risk factors related to carditis occurrences post-immunization. Key findings revealed a significant association between SARS-COV-2-19 vaccination and the occurrence of carditis, with a strong temporal correlation observed within 10 days post-vaccination. Noteworthy factors contributing to carditis risk included the duration between vaccination and carditis, specific comorbidities and medication use. The study concluded by recommending an extended post-vaccination surveillance duration of at least 10 days and underscored the importance of considering individual medical histories and concurrent medication use in assessing vaccine-induced carditis risk. This study might contribute to understanding vaccine safety profiles and emphasizes the significance of comprehensive post-vaccination monitoring protocols.

Using machine learning for communication classification
Stefan P. Penczynski
Journal:

Experimental Economics / Volume 22 / Issue 4 / December 2019

Published online by Cambridge University Press:

14 March 2025, pp. 1002-1029
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
The present study explores the value of machine learning techniques in the classification of communication content in experiments. Previously human-coded datasets are used to both train and test algorithm-generated models that relate word counts to categories. For various games, the computer models of the classification are able to match out-of-sample the human classification to a considerable extent. The analysis raises hope that the substantial effort going into such studies can be reduced by using computer algorithms for classification. This would enable a quick and replicable analysis of large-scale datasets at reasonable costs and widen the applicability of such approaches. The paper gives an easily accessible technical introduction into the computational method.

Search Results

Refine search

Refine search

Actions for selected content:

849 results

Pointing Stabilization of a 1Hz High Power Laser via Machine Learning

Reflective error: a metric for assessing predictive performance at extreme events

Integrating weight and imaging features: A machine learning framework for larval instar identification in Mythimna separata (Walker)

Multivariate time-series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990-2018)

5 - Data and Methods for Modeling Climate-Related Migration

Summary

Leveraging initial conditions memory for modelling Rayleigh–Taylor turbulence

Predicting experiences of paranoia and auditory verbal hallucinations in daily life with ambulatory sensor data – A feasibility study

Validity and accuracy of artificial intelligence-based dietary intake assessment methods: a systematic review

Data publishing in mechanics and dynamics: challenges, guidelines, and examples from engineering design

5 - Anti-discrimination

Summary

Capitalizing on natural language processing (NLP) to automate the evaluation of coach implementation fidelity in guided digital cognitive-behavioral therapy (GdCBT)

NLP verification: towards a general methodology for certifying robustness

Machine-learning-based pressure reconstruction with moving boundaries

Using machine learning to identify features associated with different types of self-injurious behaviors in autistic youth

How to beat a Bayesian adversary

News Cycles and Satisfaction With Democracy: How the Pandemic Short-Circuited Media Polarization

Validation of large language models (Llama 3 and ChatGPT-4o mini) for title and abstract screening in biomedical systematic reviews

Exploring the Impact of China’s Retaliatory Tariffs on US Soybean Exports with Machine Learning Techniques

Factors associated with carditis adverse events following SARS-COV-2-19 vaccination

Using machine learning for communication classification

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

849 results

Summary

Summary