Búsqueda | Portal Regional de la BVS

1.

Prompt Engineering Paradigms for Medical Applications: Scoping Review.

Zaghir, Jamil; Naguib, Marco; Bjelogrlic, Mina; Névéol, Aurélie; Tannier, Xavier; Lovis, Christian.

J Med Internet Res ; 26: e60501, 2024 Sep 10.

Artículo en Inglés | MEDLINE | ID: mdl-39255030

RESUMEN

BACKGROUND: Prompt engineering, focusing on crafting effective prompts to large language models (LLMs), has garnered attention for its capabilities at harnessing the potential of LLMs. This is even more crucial in the medical domain due to its specialized terminology and language technicity. Clinical natural language processing applications must navigate complex language and ensure privacy compliance. Prompt engineering offers a novel approach by designing tailored prompts to guide models in exploiting clinically relevant information from complex medical texts. Despite its promise, the efficacy of prompt engineering in the medical domain remains to be fully explored. OBJECTIVE: The aim of the study is to review research efforts and technical approaches in prompt engineering for medical applications as well as provide an overview of opportunities and challenges for clinical practice. METHODS: Databases indexing the fields of medicine, computer science, and medical informatics were queried in order to identify relevant published papers. Since prompt engineering is an emerging field, preprint databases were also considered. Multiple data were extracted, such as the prompt paradigm, the involved LLMs, the languages of the study, the domain of the topic, the baselines, and several learning, design, and architecture strategies specific to prompt engineering. We include studies that apply prompt engineering-based methods to the medical domain, published between 2022 and 2024, and covering multiple prompt paradigms such as prompt learning (PL), prompt tuning (PT), and prompt design (PD). RESULTS: We included 114 recent prompt engineering studies. Among the 3 prompt paradigms, we have observed that PD is the most prevalent (78 papers). In 12 papers, PD, PL, and PT terms were used interchangeably. While ChatGPT is the most commonly used LLM, we have identified 7 studies using this LLM on a sensitive clinical data set. Chain-of-thought, present in 17 studies, emerges as the most frequent PD technique. While PL and PT papers typically provide a baseline for evaluating prompt-based approaches, 61% (48/78) of the PD studies do not report any nonprompt-related baseline. Finally, we individually examine each of the key prompt engineering-specific information reported across papers and find that many studies neglect to explicitly mention them, posing a challenge for advancing prompt engineering research. CONCLUSIONS: In addition to reporting on trends and the scientific landscape of prompt engineering, we provide reporting guidelines for future studies to help advance research in the medical field. We also disclose tables and figures summarizing medical prompt engineering papers available and hope that future contributions will leverage these existing works to better advance the field.

Asunto(s)

Procesamiento de Lenguaje Natural , Humanos , Informática Médica/métodos

2.

SIMpat: A Synthetic Benchmark for Similarity Metrics on Patient Representations.

Voegeli, Jean-Virgile; Bjelogrlic, Mina; Gaudet-Blavignac, Christophe; Dubos, Richard; Zimmermann, Myriam; Bensahla Talet, Adel; Zheng, Yuanyuan; Ehrsam, Julien; Lovis, Christian.

Stud Health Technol Inform ; 316: 1647-1651, 2024 Aug 22.

Artículo en Inglés | MEDLINE | ID: mdl-39176526

RESUMEN

Similarity and clustering tasks based on data extracted from electronic health records on the patient level suffer from the curse of dimensionality and the lack of inter-patient data comparability. Indeed, for many health institutions, there are many more variables, and ways of expressing those variables to represent patients than patients sharing the same set of data. To lower redundancy and increase interoperability one strategy is to map data to semantic-driven representations through medical knowledge graphs such as SNOMED-CT. However, patient similarity metrics based on this knowledge-graph information lack quantitative evaluation and comparisons with pure data-driven methods. The reasons are twofold, firstly, it is hard to conceptually assess and formalize a gold-standard similarity between patients resulting in poor inter-annotator agreement in qualitative evaluations. Secondly, the community has been lacking a clear benchmark to compare existing metrics developed by scientific communities coming from various fields such as ontology, data science, and medical informatics. This study proposes to leverage the known challenges of evaluating patient similarities by proposing SIMpat, a synthetic benchmark to quantitatively evaluate available metrics, based on controlled cohorts, which could later be used to assess their sensibility regarding aspects such as the sparsity of variables or specificities of patient disease patterns.

Asunto(s)

Benchmarking , Registros Electrónicos de Salud , Humanos , Systematized Nomenclature of Medicine , Semántica

3.

Efficient Clinical Information Extraction from Breast Radiology Reports in French.

Zaghir, Jamil; Lokaj, Belinda; Kinkel, Karen; Djema, Amal-Dahila; Turbé, Hugues; Bjelogrlic, Mina; Durand de Gevigney, Valentin; Schmid, Jérôme; Lovis, Christian; Goldman, Jean-Philippe.

Stud Health Technol Inform ; 316: 1780-1784, 2024 Aug 22.

Artículo en Inglés | MEDLINE | ID: mdl-39176562

RESUMEN

Radiology reports contain crucial patient information, in addition to images, that can be automatically extracted for secondary uses such as clinical support and research for diagnosis. We tested several classifiers to classify 1,218 breast MRI reports in French from two Swiss clinical centers. Logistic regression performed better for both internal (accuracy > 0.95 and macro-F1 > 0.86) and external data (accuracy > 0.81 and macro-F1 > 0.41). Automating this task will facilitate efficient extraction of targeted clinical parameters and provide a good basis for future annotation processes through automatic pre-annotation.

Asunto(s)

Neoplasias de la Mama , Imagen por Resonancia Magnética , Humanos , Femenino , Neoplasias de la Mama/diagnóstico por imagen , Francia , Sistemas de Información Radiológica , Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Suiza , Minería de Datos

4.

Beyond Tokens: Fair Evaluation of French Large Language Models for Clinical Named Entity Recognition.

Zaghir, Jamil; Bjelogrlic, Mina; Goldman, Jean-Philippe; Bensahla, Adel; Zheng, Yuanyuan; Lovis, Christian.

Stud Health Technol Inform ; 316: 666-670, 2024 Aug 22.

Artículo en Inglés | MEDLINE | ID: mdl-39176830

RESUMEN

Named Entity Recognition (NER) models based on Transformers have gained prominence for their impressive performance in various languages and domains. This work delves into the often-overlooked aspect of entity-level metrics and exposes significant discrepancies between token and entity-level evaluations. The study utilizes a corpus of synthetic French oncological reports annotated with entities representing oncological morphologies. Four different French BERT-based models are fine-tuned for token classification, and their performance is rigorously assessed at both token and entity-level. In addition to fine-tuning, we evaluate ChatGPT's ability to perform NER through prompt engineering techniques. The findings reveal a notable disparity in model effectiveness when transitioning from token to entity-level metrics, highlighting the importance of comprehensive evaluation methodologies in NER tasks. Furthermore, in comparison to BERT, ChatGPT remains limited when it comes to detecting advanced entities in French.

Asunto(s)

Procesamiento de Lenguaje Natural , Francia , Humanos , Registros Electrónicos de Salud , Lenguaje , Neoplasias , Vocabulario Controlado

5.

A Novel Method and Python Library for ECG Signal Quality Assessment.

Berger, Charles; Turbé, Hugues; Bjelogrlic, Mina; Lovis, Christian.

Stud Health Technol Inform ; 316: 858-862, 2024 Aug 22.

Artículo en Inglés | MEDLINE | ID: mdl-39176928

RESUMEN

Electrocardiogram (ECG) is one of the reference cardiovascular diagnostic exams. However, the ECG signal is very prone to being distorted through different sources of artifacts that can later interfere with the diagnostic. For this reason, signal quality assessment (SQA) methods that identify corrupted signals are critical to improve the robustness of automatic ECG diagnostic methods. This work presents a review and open-source implementation of different available indices for SQA as well as introducing an index that considers the ECG as a dynamical system. These indices are then used to develop machine learning models which evaluate the quality of the signals. The proposed index along the designed ML models are shown to improve SQA for ECG signals.

Asunto(s)

Electrocardiografía , Aprendizaje Automático , Humanos , Procesamiento de Señales Asistido por Computador , Artefactos , Algoritmos , Lenguajes de Programación

6.

Patient Information Summarization in Clinical Settings: Scoping Review.

Keszthelyi, Daniel; Gaudet-Blavignac, Christophe; Bjelogrlic, Mina; Lovis, Christian.

JMIR Med Inform ; 11: e44639, 2023 Nov 28.

Artículo en Inglés | MEDLINE | ID: mdl-38015588

RESUMEN

BACKGROUND: Information overflow, a common problem in the present clinical environment, can be mitigated by summarizing clinical data. Although there are several solutions for clinical summarization, there is a lack of a complete overview of the research relevant to this field. OBJECTIVE: This study aims to identify state-of-the-art solutions for clinical summarization, to analyze their capabilities, and to identify their properties. METHODS: A scoping review of articles published between 2005 and 2022 was conducted. With a clinical focus, PubMed and Web of Science were queried to find an initial set of reports, later extended by articles found through a chain of citations. The included reports were analyzed to answer the questions of where, what, and how medical information is summarized; whether summarization conserves temporality, uncertainty, and medical pertinence; and how the propositions are evaluated and deployed. To answer how information is summarized, methods were compared through a new framework "collect-synthesize-communicate" referring to information gathering from data, its synthesis, and communication to the end user. RESULTS: Overall, 128 articles were included, representing various medical fields. Exclusively structured data were used as input in 46.1% (59/128) of papers, text in 41.4% (53/128) of articles, and both in 10.2% (13/128) of papers. Using the proposed framework, 42.2% (54/128) of the records contributed to information collection, 27.3% (35/128) contributed to information synthesis, and 46.1% (59/128) presented solutions for summary communication. Numerous summarization approaches have been presented, including extractive (n=13) and abstractive summarization (n=19); topic modeling (n=5); summary specification (n=11); concept and relation extraction (n=30); visual design considerations (n=59); and complete pipelines (n=7) using information extraction, synthesis, and communication. Graphical displays (n=53), short texts (n=41), static reports (n=7), and problem-oriented views (n=7) were the most common types in terms of summary communication. Although temporality and uncertainty information were usually not conserved in most studies (74/128, 57.8% and 113/128, 88.3%, respectively), some studies presented solutions to treat this information. Overall, 115 (89.8%) articles showed results of an evaluation, and methods included evaluations with human participants (median 15, IQR 24 participants): measurements in experiments with human participants (n=31), real situations (n=8), and usability studies (n=28). Methods without human involvement included intrinsic evaluation (n=24), performance on a proxy (n=10), or domain-specific tasks (n=11). Overall, 11 (8.6%) reports described a system deployed in clinical settings. CONCLUSIONS: The scientific literature contains many propositions for summarizing patient information but reports very few comparisons of these proposals. This work proposes to compare these algorithms through how they conserve essential aspects of clinical information and through the "collect-synthesize-communicate" framework. We found that current propositions usually address these 3 steps only partially. Moreover, they conserve and use temporality, uncertainty, and pertinent medical aspects to varying extents, and solutions are often preliminary.

7.

Caregivers Interactions with Clinical Autocomplete Tool: A Retrospective Study.

Zaghir, Jamil; Goldman, Jean-Philippe; Bjelogrlic, Mina; Gaudet-Blavignac, Christophe; Lovis, Christian.

Stud Health Technol Inform ; 295: 132-135, 2022 Jun 29.

Artículo en Inglés | MEDLINE | ID: mdl-35773825

RESUMEN

Hospital caregivers report patient data while being under constant pressure. These records include structured information, with some of them being derived from a restricted list of terms. Finding the right term from a large terminology can be time-consuming, harming the clinician's productivity. To deal with this hurdle, an autocomplete system is employed, providing the closest terms after a prefix is typed. While this software application clearly smoothens the term searching, this paper studies the influences of the tool on caregivers' reporting, inspecting the evolution of their typing conduct over time.

Asunto(s)

Cuidadores , Programas Informáticos , Hospitales , Humanos , Estudios Retrospectivos

8.

Performance of Machine Learning Methods to Classify French Medical Publications.

Zaghir, Jamil; Goldman, Jean-Philippe; Bjelogrlic, Mina; Keszthelyi, Daniel; Gaudet-Blavignac, Christophe; Turbé, Hugues; Lokaj, Belinda; Lovis, Christian.

Stud Health Technol Inform ; 294: 874-875, 2022 May 25.

Artículo en Inglés | MEDLINE | ID: mdl-35612232

RESUMEN

Many medical narratives are read by care professionals in their preferred language. These documents can be produced by organizations, authorities or national publishers. However, they are often hardly findable using the usual query engines based on English such as PubMed. This work explores the possibility to automatically categorize medical documents in French following an automatic Natural Language Processing pipeline. The pipeline is used to compare the performance of 6 different machine learning and deep neural network approaches on a large dataset of peer-reviewed weekly published Swiss medical journal in French covering major topics in medicine over the last 15 years. An accuracy of 96% was achieved for 5-topic classification and 81% for 20-topic classification.

Asunto(s)

Aprendizaje Automático , Procesamiento de Lenguaje Natural , Lenguaje , Redes Neurales de la Computación , PubMed

9.

Ambulatory Peritoneal Dialysis Analysis Framework.

Inacio, Nicolas; Lissorgues, Gaëlle; Ugon, Adrien; Bagnis, Corinne; Lovis, Christian; Bjelogrlic, Mina.

Stud Health Technol Inform ; 294: 959-960, 2022 May 25.

Artículo en Inglés | MEDLINE | ID: mdl-35612258

RESUMEN

This paper presents the design of an autonomous tracking device to enhance understanding of ambulatory peritoneal dialysis. The resulting tool aims to serve as a framework for research analysis and a decision support for treatment adjustments in peritoneal dialysis.

Asunto(s)

Fallo Renal Crónico , Diálisis Peritoneal , Instituciones de Atención Ambulatoria , Humanos , Fallo Renal Crónico/terapia

10.

A Lightweight and Interpretable Model to Classify Bundle Branch Blocks from ECG Signals.

Turbé, Hugues; Bjelogrlic, Mina; Namdar, Mehdi; Gaudet-Blavignac, Christophe; Zaghir, Jamil; Goldman, Jean-Philippe; Lokaj, Belinda; Lovis, Christian.

Stud Health Technol Inform ; 294: 43-47, 2022 May 25.

Artículo en Inglés | MEDLINE | ID: mdl-35612013

RESUMEN

Automatic classification of ECG signals has been a longtime research area with large progress having been made recently. However these advances have been achieved with increasingly complex models at the expense of model's interpretability. In this research, a new model based on multivariate autoregressive model (MAR) coefficients combined with a tree-based model to classify bundle branch blocks is proposed. The advantage of the presented approach is to build a lightweight model which combined with post-hoc interpretability can bring new insights into important cross-lead dependencies which are indicative of the diseases of interest.

Asunto(s)

Bloqueo de Rama , Electrocardiografía , Algoritmos , Bloqueo de Rama/diagnóstico , Humanos

11.

Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review.

Gaudet-Blavignac, Christophe; Foufi, Vasiliki; Bjelogrlic, Mina; Lovis, Christian.

J Med Internet Res ; 23(1): e24594, 2021 01 26.

Artículo en Inglés | MEDLINE | ID: mdl-33496673

RESUMEN

BACKGROUND: Interoperability and secondary use of data is a challenge in health care. Specifically, the reuse of clinical free text remains an unresolved problem. The Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) has become the universal language of health care and presents characteristics of a natural language. Its use to represent clinical free text could constitute a solution to improve interoperability. OBJECTIVE: Although the use of SNOMED and SNOMED CT has already been reviewed, its specific use in processing and representing unstructured data such as clinical free text has not. This review aims to better understand SNOMED CT's use for representing free text in medicine. METHODS: A scoping review was performed on the topic by searching MEDLINE, Embase, and Web of Science for publications featuring free-text processing and SNOMED CT. A recursive reference review was conducted to broaden the scope of research. The review covered the type of processed data, the targeted language, the goal of the terminology binding, the method used and, when appropriate, the specific software used. RESULTS: In total, 76 publications were selected for an extensive study. The language targeted by publications was 91% (n=69) English. The most frequent types of documents for which the terminology was used are complementary exam reports (n=18, 24%) and narrative notes (n=16, 21%). Mapping to SNOMED CT was the final goal of the research in 21% (n=16) of publications and a part of the final goal in 33% (n=25). The main objectives of mapping are information extraction (n=44, 39%), feature in a classification task (n=26, 23%), and data normalization (n=23, 20%). The method used was rule-based in 70% (n=53) of publications, hybrid in 11% (n=8), and machine learning in 5% (n=4). In total, 12 different software packages were used to map text to SNOMED CT concepts, the most frequent being Medtex, Mayo Clinic Vocabulary Server, and Medical Text Extraction Reasoning and Mapping System. Full terminology was used in 64% (n=49) of publications, whereas only a subset was used in 30% (n=23) of publications. Postcoordination was proposed in 17% (n=13) of publications, and only 5% (n=4) of publications specifically mentioned the use of the compositional grammar. CONCLUSIONS: SNOMED CT has been largely used to represent free-text data, most frequently with rule-based approaches, in English. However, currently, there is no easy solution for mapping free text to this terminology and to perform automatic postcoordination. Most solutions conceive SNOMED CT as a simple terminology rather than as a compositional bag of ontologies. Since 2012, the number of publications on this subject per year has decreased. However, the need for formal semantic representation of free text in health care is high, and automatic encoding into a compositional ontology could be a solution.

Asunto(s)

Procesamiento de Lenguaje Natural , Systematized Nomenclature of Medicine , Humanos

12.

Emerging Concepts and Applied Machine Learning Research in Patients with Drug-Induced Repolarization Disorders.

Bjelogrlic, Mina; Robert, Arnaud; Miribel, Arnaud; Namdar, Mehdi; Gencer, Baris; Lovis, Christian; Girardin, François.

Stud Health Technol Inform ; 270: 198-202, 2020 Jun 16.

Artículo en Inglés | MEDLINE | ID: mdl-32570374

RESUMEN

The paper presents a review of current research to develop predictive models for automated detection of drug-induced repolarization disorders and shows a feasibility study for developing machine learning tools trained on massive multimodal datasets of narrative, textual and electrocardiographic records. The goal is to reduce drug-induced long QT and associated complications (Torsades-de-Pointes, sudden cardiac death), by identifying prescription patterns with pro-arrhythmic propensity using a validated electronic application for the detection of adverse drug events with data mining and natural language processing; and to compute individual-based predictive scores in order to further identify clinical conditions, concomitant diseases, or other variables that correlate with higher risk of pro-arrhythmic situations.

Asunto(s)

Aprendizaje Automático , Muerte Súbita Cardíaca , Electrocardiografía , Humanos , Síndrome de QT Prolongado , Torsades de Pointes

13.

Evaluation of Document Retrieval Systems on a Medical Corpus in French: Indexation vs. Feature Learning.

Robert, Arnaud; Damachi, Francis; Bjelogrlic, Mina; Goldman, Jean-Philippe; Lovis, Christian.

Stud Health Technol Inform ; 270: 208-212, 2020 Jun 16.

Artículo en Inglés | MEDLINE | ID: mdl-32570376

RESUMEN

This paper presents five document retrieval systems for a small (few thousands) and domain specific corpora (weekly peer-reviewed medical journals published in French) as well as an evaluation methodology to quantify the models performance. The proposed methodology does not rely on external annotations and therefore can be used as an ad hoc evaluation procedure for most document retrieval tasks. Statistical models and vector space models are empirically compared on a synthetic document retrieval task. For our dataset size and specificities the statistical approaches consistently performed better than its vector space counterparts.

Asunto(s)

Almacenamiento y Recuperación de la Información/métodos , Lenguaje , Medical Subject Headings , Modelos Estadísticos , Procesamiento de Lenguaje Natural , Humanos

14.

Adaptive Time-Dependent Priors and Bayesian Inference to Evaluate SARS-CoV-2 Public Health Measures Validated on 31 Countries.

Turbé, Hugues; Bjelogrlic, Mina; Robert, Arnaud; Gaudet-Blavignac, Christophe; Goldman, Jean-Philippe; Lovis, Christian.

Front Public Health ; 8: 583401, 2020.

Artículo en Inglés | MEDLINE | ID: mdl-33553088

RESUMEN

With the rapid spread of the SARS-CoV-2 virus since the end of 2019, public health confinement measures to contain the propagation of the pandemic have been implemented. Our method to estimate the reproduction number using Bayesian inference with time-dependent priors enhances previous approaches by considering a dynamic prior continuously updated as restrictive measures and comportments within the society evolve. In addition, to allow direct comparison between reproduction number and introduction of public health measures in a specific country, the infection dates are inferred from daily confirmed cases and confirmed death. The evolution of this reproduction number in combination with the stringency index is analyzed on 31 European countries. We show that most countries required tough state interventions with a stringency index equal to 79.6 out of 100 to reduce their reproduction number below one and control the progression of the pandemic. In addition, we show a direct correlation between the time taken to introduce restrictive measures and the time required to contain the spread of the pandemic with a median time of 8 days. This analysis is validated by comparing the excess deaths and the time taken to implement restrictive measures. Our analysis reinforces the importance of having a fast response with a coherent and comprehensive set of confinement measures to control the pandemic. Only restrictions or combinations of those have shown to effectively control the pandemic.

Asunto(s)

Teorema de Bayes , COVID-19 , Salud Pública , SARS-CoV-2/aislamiento & purificación , Número Básico de Reproducción , COVID-19/epidemiología , COVID-19/mortalidad , Europa (Continente) , Humanos , Estudios Longitudinales

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA