Evaluating the OpenAI's GPT-3.5 Turbo's performance in extracting information from scientific articles on diabetic retinopathy.

Gue, Celeste Ci Ying; Rahim, Noorul Dharajath Abdul; Rojas-Carabali, William; Agrawal, Rupesh; Rk, Palvannan; Abisheganaden, John; Yip, Wan Fen

Gue, Celeste Ci Ying; Rahim, Noorul Dharajath Abdul; Rojas-Carabali, William; Agrawal, Rupesh; Rk, Palvannan; Abisheganaden, John; Yip, Wan Fen.

Afiliación

Gue CCY; Health Services and Outcomes Research, National Healthcare Group, 3 Fusionopolis Link, #03-08, Nexus@One-North, Singapore, 138543, Singapore.
Rahim NDA; Health Services and Outcomes Research, National Healthcare Group, 3 Fusionopolis Link, #03-08, Nexus@One-North, Singapore, 138543, Singapore.
Rojas-Carabali W; National Healthcare Group Eye Institute, Tan Tock Seng Hospital, 11 Jalan Tan Tock Seng, Singapore, 308433, Singapore.
Agrawal R; Lee Kong Chian School of Medicine, Nanyang Technological University, 11 Mandalay Road, Singapore, 308232, Singapore.
Rk P; National Healthcare Group Eye Institute, Tan Tock Seng Hospital, 11 Jalan Tan Tock Seng, Singapore, 308433, Singapore.
Abisheganaden J; Lee Kong Chian School of Medicine, Nanyang Technological University, 11 Mandalay Road, Singapore, 308232, Singapore.
Yip WF; Health Services and Outcomes Research, National Healthcare Group, 3 Fusionopolis Link, #03-08, Nexus@One-North, Singapore, 138543, Singapore.

Syst Rev ; 13(1): 135, 2024 May 16.

Article en En | MEDLINE | ID: mdl-38755704

ABSTRACT

ABSTRACT

We aimed to compare the concordance of information extracted and the time taken between a large language model (OpenAI's GPT-3.5 Turbo via API) against conventional human extraction methods in retrieving information from scientific articles on diabetic retinopathy (DR). The extraction was done using GPT3.5 Turbo as of October 2023. OpenAI's GPT-3.5 Turbo significantly reduced the time taken for extraction. Concordance was highest at 100% for the extraction of the country of study, 64.7% for significant risk factors of DR, 47.1% for exclusion and inclusion criteria, and lastly 41.2% for odds ratio (OR) and 95% confidence interval (CI). The concordance levels seemed to indicate the complexity associated with each prompt. This suggests that OpenAI's GPT-3.5 Turbo may be adopted to extract simple information that is easily located in the text, leaving more complex information to be extracted by the researcher. It is crucial to note that the foundation model is constantly improving significantly with new versions being released quickly. Subsequent work can focus on retrieval-augmented generation (RAG), embedding, chunking PDF into useful sections, and prompting to improve the accuracy of extraction.

Asunto(s)

Retinopatía Diabética; Humanos; Almacenamiento y Recuperación de la Información/métodos; Procesamiento de Lenguaje Natural; Minería de Datos/métodos

Palabras clave

Concordance; GPT-3.5 Turbo; Information extraction

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Retinopatía Diabética Límite: Humans Idioma: En Revista: Syst Rev Año: 2024 Tipo del documento: Article País de afiliación: Singapur Pais de publicación: Reino Unido

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google