Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
Front Big Data ; 5: 965619, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36338335

RESUMEN

Here we study the semantic search and retrieval problem in biomedical digital libraries. First, we introduce MedGraph, a knowledge graph embedding-based method that provides semantic relevance retrieval and ranking for the biomedical literature indexed in PubMed. Second, we evaluate our approach using PubMed's Best Match algorithm. Moreover, we compare our method MedGraph to a traditional TF-IDF-based algorithm. Third, we use a dataset extracted from PubMed, including 30 million articles' metadata such as abstracts, author information, citation information, and extracted biological entity mentions. We pull a subset of the dataset to evaluate MedGraph using predefined queries with ground truth ranked results. To our knowledge, this technique has not been explored before in biomedical information retrieval. In addition, our results provide some evidence that semantic approaches to search and relevance in biomedical digital libraries that rely on knowledge graph modeling offer better search relevance results when compared with traditional methods in terms of objective metrics.

2.
Scientometrics ; 126(5): 4491-4509, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33746309

RESUMEN

COVID-19 cases have surpassed the 109 + million markers, with deaths tallying up to 2.4 million. Tens of thousands of papers regarding COVID-19 have been published along with countless bibliometric analyses done on COVID-19 literature. Despite this, none of the analyses have focused on domain entities occurring in scientific publications. However, analysis of these bio-entities and the relations among them, a strategy called entity metrics, could offer more insights into knowledge usage and diffusion in specific cases. Thus, this paper presents an entitymetric analysis on COVID-19 literature. We construct an entity-entity co-occurrence network and employ network indicators to analyze the extracted entities. We find that ACE-2 and C-reactive protein are two very important genes and that lopinavir and ritonavir are two very important chemicals, regardless of the results from either ranking.

3.
Sci Data ; 7(1): 205, 2020 06 26.
Artículo en Inglés | MEDLINE | ID: mdl-32591513

RESUMEN

PubMed® is an essential resource for the medical domain, but useful concepts are either difficult to extract or are ambiguous, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of Health (NIH) ExPORTER, collecting affiliation history and educational background of authors from ORCID®, and identifying fine-grained affiliation data from MapAffil. Through the integration of these credible multi-source data, we could create connections among the bio-entities, authors, articles, affiliations, and funding. Data validation revealed that the BioBERT deep learning method of bio-entity extraction significantly outperformed the state-of-the-art models based on the F1 score (by 0.51%), with the author name disambiguation (AND) achieving an F1 score of 98.09%. PKG can trigger broader innovations, not only enabling us to measure scholarly impact, knowledge usage, and knowledge transfer, but also assisting us in profiling authors and organizations based on their connections with bio-entities.


Asunto(s)
Autoria , Bases del Conocimiento , PubMed , Aprendizaje Profundo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA