Deep active learning with high structural discriminability for molecular mutagenicity prediction.
Commun Biol; 7(1): 1071, 2024 Aug 31.
Article in En | MEDLINE | ID: mdl-39217273
ABSTRACT
Assessing mutagenicity is essential in drug discovery, because mutagenic compounds may cause cancer and germ cell damage. Although in silico methods have been proposed for mutagenicity prediction, their performance is hindered by the scarcity of labeled molecules, and obtaining more labels through experimental mutagenicity testing can be time-consuming and costly. One way to reduce the annotation cost is active learning, in which the algorithm selects the most valuable molecules from a vast chemical space and presents them to an oracle (e.g., a human expert) for annotation, thereby rapidly improving the model's predictive performance at a lower annotation cost. In this paper, we propose muTOX-AL, a deep active learning framework that actively explores the chemical space and identifies the most valuable molecules, achieving competitive performance with a small number of labeled samples. The experimental results show that, compared with a random sampling strategy, muTOX-AL can reduce the number of training molecules by about 57%. Additionally, muTOX-AL exhibits outstanding molecular structural discriminability, allowing it to select molecules that are structurally highly similar but have opposite properties.
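The pool-based active learning loop described in the abstract can be sketched as follows. This is a minimal illustration, not the muTOX-AL implementation: the abstract does not state how molecules are scored, so the sketch assumes plain uncertainty sampling (querying the molecules whose predicted probability is closest to 0.5). All names here (`select_most_uncertain`, `active_learning_loop`, the toy one-dimensional "molecules", the threshold model, and `toy_oracle`) are hypothetical stand-ins for a real featurizer, deep model, and experimental assay.

```python
import math
import random


def select_most_uncertain(probs, k):
    """Return positions of the k predictions closest to 0.5 (assumed criterion)."""
    ranked = sorted(range(len(probs)), key=lambda i: abs(probs[i] - 0.5))
    return ranked[:k]


def active_learning_loop(pool, oracle, fit, predict_proba,
                         n_init, batch_size, n_rounds, seed=0):
    """Generic pool-based loop: label a random seed set, then query in batches."""
    rng = random.Random(seed)
    unlabeled = list(range(len(pool)))
    rng.shuffle(unlabeled)

    # Start from a small randomly chosen labeled set.
    labeled = {i: oracle(pool[i]) for i in unlabeled[:n_init]}
    unlabeled = unlabeled[n_init:]
    model = fit([pool[i] for i in labeled], list(labeled.values()))

    for _ in range(n_rounds):
        if not unlabeled:
            break
        # Score the remaining pool and send the most uncertain items to the oracle.
        probs = [predict_proba(model, pool[i]) for i in unlabeled]
        picked = {unlabeled[j] for j in select_most_uncertain(probs, batch_size)}
        for i in picked:
            labeled[i] = oracle(pool[i])
        unlabeled = [i for i in unlabeled if i not in picked]
        # Retrain on the enlarged labeled set.
        model = fit([pool[i] for i in labeled], list(labeled.values()))
    return model, labeled


# Toy stand-ins (hypothetical): each "molecule" is a single number in [0, 1).
def fit_threshold(xs, ys):
    """Fit a decision threshold halfway between the two class means."""
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    if not pos or not neg:
        return sum(xs) / len(xs)  # fallback when only one class is labeled
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2


def predict_threshold_proba(threshold, x):
    """Logistic score of the distance to the threshold."""
    return 1 / (1 + math.exp(-(x - threshold)))


def toy_oracle(x):
    """Pretend assay: values of 0.5 or more count as 'mutagenic' (label 1)."""
    return int(x >= 0.5)


toy_pool = [i / 20 for i in range(20)]
toy_model, toy_labels = active_learning_loop(
    toy_pool, toy_oracle, fit_threshold, predict_threshold_proba,
    n_init=4, batch_size=2, n_rounds=3,
)
```

Replacing `select_most_uncertain` with a uniformly random choice over the unlabeled pool gives the random sampling baseline that the abstract compares against; the reported reduction of about 57% in training molecules refers to the authors' own experiments, not to this toy setup.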
Full text: 1
Collection: 01-internacional
Database: MEDLINE
Main subject: Deep Learning / Mutagens
Limits: Humans
Language: En
Journal: Commun Biol
Year: 2024
Document type: Article
Affiliation country: China
Country of publication: United Kingdom