Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 453
Filtrar
1.
J Agric Food Chem ; 2024 Sep 11.
Artículo en Inglés | MEDLINE | ID: mdl-39258845

RESUMEN

In the realm of crop protection products, ensuring the safety of pollinators stands as a pivotal aspect of advancing sustainable solutions. Extensive research has been dedicated to this crucial topic as well as new approach methodologies in toxicity testing. Hence, within the agricultural and chemical industries, prioritizing pollinator safety remains a constant objective during the development of predictive tools. One of these tools includes computational models like quantitative structure-activity relationships (QSARs) that are valuable in predicting the toxicity of chemicals. This research uses bee toxicity data to develop artificial neural network classification models for predicting honey bee acute toxicity. Bee toxicity data from 1542 compounds were used to develop models; the sensitivity and specificity of the best model were 0.90 and 0.91, respectively. These in silico models can aid in the discovery of next-generation crop protection products. These tools can guide the screening and selection of next-generation crop protection molecules with high margins of safety to pollinators, and candidates with favorable sustainability profiles can be identified at the early discovery stage as precursors to in vivo data generation.

2.
Chem Pharm Bull (Tokyo) ; 72(9): 794-799, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-39218704

RESUMEN

Recently, remarkable progress has been achieved in artificial intelligence (AI), including machine learning. Various AI models have been proposed for drug discovery, including the design of small molecules, activity prediction, and three-dimensional (3D) structure prediction of proteins. AI consists of diverse elements, including information retrieval and machine learning, and can be used in a wide range of drug discovery scenarios. In this review, we focused on AI for small-molecule drug discovery with respect to molecular design, activity prediction, and prediction of the binding poses of compounds to target molecules. We also discussed the applications of AI in academic drug discovery.


Asunto(s)
Inteligencia Artificial , Quimioinformática , Descubrimiento de Drogas , Humanos , Aprendizaje Automático , Bibliotecas de Moléculas Pequeñas/química
3.
ArXiv ; 2024 Aug 30.
Artículo en Inglés | MEDLINE | ID: mdl-39253636

RESUMEN

Researchers in biomedical research, public health and the life sciences often spend weeks or months discovering, accessing, curating, and integrating data from disparate sources, significantly delaying the onset of actual analysis and innovation. Instead of countless developers creating redundant and inconsistent data pipelines, BioBricks.ai offers a centralized data repository and a suite of developer-friendly tools to simplify access to scientific data. Currently, BioBricks.ai delivers over ninety biological and chemical datasets. It provides a package manager-like system for installing and managing dependencies on data sources. Each 'brick' is a Data Version Control git repository that supports an updateable pipeline for extraction, transformation, and loading data into the BioBricks.ai backend at https://biobricks.ai. Use cases include accelerating data science workflows and facilitating the creation of novel data assets by integrating multiple datasets into unified, harmonized resources. In conclusion, BioBricks.ai offers an opportunity to accelerate access and use of public data through a single open platform.

4.
Molecules ; 29(15)2024 Aug 01.
Artículo en Inglés | MEDLINE | ID: mdl-39125052

RESUMEN

Marine natural products (MNPs) continue to be tested primarily in cellular toxicity assays, both mammalian and microbial, despite most being inactive at concentrations relevant to drug discovery. These MNPs become missed opportunities and represent a wasteful use of precious bioresources. The use of cheminformatics aligned with published bioactivity data can provide insights to direct the choice of bioassays for the evaluation of new MNPs. Cheminformatics analysis of MNPs found in MarinLit (n = 39,730) up to the end of 2023 highlighted indol-3-yl-glyoxylamides (IGAs, n = 24) as a group of MNPs with no reported bioactivities. However, a recent review of synthetic IGAs highlighted these scaffolds as privileged structures with several compounds under clinical evaluation. Herein, we report the synthesis of a library of 32 MNP-inspired brominated IGAs (25-56) using a simple one-pot, multistep method affording access to these diverse chemical scaffolds. Directed by a meta-analysis of the biological activities reported for marine indole alkaloids (MIAs) and synthetic IGAs, the brominated IGAs 25-56 were examined for their potential bioactivities against the Parkinson's Disease amyloid protein alpha synuclein (α-syn), antiplasmodial activities against chloroquine-resistant (3D7) and sensitive (Dd2) parasite strains of Plasmodium falciparum, and inhibition of mammalian (chymotrypsin and elastase) and viral (SARS-CoV-2 3CLpro) proteases. All of the synthetic IGAs tested exhibited binding affinity to the amyloid protein α-syn, while some showed inhibitory activities against P. falciparum, and the proteases, SARS-CoV-2 3CLpro, and chymotrypsin. The cellular safety of the IGAs was examined against cancerous and non-cancerous human cell lines, with all of the compounds tested inactive, thereby validating cheminformatics and meta-analyses results. The findings presented herein expand our knowledge of marine IGA bioactive chemical space and advocate expanding the scope of biological assays routinely used to investigate NP bioactivities, specifically those more suitable for non-toxic compounds. By integrating cheminformatics tools and functional assays into NP biological testing workflows, we can aim to enhance the potential of NPs and their scaffolds for future drug discovery and development.


Asunto(s)
Productos Biológicos , Quimioinformática , Descubrimiento de Drogas , Productos Biológicos/química , Productos Biológicos/farmacología , Humanos , Quimioinformática/métodos , SARS-CoV-2/efectos de los fármacos , Organismos Acuáticos/química , Indoles/química , Indoles/farmacología , Plasmodium falciparum/efectos de los fármacos , Alcaloides Indólicos/farmacología , Alcaloides Indólicos/química , Animales
5.
Comput Biol Med ; 180: 108954, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39094327

RESUMEN

Indoleamine 2,3-dioxygenase (IDO) and tryptophan 2,3-dioxygenase (TDO) are attractive drug targets for cancer immunotherapy. After disappointing results of the epacadostat as a selective IDO inhibitor in phase III clinical trials, there is much interest in the development of the TDO selective inhibitors. In the current study, several data analysis methods and machine learning approaches including logistic regression, Random Forest, XGBoost and Support Vector Machines were used to model a data set of compounds retrieved from ChEMBL. Models based on the Morgan fingerprints revealed notable fragments for the selective inhibition of the IDO, TDO or both. Multiple fragment docking was performed to find the best set of bound fragments and their orientation in the space for efficient linking. Linking the fragments and optimization of the final molecules were accomplished by means of an artificial intelligence generative framework. Finally, selectivity of the optimized molecules was assessed and the top 4 lead molecules were filtered through PAINS, Brenk and NIH filters. Results indicated that phenyloxalamide, fluoroquinoline, and 3-bromo-4-fluroaniline confer selectivity towards the IDO inhibition. Correspondingly, 1-benzyl-1H-naphtho[2,3-d][1,2,3]triazole-4,9-dione was found to be an integral fragment for the selective inhibition of the TDO by constituting a coordination bond with the Fe atom of heme. In addition, furo[2,3-c]pyridine-2,3-diamine was found as a common fragment for inhibition of the both targets and can be used in the design of the dual target inhibitors of the IDO and TDO. The new fragments introduced here can be a useful building blocks for incorporation into the selective TDO or dual IDO/TDO inhibitors.


Asunto(s)
Quimioinformática , Inhibidores Enzimáticos , Indolamina-Pirrol 2,3,-Dioxigenasa , Aprendizaje Automático , Triptófano Oxigenasa , Indolamina-Pirrol 2,3,-Dioxigenasa/antagonistas & inhibidores , Indolamina-Pirrol 2,3,-Dioxigenasa/química , Indolamina-Pirrol 2,3,-Dioxigenasa/metabolismo , Triptófano Oxigenasa/antagonistas & inhibidores , Triptófano Oxigenasa/metabolismo , Triptófano Oxigenasa/química , Humanos , Quimioinformática/métodos , Inhibidores Enzimáticos/química , Simulación del Acoplamiento Molecular
6.
Chem ; 10(7): 2074-2088, 2024 Jul 11.
Artículo en Inglés | MEDLINE | ID: mdl-39006239

RESUMEN

Circular dichroism (CD) based enantiomeric excess (ee) determination assays are optical alternatives to chromatographic ee determination in high-throughput screening (HTS) applications. However, the implementation of these assays requires calibration experiments using enantioenriched materials. We present a data-driven approach that circumvents the need for chiral resolution and calibration experiments for an octahedral Fe(II) complex (1) used for the ee determination of α-chiral primary amines. By computationally parameterizing the imine ligands formed in the assay conditions, a model of the circular dichroism (CD) response of the Fe(II) assembly was developed. Using this model, calibration curves were generated for four analytes and compared to experimentally generated curves. In a single-blind ee determination study, the ee values of unknown samples were determined within 9% mean absolute error, which rivals the error using experimentally generated calibration curves.

7.
Comput Biol Med ; 179: 108898, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39047503

RESUMEN

Cannabidiol has been reported to interact with broad-spectrum biological targets with pleiotropic pharmacology including epilepsy although a cohesive mechanism is yet to be determined. Even though some studies propose that cannabidiol may manipulate glutamatergic signals, there is insufficient evidence to support cannabidiol direct effect on glutamate signaling, which is important in intervening epilepsy. Therefore, the present study aimed to analyze the epilepsy-related targets for cannabidiol, assess the differentially expressed genes with its treatment, and identify the possible glutamatergic signaling target. In this study, the epileptic protein targets of cannabidiol were identified using the Tanimoto coefficient and similarity index-based targets fishing which were later overlapped with the altered expression, epileptic biomarkers, and genetically altered proteins in epilepsy. The common proteins were then screened for possible glutamatergic signaling targets with differentially expressed genes. Later, molecular docking and simulation were performed using AutoDock Vina and GROMACS to evaluate binding affinity, ligand-protein stability, hydrophilic interaction, protein compactness, etc. Cannabidiol identified 30 different epilepsy-related targets of multiple protein classes including G-protein coupled receptors, enzymes, ion channels, etc. Glutamate receptor 2 was identified to be genetically varied in epilepsy which was targeted by cannabidiol and its expression was increased with its treatment. More importantly, cannabidiol showed a direct binding affinity with Glutamate receptor 2 forming a stable hydrophilic interaction and comparatively lower root mean squared deviation and residual fluctuations, increasing protein compactness with broad conformational changes. Based on the cheminformatic target fishing, evaluation of differentially expressed genes, molecular docking, and simulations, it can be hypothesized that cannabidiol may possess glutamate receptor 2-mediated anti-epileptic activities.


Asunto(s)
Cannabidiol , Epilepsia , Ácido Glutámico , Simulación del Acoplamiento Molecular , Transducción de Señal , Cannabidiol/farmacología , Cannabidiol/metabolismo , Epilepsia/tratamiento farmacológico , Epilepsia/metabolismo , Epilepsia/genética , Humanos , Transducción de Señal/efectos de los fármacos , Ácido Glutámico/metabolismo , Anticonvulsivantes/química , Anticonvulsivantes/uso terapéutico , Anticonvulsivantes/farmacología
8.
Molecules ; 29(12)2024 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-38930883

RESUMEN

Intracellular tau fibrils are sources of neurotoxicity and oxidative stress in Alzheimer's. Current drug discovery efforts have focused on molecules with tau fibril disaggregation and antioxidation functions. However, recent studies suggest that membrane-bound tau-containing oligomers (mTCOs), smaller and less ordered than tau fibrils, are neurotoxic in the early stage of Alzheimer's. Whether tau fibril-targeting molecules are effective against mTCOs is unknown. The binding of epigallocatechin-3-gallate (EGCG), CNS-11, and BHT-CNS-11 to in silico mTCOs and experimental tau fibrils was investigated using machine learning-enhanced docking and molecular dynamics simulations. EGCG and CNS-11 have tau fibril disaggregation functions, while the proposed BHT-CNS-11 has potential tau fibril disaggregation and antioxidation functions like EGCG. Our results suggest that the three molecules studied may also bind to mTCOs. The predicted binding probability of EGCG to mTCOs increases with the protein aggregate size. In contrast, the predicted probability of CNS-11 and BHT-CNS-11 binding to the dimeric mTCOs is higher than binding to the tetrameric mTCOs for the homo tau but not for the hetero tau-amylin oligomers. Our results also support the idea that anionic lipids may promote the binding of molecules to mTCOs. We conclude that tau fibril-disaggregating and antioxidating molecules may bind to mTCOs, and that mTCOs may also be useful targets for Alzheimer's drug design.


Asunto(s)
Antioxidantes , Aprendizaje Automático , Simulación del Acoplamiento Molecular , Simulación de Dinámica Molecular , Unión Proteica , Proteínas tau , Proteínas tau/metabolismo , Proteínas tau/química , Humanos , Antioxidantes/química , Antioxidantes/farmacología , Amiloide/química , Amiloide/metabolismo , Catequina/análogos & derivados , Catequina/química , Catequina/metabolismo , Catequina/farmacología , Agregado de Proteínas
9.
Molecules ; 29(12)2024 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-38930871

RESUMEN

Synthetic efforts toward complex natural product (NP) scaffolds are useful ones, particularly those aimed at expanding their bioactive chemical space. Here, we utilised an orthogonal cheminformatics-based approach to predict the potential biological activities for a series of synthetic bis-indole alkaloids inspired by elusive sponge-derived NPs, echinosulfone A (1) and echinosulfonic acids A-D (2-5). Our work includes the first synthesis of desulfato-echinosulfonic acid C, an α-hydroxy bis(3'-indolyl) alkaloid (17), and its full NMR characterisation. This synthesis provides corroborating evidence for the structure revision of echinosulfonic acids A-C. Additionally, we demonstrate a robust synthetic strategy toward a diverse range of α-methine bis(3'-indolyl) acids and acetates (11-16) without the need for silica-based purification in either one or two steps. By integrating our synthetic library of bis-indoles with bioactivity data for 2048 marine indole alkaloids (reported up to the end of 2021), we analyzed their overlap with marine natural product chemical diversity. Notably, the C-6 dibrominated α-hydroxy bis(3'-indolyl) and α-methine bis(3'-indolyl) analogues (11, 14, and 17) were found to contain significant overlap with antibacterial C-6 dibrominated marine bis-indoles, guiding our biological evaluation. Validating the results of our cheminformatics analyses, the dibrominated α-methine bis(3'-indolyl) alkaloids (11, 12, 14, and 15) were found to exhibit antibacterial activities against methicillin-sensitive and -resistant Staphylococcus aureus. Further, while investigating other synthetic approaches toward bis-indole alkaloids, 16 incorrectly assigned synthetic α-hydroxy bis(3'-indolyl) alkaloids were identified. After careful analysis of their reported NMR data, and comparison with those obtained for the synthetic bis-indoles reported herein, all of the structures have been revised to α-methine bis(3'-indolyl) alkaloids.


Asunto(s)
Antibacterianos , Quimioinformática , Alcaloides Indólicos , Antibacterianos/farmacología , Antibacterianos/química , Antibacterianos/síntesis química , Alcaloides Indólicos/química , Alcaloides Indólicos/farmacología , Alcaloides Indólicos/síntesis química , Quimioinformática/métodos , Pruebas de Sensibilidad Microbiana , Estructura Molecular , Relación Estructura-Actividad , Productos Biológicos/química , Productos Biológicos/farmacología , Productos Biológicos/síntesis química
10.
J Comput Biol ; 31(6): 498-512, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38758924

RESUMEN

Information on the structure of molecules, retrieved via biochemical databases, plays a pivotal role in various disciplines, including metabolomics, systems biology, and drug discovery. No such database can be complete and it is often necessary to incorporate data from several sources. However, the molecular structure for a given compound is not necessarily consistent between databases. This article presents StructRecon, a novel tool for resolving unique molecular structures from database identifiers. Currently, identifiers from BiGG, ChEBI, Escherichia coli Metabolome Database (ECMDB), MetaNetX, and PubChem are supported. StructRecon traverses the cross-links between entries in different databases to construct what we call identifier graphs. The goal of these graphs is to offer a more complete view of the total information available on a given compound across all the supported databases. To reconcile discrepancies met during the traversal of the databases, we develop an extensible model for molecular structure supporting multiple independent levels of detail, which allows standardization of the structure to be applied iteratively. In some cases, our standardization approach results in multiple candidate structures for a given compound, in which case a random walk-based algorithm is used to select the most likely structure among incompatible alternatives. As a case study, we applied StructRecon to the EColiCore2 model. We found at least one structure for 98.66% of its compounds, which is more than twice as many as possible when using the databases in more standard ways not considering the complex network of cross-database references captured by our identifier graphs. StructRecon is open-source and modular, which enables support for more databases in the future.


Asunto(s)
Escherichia coli , Escherichia coli/genética , Bases de Datos Factuales , Programas Informáticos , Estructura Molecular , Algoritmos , Metabolómica/métodos , Bases de Datos de Compuestos Químicos , Biología Computacional/métodos , Metaboloma
11.
Comput Struct Biotechnol J ; 23: 2116-2121, 2024 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38808129

RESUMEN

De novo drug design aims to rationally discover novel and potent compounds while reducing experimental costs during the drug development stage. Despite the numerous generative models that have been developed, few successful cases of drug design utilizing generative models have been reported. One of the most common challenges is designing compounds that are not synthesizable or realistic. Therefore, methods capable of accurately assessing the chemical structures proposed by generative models for drug design are needed. In this study, we present AnoChem, a computational framework based on deep learning designed to assess the likelihood of a generated molecule being real. AnoChem achieves an area under the receiver operating characteristic curve score of 0.900 for distinguishing between real and generated molecules. We utilized AnoChem to evaluate and compare the performances of several generative models, using other metrics, namely SAscore and Fréschet ChemNet distance (FCD). AnoChem demonstrates a strong correlation with these metrics, validating its effectiveness as a reliable tool for assessing generative models. The source code for AnoChem is available at https://github.com/CSB-L/AnoChem.

12.
Chem Biol Drug Des ; 103(5): e14530, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38725091

RESUMEN

Feline immunodeficiency virus (FIV) is a common infection found in domesticated and wild cats worldwide. Despite the wealth of therapeutic understanding of the disease in humans, considerably less information exists regarding the treatment of the disease in felines. Current treatment relies on drugs developed for the related human immunodeficiency virus (HIV) and includes compounds of the popular non-nucleotide reverse transcriptase (NNRTI) class. This is despite FIV-RT being only 67% similar to HIV-1 RT at the enzyme level, increasing to 88% for the allosteric pocket targeted by NNRTIs. The goal of this project was to try to quantify how well the more extensive pharmacological knowledge available for human disease translates to felines. To this end we screened known NNRTIs and 10 diverse pyrimidine analogs identified virtually. We use this chemo-centric probe approach to (a) assess the similarity between the two related RT targets based on the observed experimental inhibition values, (b) try to identify more potent inhibitors at FIV, and (c) gain a better appreciation of the structure-activity relationships (SAR). We found the correlation between IC50s at the two targets to be strong (r2 = 0.87) and identified compound 1 as the most potent inhibitor of FIV with IC50 of 0.030 µM ± 0.009. This compared to FIV IC50 values of 0.22 ± 0.17 µM, 0.040 ± 0.010 µM and >160 µM for known anti HIV-1 RT drugs Efavirenz, Rilpivirine, and Nevirapine, respectively. This knowledge, along with an understanding of the structural origin that give rise to any differences could improve the way HIV drugs are repurposed for FIV.


Asunto(s)
Transcriptasa Inversa del VIH , Virus de la Inmunodeficiencia Felina , Inhibidores de la Transcriptasa Inversa , Animales , Inhibidores de la Transcriptasa Inversa/farmacología , Inhibidores de la Transcriptasa Inversa/química , Gatos , Virus de la Inmunodeficiencia Felina/efectos de los fármacos , Transcriptasa Inversa del VIH/antagonistas & inhibidores , Transcriptasa Inversa del VIH/metabolismo , Humanos , Relación Estructura-Actividad , Pirimidinas/química , Pirimidinas/farmacología , Alquinos/química , Alquinos/farmacología , VIH-1/efectos de los fármacos , VIH-1/enzimología , Ciclopropanos/farmacología , Ciclopropanos/química , Simulación del Acoplamiento Molecular , Benzoxazinas/química , Benzoxazinas/farmacología
13.
J Chem Inf Model ; 64(11): 4392-4409, 2024 Jun 10.
Artículo en Inglés | MEDLINE | ID: mdl-38815246

RESUMEN

By accelerating time-consuming processes with high efficiency, computing has become an essential part of many modern chemical pipelines. Machine learning is a class of computing methods that can discover patterns within chemical data and utilize this knowledge for a wide variety of downstream tasks, such as property prediction or substance generation. The complex and diverse chemical space requires complex machine learning architectures with great learning power. Recently, learning models based on transformer architectures have revolutionized multiple domains of machine learning, including natural language processing and computer vision. Naturally, there have been ongoing endeavors in adopting these techniques to the chemical domain, resulting in a surge of publications within a short period. The diversity of chemical structures, use cases, and learning models necessitate a comprehensive summarization of existing works. In this paper, we review recent innovations in adapting transformers to solve learning problems in chemistry. Because chemical data is diverse and complex, we structure our discussion based on chemical representations. Specifically, we highlight the strengths and weaknesses of each representation, the current progress of adapting transformer architectures, and future directions.


Asunto(s)
Quimioinformática , Aprendizaje Automático , Quimioinformática/métodos
14.
Food Chem ; 454: 139794, 2024 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-38797094

RESUMEN

Sweet potatoes are rich in cardioprotective phytochemicals with potential anti-platelet aggregation activity, although this benefit may vary among cultivars/genotypes. The phenolic profile [HPLC-ESI(-)-qTOF-MS2], cheminformatics (ADMET properties, affinity toward platelet proteins) and anti-PA activity of phenolic-rich hydroalcoholic extracts obtained from orange (OSP) and purple (PSP) sweet potato storage roots, was evaluated. The phenolic richness [Hydroxycinnamic acids> flavonoids> benzoic acids] was PSP > OSP. Their main chlorogenic acids could interact with platelet proteins (integrins/adhesins, kinases/metalloenzymes) but their bioavailability could be poor. Just OSP exhibited a dose-dependent anti-platelet aggregation activity [inductor (IC50, mg.ml-1): thrombin receptor activator peptide-6 (0.55) > Adenosine-5'-diphosphate (1.02) > collagen (1.56)] and reduced P-selectin expression (0.75-1.0 mg.ml-1) but not glycoprotein IIb/IIIa secretion. The explored anti-PA activity of OSP/PSP seems to be inversely related to their phenolic richness. The poor first-pass bioavailability of its chlorogenic acids (documented in silico) may represent a further obstacle for their anti-PA in vivo.


Asunto(s)
Ipomoea batatas , Fenoles , Extractos Vegetales , Raíces de Plantas , Inhibidores de Agregación Plaquetaria , Agregación Plaquetaria , Ipomoea batatas/química , Fenoles/química , Agregación Plaquetaria/efectos de los fármacos , Extractos Vegetales/química , Extractos Vegetales/farmacología , Inhibidores de Agregación Plaquetaria/química , Inhibidores de Agregación Plaquetaria/farmacología , Raíces de Plantas/química , Humanos , Quimioinformática , Animales , Plaquetas/metabolismo , Plaquetas/efectos de los fármacos
15.
J Cheminform ; 16(1): 48, 2024 Apr 29.
Artículo en Inglés | MEDLINE | ID: mdl-38685101

RESUMEN

Previous studies have shown that the three-dimensional (3D) geometric and electronic structure of molecules play a crucial role in determining their key properties and intermolecular interactions. Therefore, it is necessary to establish a quantum chemical (QC) property database containing the most stable 3D geometric conformations and electronic structures of molecules. In this study, a high-quality QC property database, called QuanDB, was developed, which included structurally diverse molecular entities and featured a user-friendly interface. Currently, QuanDB contains 154,610 compounds sourced from public databases and scientific literature, with 10,125 scaffolds. The elemental composition comprises nine elements: H, C, O, N, P, S, F, Cl, and Br. For each molecule, QuanDB provides 53 global and 5 local QC properties and the most stable 3D conformation. These properties are divided into three categories: geometric structure, electronic structure, and thermodynamics. Geometric structure optimization and single point energy calculation at the theoretical level of B3LYP-D3(BJ)/6-311G(d)/SMD/water and B3LYP-D3(BJ)/def2-TZVP/SMD/water, respectively, were applied to ensure highly accurate calculations of QC properties, with the computational cost exceeding 107 core-hours. QuanDB provides high-value geometric and electronic structure information for use in molecular representation models, which are critical for machine-learning-based molecular design, thereby contributing to a comprehensive description of the chemical compound space. As a new high-quality dataset for QC properties, QuanDB is expected to become a benchmark tool for the training and optimization of machine learning models, thus further advancing the development of novel drugs and materials. QuanDB is freely available, without registration, at https://quandb.cmdrg.com/ .

16.
Adv Inf Retr ; 14609: 34-49, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38585224

RESUMEN

Nearest neighbor-based similarity searching is a common task in chemistry, with notable use cases in drug discovery. Yet, some of the most commonly used approaches for this task still leverage a brute-force approach. In practice this can be computationally costly and overly time-consuming, due in part to the sheer size of modern chemical databases. Previous computational advancements for this task have generally relied on improvements to hardware or dataset-specific tricks that lack generalizability. Approaches that leverage lower-complexity searching algorithms remain relatively underexplored. However, many of these algorithms are approximate solutions and/or struggle with typical high-dimensional chemical embeddings. Here we evaluate whether a combination of low-dimensional chemical embeddings and a k-d tree data structure can achieve fast nearest neighbor queries while maintaining performance on standard chemical similarity search benchmarks. We examine different dimensionality reductions of standard chemical embeddings as well as a learned, structurally-aware embedding-SmallSA-for this task. With this framework, searches on over one billion chemicals execute in less than a second on a single CPU core, five orders of magnitude faster than the brute-force approach. We also demonstrate that SmallSA achieves competitive performance on chemical similarity benchmarks.

17.
Eur J Med Chem ; 270: 116362, 2024 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-38574637

RESUMEN

Antimicrobial resistance (AMR) represents one of the most challenging global Public Health issues, with an alarmingly increasing rate of attributable mortality. This scenario highlights the urgent need for innovative medicinal strategies showing activity on resistant isolates (especially, carbapenem-resistant Gram-negative bacteria, methicillin-resistant S. aureus, and vancomycin-resistant enterococci) yielding new approaches for the treatment of bacterial infections. We previously reported AlkylGuanidino Ureas (AGUs) with broad-spectrum antibacterial activity and a putative membrane-based mechanism of action. Herein, new tetra- and mono-guanidino derivatives were designed and synthesized to expand the structure-activity relationships (SARs) and, thereby, tested on the same panel of Gram-positive and Gram-negative bacteria. The membrane-active mechanism of selected compounds was then investigated through molecular dynamics (MD) on simulated bacterial membranes. In the end, the newly synthesized series, along with the whole library of compounds (more than 70) developed in the last decade, was tested in combination with subinhibitory concentrations of the last resort antibiotic colistin to assess putative synergistic or additive effects. Moreover, all the AGUs were subjected to cheminformatic and machine learning analyses to gain a deeper knowledge of the key features required for bioactivity.


Asunto(s)
Antibacterianos , Staphylococcus aureus Resistente a Meticilina , Antibacterianos/farmacología , Colistina/farmacología , Bacterias Gramnegativas , Bacterias Grampositivas , Bacterias , Análisis de Datos , Pruebas de Sensibilidad Microbiana
18.
Int J Mol Sci ; 25(8)2024 Apr 13.
Artículo en Inglés | MEDLINE | ID: mdl-38673888

RESUMEN

Urease, a pivotal enzyme in nitrogen metabolism, plays a crucial role in various microorganisms, including the pathogenic Helicobacter pylori. Inhibiting urease activity offers a promising approach to combating infections and associated ailments, such as chronic kidney diseases and gastric cancer. However, identifying potent urease inhibitors remains challenging due to resistance issues that hinder traditional approaches. Recently, machine learning (ML)-based models have demonstrated the ability to predict the bioactivity of molecules rapidly and effectively. In this study, we present ML models designed to predict urease inhibitors by leveraging essential physicochemical properties. The methodological approach involved constructing a dataset of urease inhibitors through an extensive literature search. Subsequently, these inhibitors were characterized based on physicochemical properties calculations. An exploratory data analysis was then conducted to identify and analyze critical features. Ultimately, 252 classification models were trained, utilizing a combination of seven ML algorithms, three attribute selection methods, and six different strategies for categorizing inhibitory activity. The investigation unveiled discernible trends distinguishing urease inhibitors from non-inhibitors. This differentiation enabled the identification of essential features that are crucial for precise classification. Through a comprehensive comparison of ML algorithms, tree-based methods like random forest, decision tree, and XGBoost exhibited superior performance. Additionally, incorporating the "chemical family type" attribute significantly enhanced model accuracy. Strategies involving a gray-zone categorization demonstrated marked improvements in predictive precision. This research underscores the transformative potential of ML in predicting urease inhibitors. The meticulous methodology outlined herein offers actionable insights for developing robust predictive models within biochemical systems.


Asunto(s)
Inhibidores Enzimáticos , Aprendizaje Automático , Ureasa , Ureasa/antagonistas & inhibidores , Ureasa/química , Ureasa/metabolismo , Inhibidores Enzimáticos/química , Inhibidores Enzimáticos/farmacología , Helicobacter pylori/enzimología , Helicobacter pylori/efectos de los fármacos , Algoritmos , Humanos
19.
J Hazard Mater ; 471: 134436, 2024 Jun 05.
Artículo en Inglés | MEDLINE | ID: mdl-38688221

RESUMEN

Membrane distillation (MD) has received ample recognition for treating complex wastewater, including hypersaline oil and gas (O&G) produced water (PW). Rigorous water quality assessment is critical in evaluating PW treatment because PW consists of numerous contaminants beyond the targets listed in general discharge and reuse standards. This study evaluated a novel photocatalytic membrane distillation (PMD) process, with and without a UV light source, against a standard vacuum membrane distillation (VMD) process for treating PW, utilizing targeted analyses and a non-targeted chemical identification workflow coupled with toxicity predictions. PMD with UV light resulted in better removals of dissolved organic carbon, ammoniacal nitrogen, and conductivity. Targeted organic analyses identified only trace amounts of acetone and 2-butanone in distillates. According to non-targeted analysis, the number of suspects reduced from 65 in feed to 25-30 across all distillate samples. Certain physicochemical properties of compounds influenced contaminant rejection in different MD configurations. According to preliminary toxicity predictions, VMD, PMD with and without UV distillate samples, respectively contained 21, 22, and 23 suspects associated with critical toxicity concerns. Overall, non-targeted analysis together with toxicity prediction provides a competent supportive tool to assess treatment efficiency and potential impacts on public health and the environment during PW reuse.

20.
Molecules ; 29(8)2024 Apr 17.
Artículo en Inglés | MEDLINE | ID: mdl-38675645

RESUMEN

In the realm of predictive toxicology for small molecules, the applicability domain of QSAR models is often limited by the coverage of the chemical space in the training set. Consequently, classical models fail to provide reliable predictions for wide classes of molecules. However, the emergence of innovative data collection methods such as intensive hackathons have promise to quickly expand the available chemical space for model construction. Combined with algorithmic refinement methods, these tools can address the challenges of toxicity prediction, enhancing both the robustness and applicability of the corresponding models. This study aimed to investigate the roles of gradient boosting and strategic data aggregation in enhancing the predictivity ability of models for the toxicity of small organic molecules. We focused on evaluating the impact of incorporating fragment features and expanding the chemical space, facilitated by a comprehensive dataset procured in an open hackathon. We used gradient boosting techniques, accounting for critical features such as the structural fragments or functional groups often associated with manifestations of toxicity.


Asunto(s)
Algoritmos , Relación Estructura-Actividad Cuantitativa , Toxicología/métodos , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA