Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation.

Cenci, Francesca; Diab, Samir; Ferrini, Paola; Harabajiu, Catajina; Barolo, Massimiliano; Bezzo, Fabrizio; Facco, Pierantonio

Cenci, Francesca; Diab, Samir; Ferrini, Paola; Harabajiu, Catajina; Barolo, Massimiliano; Bezzo, Fabrizio; Facco, Pierantonio.

Afiliación

Cenci F; GSK, Park Road, Ware SG12 0DP, United Kingdom; CAPE-Lab - Computer-Aided Process Engineering Laboratory, Department of Industrial Engineering, University of Padova, via Marzolo 9, 35131 Padova, Italy.
Diab S; GSK, Park Road, Ware SG12 0DP, United Kingdom.
Ferrini P; GSK, Gunnels Wood Road, Stevenage SG1 2NY, United Kingdom.
Harabajiu C; GSK, Gunnels Wood Road, Stevenage SG1 2NY, United Kingdom.
Barolo M; CAPE-Lab - Computer-Aided Process Engineering Laboratory, Department of Industrial Engineering, University of Padova, via Marzolo 9, 35131 Padova, Italy.
Bezzo F; CAPE-Lab - Computer-Aided Process Engineering Laboratory, Department of Industrial Engineering, University of Padova, via Marzolo 9, 35131 Padova, Italy.
Facco P; CAPE-Lab - Computer-Aided Process Engineering Laboratory, Department of Industrial Engineering, University of Padova, via Marzolo 9, 35131 Padova, Italy. Electronic address: pierantonio.facco@unipd.it.

Int J Pharm ; 660: 124233, 2024 Jul 20.

Article en En | MEDLINE | ID: mdl-38763309

ABSTRACT

ABSTRACT

A novel approach based on supervised machine-learning is proposed to predict the solubility of drugs and drug-like molecules in mixtures of organic solvents. Similar to quantitative structure-property relationship (QSPR) models, different solvent types are identified by molecular descriptors, which, in this study, are considered as UNIFAC subgroups. To overcome the potential lack of UNIFAC subgroups for the complex Active Pharmaceutical Ingredients (APIs) currently developed in the pharmaceutical industry, the API molecule is considered as a unique entity in the proposed modelling approach. Therefore, API solubility is predicted as a function of temperature, functional subgroups of the solvents and composition of the solvent mixture; in turn, regressors' correlation is handled through Partial Least-Squares (PLS) regression. The method is developed and tested with experimental data of a real API and 14 organic solvents that are industrially employed for crystallisation. Solubility predictions are accurate and precise for single solvents, binary mixtures and ternary mixtures of organic solvents at different compositions and temperatures, with a determination coefficient R2 ≥ 0.90. To further test the applicability of the model, the proposed approach is applied to 9 literature organic solubility datasets of drugs and drug-like compounds and compared to benchmark solubility models in the literature. Results show that the proposed approach provides satisfactory predictions the majority of validation and calibration data have R2 = 0.95-0.99; the ratio between RMSE (root mean squared error) of the proposed method and the range of measured solubility values is from 1 to 3 orders of magnitude smaller than the RMSE ratio obtained by the benchmark models.

Asunto(s)

Aprendizaje Automático; Solubilidad; Solventes; Solventes/química; Preparaciones Farmacéuticas/química; Análisis de los Mínimos Cuadrados; Relación Estructura-Actividad Cuantitativa; Compuestos Orgánicos/química; Ensayos Analíticos de Alto Rendimiento/métodos; Temperatura

Palabras clave

Crystallisation; Drug solubility prediction; High-throughput experimentation; Machine-learning; Mixtures of organic solvents; Novel drug solubility data

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Solubilidad / Solventes / Aprendizaje Automático Idioma: En Revista: Int J Pharm Año: 2024 Tipo del documento: Article País de afiliación: Italia Pais de publicación: Países Bajos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google