Search | BVS Regional Portal

Markov chain stochastic DCA and applications in deep learning with PDEs regularization.

Luu, Hoang Phuc Hau; Le, Hoai Minh; Le Thi, Hoai An.

Neural Netw ; 170: 149-166, 2024 Feb.

Article in English | MEDLINE | ID: mdl-37984042

ABSTRACT

This paper addresses a large class of nonsmooth nonconvex stochastic DC (difference-of-convex functions) programs where endogenous uncertainty is involved and i.i.d. (independent and identically distributed) samples are not available. Instead, we assume that it is only possible to access Markov chains whose sequences of distributions converge to the target distributions. This setting is legitimate, as Markovian noise arises in many contexts, including Bayesian inference, reinforcement learning, and stochastic optimization in high-dimensional or combinatorial spaces. We then design a stochastic algorithm named Markov chain stochastic DCA (MCSDCA), based on DCA (DC algorithm), a well-known method for nonconvex optimization. We carry out the convergence analysis in both the asymptotic and nonasymptotic senses. MCSDCA is then applied to deep learning via PDE (partial differential equation) regularization, where two realizations of MCSDCA are constructed, namely MCSDCA-odLD and MCSDCA-udLD, based on overdamped and underdamped Langevin dynamics, respectively. Numerical experiments on time series prediction and image classification problems with a variety of neural network topologies show the merits of the proposed methods.
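As a rough illustration of the sampling setting the abstract describes (a Markov chain whose distribution approaches a target, rather than i.i.d. draws), the sketch below runs an Euler-Maruyama discretization of overdamped Langevin dynamics. It is not the paper's MCSDCA-odLD; the target (a standard normal, with potential U(x) = x^2 / 2), the step size, and the names `grad_potential` and `overdamped_langevin_chain` are assumptions chosen for the example.

```python
import math
import random


def grad_potential(x):
    """Gradient of the assumed potential U(x) = x^2 / 2 (standard normal target)."""
    return x


def overdamped_langevin_chain(n_steps, step=0.1, x0=3.0, seed=0):
    """Discretized overdamped Langevin dynamics (unadjusted Langevin algorithm).

    Update: x_{k+1} = x_k - step * grad U(x_k) + sqrt(2 * step) * xi_k,
    with xi_k drawn from N(0, 1). Successive states are correlated, and the
    chain only approaches the target distribution as k grows.
    """
    rng = random.Random(seed)
    x = x0
    chain = []
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        x = x - step * grad_potential(x) + math.sqrt(2.0 * step) * noise
        chain.append(x)
    return chain


if __name__ == "__main__":
    samples = overdamped_langevin_chain(20000)[1000:]  # drop a burn-in prefix
    mean = sum(samples) / len(samples)
    var = sum((s - mean) ** 2 for s in samples) / len(samples)
    print("sample mean:", mean, "sample variance:", var)
```

Because the chain is not corrected by an accept/reject step, its stationary variance is slightly larger than the target's for any positive step size; a stochastic optimizer fed with such samples sees correlated, slightly biased noise rather than i.i.d. noise.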

Subject(s)

Deep Learning; Markov Chains; Bayes Theorem; Neural Networks, Computer; Algorithms

Stochastic DCA for minimizing a large sum of DC functions with application to multi-class logistic regression.

Le Thi, Hoai An; Le, Hoai Minh; Phan, Duy Nhat; Tran, Bach.

Neural Netw ; 132: 220-231, 2020 Dec.

Article in English | MEDLINE | ID: mdl-32919312

ABSTRACT

We consider the problem of minimizing a large sum of DC (difference-of-convex) functions, which appears in several areas, especially stochastic optimization and machine learning. Two DCA (DC algorithm) based algorithms are proposed: stochastic DCA and inexact stochastic DCA. We prove that both algorithms converge to a critical point with probability one. Furthermore, we develop our stochastic DCA for solving an important problem in multi-task learning, namely group variable selection in multi-class logistic regression. The corresponding stochastic DCA is very inexpensive: all computations are explicit. Numerical experiments on several benchmark and synthetic datasets illustrate the efficiency of our algorithms and their superiority over existing methods with respect to classification accuracy, solution sparsity, and running time.
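The idea of a stochastic DCA on a large sum can be sketched on a toy problem. The example below is written in the spirit of the abstract, not as the authors' algorithm: it assumes scalar components f_i(x) = x^4/4 - a_i x^2/2, split as g_i(x) = x^4/4 and h_i(x) = a_i x^2/2, refreshes the stored gradients of h_i only for a random mini-batch at each iteration, and then solves the convex subproblem in closed form. The function name `stochastic_dca`, the batch size, and the data `a` are all hypothetical.

```python
import math
import random


def stochastic_dca(a, x0=0.1, batch_size=5, n_iters=1000, seed=0):
    """Toy stochastic DCA for F(x) = (1/n) * sum_i [x^4/4 - a_i * x^2 / 2].

    DC split (an assumption of this sketch): g_i(x) = x^4/4, h_i(x) = a_i*x^2/2.
    """
    rng = random.Random(seed)
    n = len(a)
    x = x0
    # Stored gradients of each h_i, all initialized at the starting point.
    y = [a_i * x for a_i in a]
    for _ in range(n_iters):
        # Refresh gradients only for a sampled mini-batch; the rest stay stale.
        for i in rng.sample(range(n), batch_size):
            y[i] = a[i] * x
        y_bar = sum(y) / n
        # Convex subproblem: minimize x^4/4 - y_bar * x, solved by x = cbrt(y_bar).
        x = math.copysign(abs(y_bar) ** (1.0 / 3.0), y_bar)
    return x


if __name__ == "__main__":
    a = [0.5 + 0.1 * i for i in range(20)]
    x_final = stochastic_dca(a)
    print("final iterate:", x_final)
    print("critical point sqrt(mean(a)):", math.sqrt(sum(a) / len(a)))
```

Each iteration touches only `batch_size` of the n components, which is where the cost saving over a full DCA step comes from. In this toy problem the iterates started at a positive point approach the critical point sqrt(mean(a)) of F.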

Subject(s)

Algorithms; Machine Learning; Logistic Models; Stochastic Processes

Block clustering based on difference of convex functions (DC) programming and DC algorithms.

Le, Hoai Minh; Le Thi, Hoai An; Dinh, Tao Pham; Huynh, Van Ngai.

Neural Comput ; 25(10): 2776-807, 2013 Oct.

Article in English | MEDLINE | ID: mdl-23777526

ABSTRACT

We investigate difference of convex functions (DC) programming and the DC algorithm (DCA) to solve the block clustering problem in the continuous framework, which traditionally requires solving a hard combinatorial optimization problem. DC reformulation techniques and exact penalty in DC programming are developed to build an appropriate equivalent DC program of the block clustering problem. They lead to an elegant and explicit DCA scheme for the resulting DC program. Computational experiments show the robustness and efficiency of the proposed algorithm and its superiority over standard algorithms such as two-mode K-means, two-mode fuzzy clustering, and block classification EM.
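For readers unfamiliar with the generic DCA scheme that the abstract builds on, here is a minimal sketch on a one-dimensional toy function. It does not reproduce the block clustering reformulation or the exact penalty technique; the objective f(x) = x^4/4 - x^2/2, its split into g(x) = x^4/4 and h(x) = x^2/2, and the name `dca` are assumptions made for illustration.

```python
import math


def f(x):
    """Toy nonconvex objective f = g - h with g(x) = x^4/4 and h(x) = x^2/2."""
    return 0.25 * x ** 4 - 0.5 * x ** 2


def dca(x0, n_iters=50):
    """Basic DCA: linearize h at the current iterate, then minimize the convex part.

    Step 1: y_k = h'(x_k) = x_k.
    Step 2: x_{k+1} = argmin_x [g(x) - y_k * x], i.e. x^3 = y_k, so x = cbrt(y_k).
    """
    x = x0
    history = [x]
    for _ in range(n_iters):
        y = x  # gradient of h at the current iterate
        x = math.copysign(abs(y) ** (1.0 / 3.0), y)  # real cube root of y
        history.append(x)
    return history


if __name__ == "__main__":
    iterates = dca(0.1)
    print("last iterate:", iterates[-1], "objective:", f(iterates[-1]))
```

Every step solves a convex subproblem exactly, so the objective value never increases along the iterates. Starting from a positive point the sequence approaches the local minimizer x = 1, and from a negative point it approaches x = -1, which illustrates that DCA finds critical points that depend on the initialization.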

Subject(s)

Algorithms; Cluster Analysis; Artificial Intelligence; Brain Neoplasms/pathology; Computer Simulation; Databases, Factual; Fuzzy Logic; Humans; Lung Neoplasms/pathology; Neoplasms/pathology; Problem Solving; Software
