Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 2.166
Filtrar
1.
Methods Mol Biol ; 2856: 197-212, 2025.
Artículo en Inglés | MEDLINE | ID: mdl-39283453

RESUMEN

Peakachu is a supervised-learning-based approach that identifies chromatin loops from chromatin contact data. Here, we present Peakachu version 2, an updated version that significantly improves extensibility, usability, and computational efficiency compared to its predecessor. It features pretrained models tailored for a wide range of experimental platforms, such as Hi-C, Micro-C, ChIA-PET, HiChIP, HiCAR, and TrAC-loop. This chapter offers a step-by-step tutorial guiding users through the process of training Peakachu models from scratch and utilizing pretrained models to predict chromatin loops across various platforms.


Asunto(s)
Cromatina , Biología Computacional , Programas Informáticos , Cromatina/metabolismo , Cromatina/genética , Biología Computacional/métodos , Humanos , Aprendizaje Automático Supervisado , Conformación de Ácido Nucleico
2.
Food Chem ; 462: 140931, 2025 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-39217752

RESUMEN

This research focused on distinguishing distinct matrix assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) spectral signatures of three Enterococcus species. We evaluated and compared the predictive performance of four supervised machine learning algorithms, K-nearest neighbor (KNN), support vector machine (SVM), and random forest (RF), to accurately classify Enterococcus species. This study involved a comprehensive dataset of 410 strains, generating 1640 individual spectra through on-plate and off-plate protein extraction methods. Although the commercial database correctly identified 76.9% of the strains, machine learning classifiers demonstrated superior performance (accuracy 0.991). In the RF model, top informative peaks played a significant role in the classification. Whole-genome sequencing showed that the most informative peaks are biomarkers connected to proteins, which are essential for understanding bacterial classification and evolution. The integration of MALDI-TOF MS and machine learning provides a rapid and accurate method for identifying Enterococcus species, improving healthcare and food safety.


Asunto(s)
Enterococcus , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción , Aprendizaje Automático Supervisado , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción/métodos , Enterococcus/clasificación , Enterococcus/química , Enterococcus/aislamiento & purificación , Enterococcus/genética , Algoritmos , Máquina de Vectores de Soporte , Técnicas de Tipificación Bacteriana/métodos , Aprendizaje Automático
3.
Sci Rep ; 14(1): 20854, 2024 09 06.
Artículo en Inglés | MEDLINE | ID: mdl-39242792

RESUMEN

Progressive gait impairment is common among aging adults. Remote phenotyping of gait during daily living has the potential to quantify gait alterations and evaluate the effects of interventions that may prevent disability in the aging population. Here, we developed ElderNet, a self-supervised learning model for gait detection from wrist-worn accelerometer data. Validation involved two diverse cohorts, including over 1000 participants without gait labels, as well as 83 participants with labeled data: older adults with Parkinson's disease, proximal femoral fracture, chronic obstructive pulmonary disease, congestive heart failure, and healthy adults. ElderNet presented high accuracy (96.43 ± 2.27), specificity (98.87 ± 2.15), recall (82.32 ± 11.37), precision (86.69 ± 17.61), and F1 score (82.92 ± 13.39). The suggested method yielded superior performance compared to two state-of-the-art gait detection algorithms, with improved accuracy and F1 score (p < 0.05). In an initial evaluation of construct validity, ElderNet identified differences in estimated daily walking durations across cohorts with different clinical characteristics, such as mobility disability (p < 0.001) and parkinsonism (p < 0.001). The proposed self-supervised method has the potential to serve as a valuable tool for remote phenotyping of gait function during daily living in aging adults, even among those with gait impairments.


Asunto(s)
Acelerometría , Marcha , Aprendizaje Automático Supervisado , Humanos , Anciano , Masculino , Femenino , Marcha/fisiología , Acelerometría/métodos , Acelerometría/instrumentación , Anciano de 80 o más Años , Actividades Cotidianas , Muñeca , Algoritmos , Dispositivos Electrónicos Vestibles , Persona de Mediana Edad
4.
Commun Biol ; 7(1): 1123, 2024 Sep 12.
Artículo en Inglés | MEDLINE | ID: mdl-39266614

RESUMEN

Spatial transcriptomics (ST) technologies allow for comprehensive characterization of gene expression patterns in the context of tissue microenvironment. However, accurately identifying domains with spatial coherence in both gene expression and histology in situ and effectively integrating data from multi-sample remains challenging. Here, we propose ResST, a graph self-supervised residual learning model based on graph neural network and Margin Disparity Discrepancy (MDD) theory. ResST aggregates gene expression, biological effects, spatial location, and morphological information to capture nonlinear relationships between a cell and surrounding cells for spatial domain identification. Also, ResST integrates multiple ST datasets and aligns latent embeddings based on MDD theory for correcting batch effects. Results show that ResST identifies continuous spatial domains at a finer scale in ten ST datasets acquired with different technologies. Moreover, ResST efficiently integrated data from multiple tissue sections vertically or horizontally while correcting batch effects. Overall, ResST demonstrates exceptional performance in analyzing ST datasets.


Asunto(s)
Perfilación de la Expresión Génica , Transcriptoma , Humanos , Perfilación de la Expresión Génica/métodos , Aprendizaje Automático Supervisado , Redes Neurales de la Computación , Algoritmos , Biología Computacional/métodos
5.
Commun Biol ; 7(1): 1107, 2024 Sep 10.
Artículo en Inglés | MEDLINE | ID: mdl-39251817

RESUMEN

The central nervous system (CNS) comprises a diverse range of brain cell types with distinct functions and gene expression profiles. Although single-cell RNA sequencing (scRNA-seq) provides new insights into the brain cell atlases, integrating large-scale CNS scRNA-seq data still encounters challenges due to the complexity and heterogeneity among CNS cell types/subtypes. In this study, we introduce a self-supervised contrastive learning method, called scCM, for integrating large-scale CNS scRNA-seq data. scCM brings functionally related cells close together while simultaneously pushing apart dissimilar cells by comparing the variations of gene expression, effectively revealing the heterogeneous relationships within the CNS cell types/subtypes. The effectiveness of scCM is evaluated on 20 CNS datasets covering 4 species and 10 CNS diseases. Leveraging these strengths, we successfully integrate the collected human CNS datasets into a large-scale reference to annotate cell types and subtypes in neural tissues. Results demonstrate that scCM provides an accurate annotation, along with rich spatial information of cell state. In summary, scCM is a robust and promising method for integrating large-scale CNS scRNA-seq data, enabling researchers to gain insights into the cellular and molecular mechanisms underlying CNS functions and diseases.


Asunto(s)
Sistema Nervioso Central , Análisis de Expresión Génica de una Sola Célula , Aprendizaje Automático Supervisado , Sistema Nervioso Central/citología , Humanos , Conjuntos de Datos como Asunto , Análisis por Conglomerados , Enfermedades Neurodegenerativas/patología , Atlas como Asunto , Animales , Aprendizaje Profundo
6.
Commun Biol ; 7(1): 1104, 2024 Sep 09.
Artículo en Inglés | MEDLINE | ID: mdl-39251833

RESUMEN

Many biological problems are understudied due to experimental limitations and human biases. Although deep learning is promising in accelerating scientific discovery, its power compromises when applied to problems with scarcely labeled data and data distribution shifts. We develop a deep learning framework-Meta Model Agnostic Pseudo Label Learning (MMAPLE)-to address these challenges by effectively exploring out-of-distribution (OOD) unlabeled data when conventional transfer learning fails. The uniqueness of MMAPLE is to integrate the concept of meta-learning, transfer learning and semi-supervised learning into a unified framework. The power of MMAPLE is demonstrated in three applications in an OOD setting where chemicals or proteins in unseen data are dramatically different from those in training data: predicting drug-target interactions, hidden human metabolite-enzyme interactions, and understudied interspecies microbiome metabolite-human receptor interactions. MMAPLE achieves 11% to 242% improvement in the prediction-recall on multiple OOD benchmarks over various base models. Using MMAPLE, we reveal novel interspecies metabolite-protein interactions that are validated by activity assays and fill in missing links in microbiome-human interactions. MMAPLE is a general framework to explore previously unrecognized biological domains beyond the reach of present experimental and computational techniques.


Asunto(s)
Aprendizaje Automático Supervisado , Humanos , Aprendizaje Profundo , Microbiota , Biología Computacional/métodos
7.
Med Image Anal ; 98: 103304, 2024 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-39173412

RESUMEN

Masked Image Modelling (MIM), a form of self-supervised learning, has garnered significant success in computer vision by improving image representations using unannotated data. Traditional MIMs typically employ a strategy of random sampling across the image. However, this random masking technique may not be ideally suited for medical imaging, which possesses distinct characteristics divergent from natural images. In medical imaging, particularly in pathology, disease-related features are often exceedingly sparse and localized, while the remaining regions appear normal and undifferentiated. Additionally, medical images frequently accompany reports, directly pinpointing pathological changes' location. Inspired by this, we propose Masked medical Image Modelling (MedIM), a novel approach, to our knowledge, the first research that employs radiological reports to guide the masking and restore the informative areas of images, encouraging the network to explore the stronger semantic representations from medical images. We introduce two mutual comprehensive masking strategies, knowledge-driven masking (KDM), and sentence-driven masking (SDM). KDM uses Medical Subject Headings (MeSH) words unique to radiology reports to identify symptom clues mapped to MeSH words (e.g., cardiac, edema, vascular, pulmonary) and guide the mask generation. Recognizing that radiological reports often comprise several sentences detailing varied findings, SDM integrates sentence-level information to identify key regions for masking. MedIM reconstructs images informed by this masking from the KDM and SDM modules, promoting a comprehensive and enriched medical image representation. Our extensive experiments on seven downstream tasks covering multi-label/class image classification, pneumothorax segmentation, and medical image-report analysis, demonstrate that MedIM with report-guided masking achieves competitive performance. Our method substantially outperforms ImageNet pre-training, MIM-based pre-training, and medical image-report pre-training counterparts. Codes are available at https://github.com/YtongXie/MedIM.


Asunto(s)
Aprendizaje Automático Supervisado , Humanos , Procesamiento de Imagen Asistido por Computador/métodos , Interpretación de Imagen Asistida por Computador/métodos , Algoritmos
8.
Comput Biol Med ; 181: 109046, 2024 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-39205345

RESUMEN

In deep-learning-based medical image segmentation tasks, semi-supervised learning can greatly reduce the dependence of the model on labeled data. However, existing semi-supervised medical image segmentation methods face the challenges of object boundary ambiguity and a small amount of available data, which limit the application of segmentation models in clinical practice. To solve these problems, we propose a novel semi-supervised medical image segmentation network based on dual-consistency guidance, which can extract reliable semantic information from unlabeled data over a large spatial and dimensional range in a simple and effective manner. This serves to improve the contribution of unlabeled data to the model accuracy. Specifically, we construct a split weak and strong consistency constraint strategy to capture data-level and feature-level consistencies from unlabeled data to improve the learning efficiency of the model. Furthermore, we design a simple multi-scale low-level detail feature enhancement module to improve the extraction of low-level detail contextual information, which is crucial to accurately locate object contours and avoid omitting small objects in semi-supervised medical image dense prediction tasks. Quantitative and qualitative evaluations on six challenging datasets demonstrate that our model outperforms other semi-supervised segmentation models in terms of segmentation accuracy and presents advantages in terms of generalizability. Code is available at https://github.com/0Jmyy0/SSMIS-DC.


Asunto(s)
Aprendizaje Profundo , Humanos , Procesamiento de Imagen Asistido por Computador/métodos , Aprendizaje Automático Supervisado , Algoritmos , Interpretación de Imagen Asistida por Computador/métodos
9.
Int J Neural Syst ; 34(11): 2450057, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-39155691

RESUMEN

Typically, deep learning models for image segmentation tasks are trained using large datasets of images annotated at the pixel level, which can be expensive and highly time-consuming. A way to reduce the amount of annotated images required for training is to adopt a semi-supervised approach. In this regard, generative deep learning models, concretely Generative Adversarial Networks (GANs), have been adapted to semi-supervised training of segmentation tasks. This work proposes MaskGDM, a deep learning architecture combining some ideas from EditGAN, a GAN that jointly models images and their segmentations, together with a generative diffusion model. With careful integration, we find that using a generative diffusion model can improve EditGAN performance results in multiple segmentation datasets, both multi-class and with binary labels. According to the quantitative results obtained, the proposed model improves multi-class image segmentation when compared to the EditGAN and DatasetGAN models, respectively, by [Formula: see text] and [Formula: see text]. Moreover, using the ISIC dataset, our proposal improves the results from other models by up to [Formula: see text] for the binary image segmentation approach.


Asunto(s)
Aprendizaje Profundo , Procesamiento de Imagen Asistido por Computador , Redes Neurales de la Computación , Semántica , Humanos , Procesamiento de Imagen Asistido por Computador/métodos , Aprendizaje Automático Supervisado
10.
Sensors (Basel) ; 24(16)2024 Aug 16.
Artículo en Inglés | MEDLINE | ID: mdl-39205012

RESUMEN

The three Ground Reaction Force (GRF) components can be estimated using pressure insole sensors. In this paper, we compare the accuracy of estimating GRF components for both feet using six methods: three Deep Learning (DL) methods (Artificial Neural Network, Long Short-Term Memory, and Convolutional Neural Network) and three Supervised Machine Learning (SML) methods (Least Squares, Support Vector Regression, and Random Forest (RF)). Data were collected from nine subjects across six activities: normal and slow walking, static with and without carrying a load, and two Manual Material Handling activities. This study has two main contributions: first, the estimation of GRF components (Fx, Fy, and Fz) during the six activities, two of which have never been studied; second, the comparison of the accuracy of GRF component estimation between the six methods for each activity. RF provided the most accurate estimation for static situations, with mean RMSE values of RMSE_Fx = 1.65 N, RMSE_Fy = 1.35 N, and RMSE_Fz = 7.97 N for the mean absolute values measured by the force plate (reference) RMSE_Fx = 14.10 N, RMSE_Fy = 3.83 N, and RMSE_Fz = 397.45 N. In our study, we found that RF, an SML method, surpassed the experimented DL methods.


Asunto(s)
Aprendizaje Profundo , Presión , Aprendizaje Automático Supervisado , Humanos , Masculino , Caminata/fisiología , Redes Neurales de la Computación , Zapatos , Adulto , Femenino , Pie/fisiología , Fenómenos Biomecánicos/fisiología , Adulto Joven
11.
Med Image Anal ; 97: 103291, 2024 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-39121545

RESUMEN

In positron emission tomography (PET) and X-ray computed tomography (CT), reducing radiation dose can cause significant degradation in image quality. For image quality enhancement in low-dose PET and CT, we propose a novel theoretical adversarial and variational deep neural network (DNN) framework relying on expectation maximization (EM) based learning, termed adversarial EM (AdvEM). AdvEM proposes an encoder-decoder architecture with a multiscale latent space, and generalized-Gaussian models enabling datum-specific robust statistical modeling in latent space and image space. The model robustness is further enhanced by including adversarial learning in the training protocol. Unlike typical variational-DNN learning, AdvEM proposes latent-space sampling from the posterior distribution, and uses a Metropolis-Hastings scheme. Unlike existing schemes for PET or CT image enhancement which train using pairs of low-dose images with their corresponding normal-dose versions, we propose a semi-supervised AdvEM (ssAdvEM) framework that enables learning using a small number of normal-dose images. AdvEM and ssAdvEM enable per-pixel uncertainty estimates for their outputs. Empirical analyses on real-world PET and CT data involving many baselines, out-of-distribution data, and ablation studies show the benefits of the proposed framework.


Asunto(s)
Aprendizaje Profundo , Dosis de Radiación , Humanos , Tomografía Computarizada por Rayos X/métodos , Tomografía de Emisión de Positrones/métodos , Aumento de la Imagen/métodos , Redes Neurales de la Computación , Aprendizaje Automático Supervisado , Algoritmos
12.
Med Image Anal ; 97: 103302, 2024 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-39154618

RESUMEN

Semi-supervised medical image segmentation (SSMIS) has witnessed substantial advancements by leveraging limited labeled data and abundant unlabeled data. Nevertheless, existing state-of-the-art (SOTA) methods encounter challenges in accurately predicting labels for the unlabeled data, giving rise to disruptive noise during training and susceptibility to erroneous information overfitting. Moreover, applying perturbations to inaccurate predictions further impedes consistent learning. To address these concerns, we propose a novel cross-head mutual mean-teaching network (CMMT-Net) incorporated weak-strong data augmentations, thereby benefiting both co-training and consistency learning. More concretely, our CMMT-Net extends the cross-head co-training paradigm by introducing two auxiliary mean teacher models, which yield more accurate predictions and provide supplementary supervision. The predictions derived from weakly augmented samples generated by one mean teacher are leveraged to guide the training of another student with strongly augmented samples. Furthermore, two distinct yet synergistic data perturbations at the pixel and region levels are introduced. We propose mutual virtual adversarial training (MVAT) to smooth the decision boundary and enhance feature representations, and a cross-set CutMix strategy to generate more diverse training samples for capturing inherent structural data information. Notably, CMMT-Net simultaneously implements data, feature, and network perturbations, amplifying model diversity and generalization performance. Experimental results on three publicly available datasets indicate that our approach yields remarkable improvements over previous SOTA methods across various semi-supervised scenarios. The code is available at https://github.com/Leesoon1984/CMMT-Net.


Asunto(s)
Aprendizaje Automático Supervisado , Humanos , Procesamiento de Imagen Asistido por Computador/métodos , Algoritmos , Redes Neurales de la Computación
13.
Int J Mol Sci ; 25(15)2024 Jul 28.
Artículo en Inglés | MEDLINE | ID: mdl-39125808

RESUMEN

Multifactorial diseases demand therapeutics that can modulate multiple targets for enhanced safety and efficacy, yet the clinical approval of multitarget drugs remains rare. The integration of machine learning (ML) and deep learning (DL) in drug discovery has revolutionized virtual screening. This study investigates the synergy between ML/DL methodologies, molecular representations, and data augmentation strategies. Notably, we found that SVM can match or even surpass the performance of state-of-the-art DL methods. However, conventional data augmentation often involves a trade-off between the true positive rate and false positive rate. To address this, we introduce Negative-Augmented PU-bagging (NAPU-bagging) SVM, a novel semi-supervised learning framework. By leveraging ensemble SVM classifiers trained on resampled bags containing positive, negative, and unlabeled data, our approach is capable of managing false positive rates while maintaining high recall rates. We applied this method to the identification of multitarget-directed ligands (MTDLs), where high recall rates are critical for compiling a list of interaction candidate compounds. Case studies demonstrate that NAPU-bagging SVM can identify structurally novel MTDL hits for ALK-EGFR with favorable docking scores and binding modes, as well as pan-agonists for dopamine receptors. The NAPU-bagging SVM methodology should serve as a promising avenue to virtual screening, especially for the discovery of MTDLs.


Asunto(s)
Descubrimiento de Drogas , Descubrimiento de Drogas/métodos , Humanos , Simulación del Acoplamiento Molecular , Ligandos , Máquina de Vectores de Soporte , Aprendizaje Profundo , Aprendizaje Automático Supervisado , Aprendizaje Automático
14.
Sci Rep ; 14(1): 17956, 2024 08 02.
Artículo en Inglés | MEDLINE | ID: mdl-39095606

RESUMEN

The symptoms of diseases can vary among individuals and may remain undetected in the early stages. Detecting these symptoms is crucial in the initial stage to effectively manage and treat cases of varying severity. Machine learning has made major advances in recent years, proving its effectiveness in various healthcare applications. This study aims to identify patterns of symptoms and general rules regarding symptoms among patients using supervised and unsupervised machine learning. The integration of a rule-based machine learning technique and classification methods is utilized to extend a prediction model. This study analyzes patient data that was available online through the Kaggle repository. After preprocessing the data and exploring descriptive statistics, the Apriori algorithm was applied to identify frequent symptoms and patterns in the discovered rules. Additionally, the study applied several machine learning models for predicting diseases, including stepwise regression, support vector machine, bootstrap forest, boosted trees, and neural-boosted methods. Several predictive machine learning models were applied to the dataset to predict diseases. It was discovered that the stepwise method for fitting outperformed all competitors in this study, as determined through cross-validation conducted for each model based on established criteria. Moreover, numerous significant decision rules were extracted in the study, which can streamline clinical applications without the need for additional expertise. These rules enable the prediction of relationships between symptoms and diseases, as well as between different diseases. Therefore, the results obtained in this study have the potential to improve the performance of prediction models. We can discover diseases symptoms and general rules using supervised and unsupervised machine learning for the dataset. Overall, the proposed algorithm can support not only healthcare professionals but also patients who face cost and time constraints in diagnosing and treating these diseases.


Asunto(s)
Algoritmos , Aprendizaje Automático Supervisado , Aprendizaje Automático no Supervisado , Humanos , Masculino , Femenino , Máquina de Vectores de Soporte , Persona de Mediana Edad , Adulto , Enfermedad
15.
Environ Sci Pollut Res Int ; 31(40): 52740-52757, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39158659

RESUMEN

This study was carried out with the aim of applying Condorcet and Borda scoring algorithms based on Game Theory (GT) to determine flood points and Flood Susceptibility Mapping (FSM) based on Machine Learning Algorithms (MLA) including Random Forest (RF), Support Vector Regression (SVR), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN) in the Cheshmeh-Kileh watershed, Iran. Therefore, first, FS conditioning factors including Aspect (As), Elevation (El), Euclidean distance (Euc), Forest (F), NDVI, Precipitation (P), Plan Curvature (PlC), Profile Curvature (PrC), Residential (Re), Rangeland (Rl), Slope (Sl), Stream Power Index (SPI), Topographic Position Index (TPI), and Topographic Wetness Index (TWI) were quantified in each Sub-Watershed (SW). Based on this, flood and non-flood points were identified based on both GT algorithms. In the following, MLAs including Random Forest (RF), Support Vector Regression (SVR), Support Vector Machines (SVM), and K-Nearest Neighbors (KNN) were used for the distributional mapping of FS. Finally, based on optimal conjunct approaches, FS maps were presented in the study watershed. Based on the results, among the conjunct algorithms in FS classification, RF-Condorcet and RF-Borda models were selected as the most optimal MLA-GT hybrid models. The upstream SWs were highly susceptible. Also, the effectiveness of NDVI and forest conditioning factors in each classification approach was high. The similarity of SW prioritization based on Condorcet algorithm with RF-Condorcet algorithm was about 86.70%. Meanwhile, the degree of similarity in RF-Borda conjunct algorithm was around 73.33%. These results showed that Condorcet algorithm had an optimal classification compared to Borda scoring algorithm.


Asunto(s)
Algoritmos , Inundaciones , Teoría del Juego , Aprendizaje Automático Supervisado , Máquina de Vectores de Soporte , Irán , Aprendizaje Automático
16.
J Environ Manage ; 369: 122250, 2024 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-39213853

RESUMEN

High diversity seabed habitats, such as shellfish aggregations, play a significant role in marine ecosystem sustainability but are susceptible to bottom disturbance induced by anthropogenic activities. Regular monitoring of these habitats with effective mapping methods is therefore essential. Multibeam echosounder (MBES) has been widely used in recent decades for seabed characterization due to its non-destructive manner and extensive spatial coverage compared to traditional methods like bottom sampling. Nevertheless, bottom sampling remains essential to link ground truth with acoustic seabed classification. Using seabed samples and MBES measurements, machine learning techniques are commonly employed to model their relationships and generate classification maps of an extended seabed. However, limited ground truth data, resulting from constraints in regulations, budget, or time, may impede the development of robust machine learning models. To address this challenge, we applied a semi-supervised machine learning method to classify seabed sediments of a blue mussel (Mytilus edulis) cultivation area in the Oosterschelde, the Netherlands. We utilized nine boxcore samples to generate pseudo-labels on MBES data. These pseudo-labels enlarged the training data size, facilitated the training of three comprehensive machine learning algorithms (Gradient Boosting, Random Forest, and Support Vector Machine), and helped to classify the study site into mussel and non-mussel areas. We found the geomorphological and backscatter-related features to be complementary for mussel culture detection. Our classification results were demonstrated effective through expert knowledge of this cultivation area and brought insights for future research on natural mussel habitats.


Asunto(s)
Ecosistema , Animales , Monitoreo del Ambiente/métodos , Aprendizaje Automático Supervisado , Países Bajos , Bivalvos , Aprendizaje Automático , Mytilus edulis
17.
Neural Netw ; 179: 106570, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-39089151

RESUMEN

Sequential recommendation typically utilizes deep neural networks to mine rich information in interaction sequences. However, existing methods often face the issue of insufficient interaction data. To alleviate the sparsity issue, self-supervised learning is introduced into sequential recommendation. Despite its effectiveness, we argue that current self-supervised learning-based (i.e., SSL-based) sequential recommendation models have the following limitations: (1) using only a single self-supervised learning method, either contrastive self-supervised learning or generative self-supervised learning. (2) employing a simple data augmentation strategy in either the graph structure domain or the node feature domain. We believe that they have not fully utilized the capabilities of both self-supervised methods and have not sufficiently explored the advantages of combining graph augmentation schemes. As a result, they often fail to learn better item representations. In light of this, we propose a novel multi-task sequential recommendation framework named Adaptive Self-supervised Learning for sequential Recommendation (ASLRec). Specifically, our framework combines contrastive and generative self-supervised learning methods adaptively, simultaneously applying different perturbations at both the graph topology and node feature levels. This approach constructs diverse augmented graph views and employs multiple loss functions (including contrastive loss, generative loss, mask loss, and prediction loss) for joint training. By encompassing the capabilities of various methods, our model learns item representations across different augmented graph views to achieve better performance and effectively mitigate interaction noise and sparsity. In addition, we add a small proportion of random uniform noise to item representations, making the item representations more uniform and mitigating the inherent popularity bias in interaction records. We conduct extensive experiments on three publicly available benchmark datasets to evaluate our model. The results demonstrate that our approach achieves state-of-the-art performance compared to 14 other competitive methods: the hit rate (HR) improved by over 14.39%, and the normalized discounted cumulative gain (NDCG) increased by over 18.67%.


Asunto(s)
Redes Neurales de la Computación , Aprendizaje Automático Supervisado , Humanos , Algoritmos , Aprendizaje Profundo
18.
Neural Netw ; 179: 106578, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-39111158

RESUMEN

Self-supervised contrastive learning draws on power representational models to acquire generic semantic features from unlabeled data, and the key to training such models lies in how accurately to track motion features. Previous video contrastive learning methods have extensively used spatially or temporally augmentation as similar instances, resulting in models that are more likely to learn static backgrounds than motion features. To alleviate the background shortcuts, in this paper, we propose a cross-view motion consistent (CVMC) self-supervised video inter-intra contrastive model to focus on the learning of local details and long-term temporal relationships. Specifically, we first extract the dynamic features of consecutive video snippets and then align these features based on multi-view motion consistency. Meanwhile, we compare the optimized dynamic features for instance comparison of different videos and local spatial fine-grained with temporal order in the same video, respectively. Ultimately, the joint optimization of spatio-temporal alignment and motion discrimination effectively fills the challenges of the missing components of instance recognition, spatial compactness, and temporal perception in self-supervised learning. Experimental results show that our proposed self-supervised model can effectively learn visual representation information and achieve highly competitive performance compared to other state-of-the-art methods in both action recognition and video retrieval tasks.


Asunto(s)
Grabación en Video , Humanos , Redes Neurales de la Computación , Percepción de Movimiento/fisiología , Aprendizaje Automático Supervisado , Movimiento (Física) , Algoritmos
19.
Neural Netw ; 179: 106591, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-39111162

RESUMEN

Most existing model-based and learning-based image deblurring methods usually use synthetic blur-sharp training pairs to remove blur. However, these approaches do not perform well in real-world applications as the blur-sharp training pairs are difficult to be obtained and the blur in real-world scenarios is spatial-variant. In this paper, we propose a self-supervised learning-based image deblurring method that can deal with both uniform and spatial-variant blur distributions. Moreover, our method does not need for blur-sharp pairs for training. In our proposed method, we design the Deblurring Network (D-Net) and the Spatial Degradation Network (SD-Net). Specifically, the D-Net is designed for image deblurring while the SD-Net is used to simulate the spatial-variant degradation. Furthermore, the off-the-shelf pre-trained model is employed as the prior of our model, which facilitates image deblurring. Meanwhile, we design a recursive optimization strategy to accelerate the convergence of the model. Extensive experiments demonstrate that our proposed model achieves favorable performance against existing image deblurring methods.


Asunto(s)
Procesamiento de Imagen Asistido por Computador , Redes Neurales de la Computación , Aprendizaje Automático Supervisado , Procesamiento de Imagen Asistido por Computador/métodos , Aprendizaje Profundo , Algoritmos , Humanos
20.
Neural Netw ; 179: 106596, 2024 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-39163823

RESUMEN

De novo molecular design is the process of learning knowledge from existing data to propose new chemical structures that satisfy the desired properties. By using de novo design to generate compounds in a directed manner, better solutions can be obtained in large chemical libraries with less comparison cost. But drug design needs to take multiple factors into consideration. For example, in polypharmacology, molecules that activate or inhibit multiple target proteins produce multiple pharmacological activities and are less susceptible to drug resistance. However, most existing molecular generation methods either focus only on affinity for a single target or fail to effectively balance the relationship between multiple targets, resulting in insufficient validity and desirability of the generated molecules. To address the problems, an approach called clustered Pareto-based reinforcement learning (CPRL) is proposed. In CPRL, a pre-trained model is constructed to grasp existing molecular knowledge in a supervised learning manner. In addition, the clustered Pareto optimization algorithm is presented to find the best solution between different objectives. The algorithm first extracts an update set from the sampled molecules through the designed aggregation-based molecular clustering. Then, the final reward is computed by constructing the Pareto frontier ranking of the molecules from the updated set. To explore the vast chemical space, a reinforcement learning agent is designed in CPRL that can be updated under the guidance of the final reward to balance multiple properties. Furthermore, to increase the internal diversity of the molecules, a fixed-parameter exploration model is used for sampling in conjunction with the agent. The experimental results demonstrate that CPRL is capable of balancing multiple properties of the molecule and has higher desirability and validity, reaching 0.9551 and 0.9923, respectively.


Asunto(s)
Algoritmos , Diseño de Fármacos/métodos , Refuerzo en Psicología , Análisis por Conglomerados , Aprendizaje Automático Supervisado , Redes Neurales de la Computación
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA