Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
J Appl Stat ; 51(5): 866-890, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38524798

RESUMO

Despite the vast advantages of making antenatal care visits, the service utilization among pregnant women in Nigeria is suboptimal. A five-year monitoring estimate indicated that about 24% of the women who had live births made no visit. The non-utilization induced excessive zeroes in the outcome of interest. Thus, this study adopted a zero-inflated negative binomial model within a Bayesian framework to identify the spatial pattern and the key factors hindering antenatal care utilization in Nigeria. We overcome the intractability associated with posterior inference by adopting a Pólya-Gamma data-augmentation technique to facilitate inference. The Gibbs sampling algorithm was used to draw samples from the joint posterior distribution. Results revealed that type of place of residence, maternal level of education, access to mass media, household work index, and woman's working status have significant effects on the use of antenatal care services. Findings identified substantial state-level spatial disparity in antenatal care utilization across the country. Cost-effective techniques to achieve an acceptable frequency of utilization include the creation of a community-specific awareness to emphasize the importance and benefits of the appropriate utilization. Special consideration should be given to older pregnant women, women in poor antenatal utilization states, and women residing in poor road network regions.

2.
Genes (Basel) ; 15(3)2024 02 24.
Artigo em Inglês | MEDLINE | ID: mdl-38540344

RESUMO

Genomic selection (GS) is revolutionizing plant breeding. However, its practical implementation is still challenging, since there are many factors that affect its accuracy. For this reason, this research explores data augmentation with the goal of improving its accuracy. Deep neural networks with data augmentation (DA) generate synthetic data from the original training set to increase the training set and to improve the prediction performance of any statistical or machine learning algorithm. There is much empirical evidence of their success in many computer vision applications. Due to this, DA was explored in the context of GS using 14 real datasets. We found empirical evidence that DA is a powerful tool to improve the prediction accuracy, since we improved the prediction accuracy of the top lines in the 14 datasets under study. On average, across datasets and traits, the gain in prediction performance of the DA approach regarding the Conventional method in the top 20% of lines in the testing set was 108.4% in terms of the NRMSE and 107.4% in terms of the MAAPE, but a worse performance was observed on the whole testing set. We encourage more empirical evaluations to support our findings.


Assuntos
Genoma de Planta , Genômica , Fenótipo , Aprendizado de Máquina , Redes Neurais de Computação
3.
Entropy (Basel) ; 25(8)2023 Aug 14.
Artigo em Inglês | MEDLINE | ID: mdl-37628237

RESUMO

A novelty signal processing method is proposed for a technical vision system (TVS). During data acquisition of an optoelectrical signal, part of this is random electrical fluctuation of voltages. Information theory (IT) is a well-known field that deals with random processes. A method based on using of the Shannon Entropy for feature extractions of optical patterns is presented. IT is implemented in structural health monitoring (SHM) to augment the accuracy of optoelectronic signal classifiers for a metrology subsystem of the TVS. To enhance the TVS spatial coordinate measurement performance at real operation conditions with electrical and optical noisy environments to estimate structural displacement better and evaluate its health for a better estimation of structural displacement and the evaluation of its health. Five different machine learning (ML) techniques are used in this work to classify optical patterns captured with the TVS. Linear predictive coding (LPC) and Autocorrelation function (ACC) are for extraction of optical patterns. The Shannon entropy segmentation (SH) method extracts relevant information from optical patterns, and the model's performance can be improved. The results reveal that segmentation with Shannon's entropy can achieve over 95.33%. Without Shannon's entropy, the worst accuracy was 33.33%.

4.
BMC Pregnancy Childbirth ; 23(1): 469, 2023 Jun 23.
Artigo em Inglês | MEDLINE | ID: mdl-37353749

RESUMO

BACKGROUND: Early prediction of Gestational Diabetes Mellitus (GDM) risk is of particular importance as it may enable more efficacious interventions and reduce cumulative injury to mother and fetus. The aim of this study is to develop machine learning (ML) models, for the early prediction of GDM using widely available variables, facilitating early intervention, and making possible to apply the prediction models in places where there is no access to more complex examinations. METHODS: The dataset used in this study includes registries from 1,611 pregnancies. Twelve different ML models and their hyperparameters were optimized to achieve early and high prediction performance of GDM. A data augmentation method was used in training to improve prediction results. Three methods were used to select the most relevant variables for GDM prediction. After training, the models ranked with the highest Area under the Receiver Operating Characteristic Curve (AUCROC), were assessed on the validation set. Models with the best results were assessed in the test set as a measure of generalization performance. RESULTS: Our method allows identifying many possible models for various levels of sensitivity and specificity. Four models achieved a high sensitivity of 0.82, a specificity in the range 0.72-0.74, accuracy between 0.73-0.75, and AUCROC of 0.81. These models required between 7 and 12 input variables. Another possible choice could be a model with sensitivity of 0.89 that requires just 5 variables reaching an accuracy of 0.65, a specificity of 0.62, and AUCROC of 0.82. CONCLUSIONS: The principal findings of our study are: Early prediction of GDM within early stages of pregnancy using regular examinations/exams; the development and optimization of twelve different ML models and their hyperparameters to achieve the highest prediction performance; a novel data augmentation method is proposed to allow reaching excellent GDM prediction results with various models.


Assuntos
Diabetes Gestacional , Gravidez , Feminino , Humanos , Diabetes Gestacional/diagnóstico , Estudos Prospectivos , Sensibilidade e Especificidade , Curva ROC , Aprendizado de Máquina
5.
J Digit Imaging ; 36(4): 1608-1623, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37012446

RESUMO

Segmentation of tumor regions in H &E-stained slides is an important task for a pathologist while diagnosing different types of cancer, including oral squamous cell carcinoma (OSCC). Histological image segmentation is often constrained by the availability of labeled training data since labeling histological images is a highly skilled, complex, and time-consuming task. Thus, data augmentation strategies become essential to train convolutional neural networks models to overcome the overfitting problem when only a few training samples are available. This paper proposes a new data augmentation strategy, named Random Composition Augmentation (RCAug), to train fully convolutional networks (FCN) to segment OSCC tumor regions in H &E-stained histological images. Given the input image and their corresponding label, a pipeline with a random composition of geometric, distortion, color transfer, and generative image transformations is executed on the fly. Experimental evaluations were performed using an FCN-based method to segment OSCC regions through a set of different data augmentation transformations. By using RCAug, we improved the FCN-based segmentation method from 0.51 to 0.81 of intersection-over-union (IOU) in a whole slide image dataset and from 0.65 to 0.69 of IOU in a tissue microarray images dataset.


Assuntos
Carcinoma de Células Escamosas , Neoplasias Bucais , Humanos , Processamento de Imagem Assistida por Computador/métodos , Carcinoma de Células Escamosas/diagnóstico por imagem , Neoplasias Bucais/diagnóstico por imagem , Redes Neurais de Computação
6.
Sensors (Basel) ; 23(3)2023 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-36772604

RESUMO

Ankle injuries caused by the Anterior Talofibular Ligament (ATFL) are the most common type of injury. Thus, finding new ways to analyze these injuries through novel technologies is critical for assisting medical diagnosis and, as a result, reducing the subjectivity of this process. As a result, the purpose of this study is to compare the ability of specialists to diagnose lateral tibial tuberosity advancement (LTTA) injury using computer vision analysis on magnetic resonance imaging (MRI). The experiments were carried out on a database obtained from the Vue PACS-Carestream software, which contained 132 images of ATFL and normal (healthy) ankles. Because there were only a few images, image augmentation techniques was used to increase the number of images in the database. Following that, various feature extraction algorithms (GLCM, LBP, and HU invariant moments) and classifiers such as Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), k-Nearest Neighbors (kNN), and Random Forest (RF) were used. Based on the results from this analysis, for cases that lack clear morphologies, the method delivers a hit rate of 85.03% with an increase of 22% over the human expert-based analysis.


Assuntos
Traumatismos do Tornozelo , Ligamentos Laterais do Tornozelo , Humanos , Tornozelo/diagnóstico por imagem , Articulação do Tornozelo , Ligamentos Laterais do Tornozelo/diagnóstico por imagem , Ligamentos Laterais do Tornozelo/lesões , Imageamento por Ressonância Magnética/métodos , Traumatismos do Tornozelo/diagnóstico por imagem , Computadores
7.
Comput Methods Biomech Biomed Engin ; 26(9): 1008-1017, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-35862582

RESUMO

The classification of sEMG signals is fundamental in applications that use mechanical prostheses, making it necessary to work with generalist databases that improve the accuracy of those classifications. Therefore, synthetic signal generation can be beneficial in enriching a database to make it more generalist. This work proposes using a variant of generative adversarial networks to produce synthetic biosignals of sEMG. A convolutional neural network (CNN) was used to classify the movements. The results showed good performance with an increase of 4.07% in a set of movement classification accuracy when 200 synthetic samples were included for each movement. We compared our results to other methodologies, such as Magnitude Warping and Scaling. Both methodologies did not have the same performance in the classification.


Assuntos
Membros Artificiais , Redes Neurais de Computação , Eletromiografia/métodos , Movimento
8.
Int J Mach Learn Cybern ; 14(1): 171-186, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-35432624

RESUMO

Deep Learning methods have important applications in the building construction image classification field. One challenge of this application is Convolutional Neural Networks adoption in a small datasets. This paper proposes a rigorous methodology for tuning of Data Augmentation hyperparameters in Deep Learning to building construction image classification, especially to vegetation recognition in facades and roofs structure analysis. In order to do that, Logistic Regression models were used to analyze the performance of Convolutional Neural Networks trained from 128 combinations of transformations in the images. Experiments were carried out with three architectures of Deep Learning from the literature using the Keras library. The results show that the recommended configuration (Height Shift Range = 0.2; Width Shift Range = 0.2; Zoom Range =0.2) reached an accuracy of 95.6 % in the test step of first case study. In addition, the hyperparameters recommended by proposed method also achieved the best test results for second case study: 93.3 % .

9.
Physiol Meas ; 43(7)2022 07 25.
Artigo em Inglês | MEDLINE | ID: mdl-35728793

RESUMO

Objective.This study proposes a U-net shaped Deep Neural Network (DNN) model to extract remote photoplethysmography (rPPG) signals from skin color signals to estimate Pulse Rate (PR).Approach.Three input window sizes are used in the DNN: 256 samples (5.12 s), 512 samples (10.24 s), and 1024 (20.48 s). A data augmentation algorithm based on interpolation is also used here to artificially increase the number of training samples.Main results.The proposed model outperformed a prior-knowledge rPPG method by using input signals with window of 256 and 512 samples. Also, it was found that the data augmentation procedure only increased the performance for the window of 1024 samples. The trained model achieved a Mean Absolute Error (MAE) of 3.97 Beats per Minute (BPM) and Root Mean Squared Error (RMSE) of 6.47 BPM, for the 256 samples window, and MAE of 3.00 BPM and RMSE of 5.45 BPM for the window of 512 samples. On the other hand, the prior-knowledge rPPG method got a MAE of 8.04 BPM and RMSE of 16.63 BPM for the window of 256 samples, and MAE of 3.49 BPM and RMSE of 7.92 BPM for the window of 512 samples. For the longest window (1024 samples), the concordance of the predicted PRs from the DNNs and the true PRs was higher when applying the data augmentation procedure.Significance.These results demonstrate a big potential of this technique for PR estimation, showing that the DNN proposed here may generate reliable rPPG signals even with short window lengths (5.12 s and 10.24 s), suggesting that it needs less data for a faster rPPG measurement and PR estimation.


Assuntos
Aprendizado Profundo , Fotopletismografia , Algoritmos , Frequência Cardíaca , Redes Neurais de Computação , Fotopletismografia/métodos , Processamento de Sinais Assistido por Computador
10.
Sensors (Basel) ; 22(4)2022 Feb 21.
Artigo em Inglês | MEDLINE | ID: mdl-35214585

RESUMO

In this research, we analyse data obtained from sensors when a user handwrites or draws on a tablet to detect whether the user is in a specific mood state. First, we calculated the features based on the temporal, kinematic, statistical, spectral and cepstral domains for the tablet pressure, the horizontal and vertical pen displacements and the azimuth of the pen's position. Next, we selected features using a principal component analysis (PCA) pipeline, followed by modified fast correlation-based filtering (mFCBF). PCA was used to calculate the orthogonal transformation of the features, and mFCBF was used to select the best PCA features. The EMOTHAW database was used for depression, anxiety and stress scale (DASS) assessment. The process involved the augmentation of the training data by first augmenting the mood states such that all the data were the same size. Then, 80% of the training data was randomly selected, and a small random Gaussian noise was added to the extracted features. Automated machine learning was employed to train and test more than ten plain and ensembled classifiers. For all three moods, we obtained 100% accuracy results when detecting two possible grades of mood severities using this architecture. The results obtained were superior to the results obtained by using state-of-the-art methods, which enabled us to define the three mood states and provide precise information to the clinical psychologist. The accuracy results obtained when detecting these three possible mood states using this architecture were 82.5%, 72.8% and 74.56% for depression, anxiety and stress, respectively.


Assuntos
Ansiedade , Aprendizado de Máquina , Ansiedade/diagnóstico , Distribuição Normal , Análise de Componente Principal , Máquina de Vetores de Suporte
11.
Plants (Basel) ; 10(10)2021 Sep 22.
Artigo em Inglês | MEDLINE | ID: mdl-34685786

RESUMO

Recently, deep-learning techniques have become the foundations for many breakthroughs in the automated identification of plant diseases. In the agricultural sector, many recent visual-computer approaches use deep-learning models. In this approach, a novel predictive analytics methodology to identify Tobacco Mosaic Virus (TMV) and Pepper Huasteco Yellow Vein Virus (PHYVV) visual symptoms on Jalapeño pepper (Capsicum annuum L.) leaves by using image-processing and deep-learning classification models is presented. The proposed image-processing approach is based on the utilization of Normalized Red-Blue Vegetation Index (NRBVI) and Normalized Green-Blue Vegetation Index (NGBVI) as new RGB-based vegetation indices, and its subsequent Jet pallet colored version NRBVI-Jet NGBVI-Jet as pre-processing algorithms. Furthermore, four standard pre-trained deep-learning architectures, Visual Geometry Group-16 (VGG-16), Xception, Inception v3, and MobileNet v2, were implemented for classification purposes. The objective of this methodology was to find the most accurate combination of vegetation index pre-processing algorithms and pre-trained deep- learning classification models. Transfer learning was applied to fine tune the pre-trained deep- learning models and data augmentation was also applied to prevent the models from overfitting. The performance of the models was evaluated using Top-1 accuracy, precision, recall, and F1-score using test data. The results showed that the best model was an Xception-based model that uses the NGBVI dataset. This model reached an average Top-1 test accuracy of 98.3%. A complete analysis of the different vegetation index representations using models based on deep-learning architectures is presented along with the study of the learning curves of these deep-learning models during the training phase.

12.
PeerJ Comput Sci ; 7: e549, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34084940

RESUMO

Due to the application of vital signs in expert systems, new approaches have emerged, and vital signals have been gaining space in biometrics. One of these signals is the electroencephalogram (EEG). The motor task in which a subject is doing, or even thinking, influences the pattern of brain waves and disturb the signal acquired. In this work, biometrics with the EEG signal from a cross-task perspective are explored. Based on deep convolutional networks (CNN) and Squeeze-and-Excitation Blocks, a novel method is developed to produce a deep EEG signal descriptor to assess the impact of the motor task in EEG signal on biometric verification. The Physionet EEG Motor Movement/Imagery Dataset is used here for method evaluation, which has 64 EEG channels from 109 subjects performing different tasks. Since the volume of data provided by the dataset is not large enough to effectively train a Deep CNN model, it is also proposed a data augmentation technique to achieve better performance. An evaluation protocol is proposed to assess the robustness regarding the number of EEG channels and also to enforce train and test sets without individual overlapping. A new state-of-the-art result is achieved for the cross-task scenario (EER of 0.1%) and the Squeeze-and-Excitation based networks overcome the simple CNN architecture in three out of four cross-individual scenarios.

13.
Sensors (Basel) ; 20(17)2020 Aug 26.
Artigo em Inglês | MEDLINE | ID: mdl-32858803

RESUMO

Monitoring biomass of forages in experimental plots and livestock farms is a time-consuming, expensive, and biased task. Thus, non-destructive, accurate, precise, and quick phenotyping strategies for biomass yield are needed. To promote high-throughput phenotyping in forages, we propose and evaluate the use of deep learning-based methods and UAV (Unmanned Aerial Vehicle)-based RGB images to estimate the value of biomass yield by different genotypes of the forage grass species Panicum maximum Jacq. Experiments were conducted in the Brazilian Cerrado with 110 genotypes with three replications, totaling 330 plots. Two regression models based on Convolutional Neural Networks (CNNs) named AlexNet and ResNet18 were evaluated, and compared to VGGNet-adopted in previous work in the same thematic for other grass species. The predictions returned by the models reached a correlation of 0.88 and a mean absolute error of 12.98% using AlexNet considering pre-training and data augmentation. This proposal may contribute to forage biomass estimation in breeding populations and livestock areas, as well as to reduce the labor in the field.


Assuntos
Ração Animal , Biomassa , Aprendizado Profundo , Plantas/classificação , Tecnologia de Sensoriamento Remoto , Animais , Brasil , Gado , Fenótipo
14.
Stat Med ; 36(30): 4873-4892, 2017 Dec 30.
Artigo em Inglês | MEDLINE | ID: mdl-29067731

RESUMO

A major challenge when monitoring risks in socially deprived areas of under developed countries is that economic, epidemiological, and social data are typically underreported. Thus, statistical models that do not take the data quality into account will produce biased estimates. To deal with this problem, counts in suspected regions are usually approached as censored information. The censored Poisson model can be considered, but all censored regions must be precisely known a priori, which is not a reasonable assumption in most practical situations. We introduce the random-censoring Poisson model (RCPM) which accounts for the uncertainty about both the count and the data reporting processes. Consequently, for each region, we will be able to estimate the relative risk for the event of interest as well as the censoring probability. To facilitate the posterior sampling process, we propose a Markov chain Monte Carlo scheme based on the data augmentation technique. We run a simulation study comparing the proposed RCPM with 2 competitive models. Different scenarios are considered. RCPM and censored Poisson model are applied to account for potential underreporting of early neonatal mortality counts in regions of Minas Gerais State, Brazil, where data quality is known to be poor.


Assuntos
Modelos Estatísticos , Distribuição de Poisson , Algoritmos , Teorema de Bayes , Bioestatística , Brasil/epidemiologia , Simulação por Computador , Humanos , Lactente , Mortalidade Infantil , Recém-Nascido , Cadeias de Markov , Método de Monte Carlo , Probabilidade
15.
Genet Epidemiol ; 40(3): 253-63, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-27027518

RESUMO

The goal of this paper is to present an implementation of stochastic search variable selection (SSVS) to multilevel model from item response theory (IRT). As experimental settings get more complex and models are required to integrate multiple (and sometimes massive) sources of information, a model that can jointly summarize and select the most relevant characteristics can provide better interpretation and a deeper insight into the problem. A multilevel IRT model recently proposed in the literature for modeling multifactorial diseases is extended to perform variable selection in the presence of thousands of covariates using SSVS. We derive conditional distributions required for such a task as well as an acceptance-rejection step that allows for the SSVS in high dimensional settings using a Markov Chain Monte Carlo algorithm. We validate the variable selection procedure through simulation studies, and illustrate its application on a study with genetic markers associated with the metabolic syndrome.


Assuntos
Teorema de Bayes , Genômica/métodos , Modelos Genéticos , Algoritmos , Marcadores Genéticos/genética , Humanos , Cadeias de Markov , Síndrome Metabólica/genética , Modelos Estatísticos , Método de Monte Carlo , Polimorfismo de Nucleotídeo Único/genética , Processos Estocásticos
16.
Biom J ; 56(2): 198-218, 2014 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-24338809

RESUMO

In this paper, we consider a piecewise exponential model (PEM) with random time grid to develop a full semiparametric Bayesian cure rate model. An elegant mechanism enjoying several attractive features for modeling the randomness of the time grid of the PEM is assumed. To model the prior behavior of the failure rates of the PEM we assume a hierarchical modeling approach that allows us to control the degree of parametricity in the right tail of the survival curve. Properties of the proposed model are discussed in detail. In particular, we investigate the impact of assuming a random time grid for the PEM on the estimation of the cure fraction. We further develop an efficient collapsed Gibbs sampler algorithm for carrying out posterior computation. A Bayesian diagnostic method for assessing goodness of fit and performing model comparisons is briefly discussed. Finally, we illustrate the usefulness of the new methodology with the analysis of a melanoma clinical trial that has been discussed in the literature.


Assuntos
Biometria/métodos , Modelos Estatísticos , Algoritmos , Teorema de Bayes , Ensaios Clínicos como Assunto , Feminino , Humanos , Masculino , Melanoma/terapia , Análise de Sobrevida , Resultado do Tratamento
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA