Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
IEEE Trans Pattern Anal Mach Intell ; 45(11): 13766-13777, 2023 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-37467087

RESUMEN

Millions of papers are submitted and published every year, but researchers often do not have much information about the journals that interest them. In this paper, we introduced the first dynamical clustering algorithm for symbolic polygonal data and this was applied to build scientific journals profiles. Dynamic clustering algorithms are a family of iterative two-step relocation algorithms involving the construction of clusters at each iteration and the identification of a suitable representation or prototype (means, axes, probability laws, groups of elements, etc.) for each cluster by locally optimizing an adequacy criterion that measures the fitting between clusters and their corresponding prototypes The application gives a powerful vision to understand the main variables that describe journals. Symbolic polygonal data can represent summarized extensive datasets taking into account variability. In addition, we developed cluster and partition interpretation indices for polygonal data that have the ability to extract insights about clustering results. From these indices, we discovered, e.g., that the number of difficult words in abstract is fundamental to building journal profiles.

2.
Pattern Anal Appl ; 26(1): 39-59, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-35873880

RESUMEN

Interval-valued data have been commonly encountered in practice, and Symbolic Data Analysis provides a solution to the statistical treatment of these data. Regression analysis for interval-valued symbolic data is a topic that has been widely investigated in the literature of symbolic data analysis, and several models from different paradigms have been proposed. There are basic regression assumptions, and it is essential to validate them. This paper introduces an approach to check interval regression model adequacy based on residual analysis. Concepts of ordinary and standardized interval residual are presented, and graphical analysis of these residuals is also proposed. To show the usefulness of the proposed approach, an application for estimating school dropout in the scenario of Brazilian municipalities is performed. We observed some outliers from the interval residuals analysis, and interval robust regression models are more suitable for estimating school dropout.

3.
Med Biol Eng Comput ; 55(6): 873-884, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-27629552

RESUMEN

Breast cancer is one of the leading causes of death in women. Because of this, thermographic images have received a refocus for diagnosing this cancer type. This work proposes an innovative approach to classify breast abnormalities (malignant, benignant and cyst), employing interval temperature data in order to detect breast cancer. The learning step takes into account the internal variation of the intervals when describing breast abnormalities and uses a way to map these intervals into a space where they can be more easily separated. The method builds class prototypes, and the allocation step is based on a parameterized Mahalanobis distance for interval-valued data. The proposed classifier is applied to a breast thermography dataset from Brazil with 50 patients. We investigate two different scenarios for parameter configuration. The first scenario focuses on the overall misclassification rate and achieves 16 % misclassification rate and 93 % sensitivity to the malignant class. The second scenario maximizes the sensitivity to the malignant class, achieving 100 % sensitivity to this specific class, along with 20 % overall misclassification rate. We compare the performances of our approach and of many methods taken from the literature of interval data classification for the breast thermography task. Results show that our method outperforms competing algorithms.


Asunto(s)
Neoplasias de la Mama/diagnóstico , Neoplasias de la Mama/patología , Mama/patología , Algoritmos , Brasil , Femenino , Humanos , Sensibilidad y Especificidad , Temperatura , Termografía/métodos
4.
Neural Netw ; 80: 19-33, 2016 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-27152933

RESUMEN

Some complex data types are capable of modeling data variability and imprecision. These data types are studied in the symbolic data analysis field. One such data type is interval data, which represents ranges of values and is more versatile than classic point data for many domains. This paper proposes a new prototype-based classifier for interval data, trained by a swarm optimization method. Our work has two main contributions: a swarm method which is capable of performing both automatic selection of features and pruning of unused prototypes and a generalized weighted squared Euclidean distance for interval data. By discarding unnecessary features and prototypes, the proposed algorithm deals with typical limitations of prototype-based methods, such as the problem of prototype initialization. The proposed distance is useful for learning classes in interval datasets with different shapes, sizes and structures. When compared to other prototype-based methods, the proposed method achieves lower error rates in both synthetic and real interval datasets.


Asunto(s)
Inteligencia Artificial , Aprendizaje , Algoritmos , Modelos Teóricos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA