Búsqueda | Portal Regional de la BVS

Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields.

Hamad, Maryam; Conti, Caroline; Nunes, Paulo; Soares, Luis Ducla.

IEEE Trans Image Process ; 32: 3790-3805, 2023.

Artículo en Inglés | MEDLINE | ID: mdl-37405879

RESUMEN

4D Light Field (LF) imaging, since it conveys both spatial and angular scene information, can facilitate computer vision tasks and generate immersive experiences for end-users. A key challenge in 4D LF imaging is to flexibly and adaptively represent the included spatio-angular information to facilitate subsequent computer vision applications. Recently, image over-segmentation into homogenous regions with perceptually meaningful information has been exploited to represent 4D LFs. However, existing methods assume densely sampled LFs and do not adequately deal with sparse LFs with large occlusions. Furthermore, the spatio-angular LF cues are not fully exploited in the existing methods. In this paper, the concept of hyperpixels is defined and a flexible, automatic, and adaptive representation for both dense and sparse 4D LFs is proposed. Initially, disparity maps are estimated for all views to enhance over-segmentation accuracy and consistency. Afterwards, a modified weighted K -means clustering using robust spatio-angular features is performed in 4D Euclidean space. Experimental results on several dense and sparse 4D LF datasets show competitive and outperforming performance in terms of over-segmentation accuracy, shape regularity and view consistency against state-of-the-art methods.

Remote Gait Type Classification System Using Markerless 2D Video.

Albuquerque, Pedro; Machado, João Pedro; Verlekar, Tanmay Tulsidas; Correia, Paulo Lobato; Soares, Luís Ducla.

Diagnostics (Basel) ; 11(10)2021 Oct 02.

Artículo en Inglés | MEDLINE | ID: mdl-34679521

RESUMEN

Several pathologies can alter the way people walk, i.e., their gait. Gait analysis can be used to detect such alterations and, therefore, help diagnose certain pathologies or assess people's health and recovery. Simple vision-based systems have a considerable potential in this area, as they allow the capture of gait in unconstrained environments, such as at home or in a clinic, while the required computations can be done remotely. State-of-the-art vision-based systems for gait analysis use deep learning strategies, thus requiring a large amount of data for training. However, to the best of our knowledge, the largest publicly available pathological gait dataset contains only 10 subjects, simulating five types of gait. This paper presents a new dataset, GAIT-IT, captured from 21 subjects simulating five types of gait, at two severity levels. The dataset is recorded in a professional studio, making the sequences free of background camouflage, variations in illumination and other visual artifacts. The dataset is used to train a novel automatic gait analysis system. Compared to the state-of-the-art, the proposed system achieves a drastic reduction in the number of trainable parameters, memory requirements and execution times, while the classification accuracy is on par with the state-of-the-art. Recognizing the importance of remote healthcare, the proposed automatic gait analysis system is integrated with a prototype web application. This prototype is presently hosted in a private network, and after further tests and development it will allow people to upload a video of them walking and execute a web service that classifies their gait. The web application has a user-friendly interface usable by healthcare professionals or by laypersons. The application also makes an association between the identified type of gait and potential gait pathologies that exhibit the identified characteristics.

A Spatiotemporal Deep Learning Approach for Automatic Pathological Gait Classification.

Albuquerque, Pedro; Verlekar, Tanmay Tulsidas; Correia, Paulo Lobato; Soares, Luís Ducla.

Sensors (Basel) ; 21(18)2021 Sep 16.

Artículo en Inglés | MEDLINE | ID: mdl-34577408

RESUMEN

Human motion analysis provides useful information for the diagnosis and recovery assessment of people suffering from pathologies, such as those affecting the way of walking, i.e., gait. With recent developments in deep learning, state-of-the-art performance can now be achieved using a single 2D-RGB-camera-based gait analysis system, offering an objective assessment of gait-related pathologies. Such systems provide a valuable complement/alternative to the current standard practice of subjective assessment. Most 2D-RGB-camera-based gait analysis approaches rely on compact gait representations, such as the gait energy image, which summarize the characteristics of a walking sequence into one single image. However, such compact representations do not fully capture the temporal information and dependencies between successive gait movements. This limitation is addressed by proposing a spatiotemporal deep learning approach that uses a selection of key frames to represent a gait cycle. Convolutional and recurrent deep neural networks were combined, processing each gait cycle as a collection of silhouette key frames, allowing the system to learn temporal patterns among the spatial features extracted at individual time instants. Trained with gait sequences from the GAIT-IT dataset, the proposed system is able to improve gait pathology classification accuracy, outperforming state-of-the-art solutions and achieving improved generalization on cross-dataset tests.

Asunto(s)

Aprendizaje Profundo , Análisis de la Marcha , Humanos , Movimiento (Física) , Redes Neurales de la Computación , Caminata

Temporal shape error concealment by global motion compensation with local refinement.

Soares, Luis Ducla; Pereira, Fernando.

IEEE Trans Image Process ; 15(6): 1331-48, 2006 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-16764261

RESUMEN

This paper presents an original temporal shape error concealment technique based on a combination of global and local motion compensation. For this technique, which is especially useful for object-based video applications in error-prone environments (e.g., mobile networks), it is assumed that the shape of the corrupted object at hand is in the form of a binary alpha plane and some of the shape data is missing due to channel errors. To conceal the corrupted shape, the decoder first assumes that a global motion model can describe the shape changes in consecutive time instants. This way, based on locally estimated global motion parameters, the decoder attempts to conceal the corrupted alpha plane by global motion compensating the shape data from the previous time instant. Afterwards, since a global motion model cannot perfectly describe all alpha plane changes, a local motion refinement is applied to improve the concealment in areas of the object with significant local motion.

Asunto(s)

Algoritmos , Artefactos , Redes de Comunicación de Computadores , Aumento de la Imagen/métodos , Interpretación de Imagen Asistida por Computador/métodos , Reconocimiento de Normas Patrones Automatizadas/métodos , Grabación en Video/métodos , Simulación por Computador , Compresión de Datos/métodos , Interpretación Estadística de Datos , Modelos Genéticos , Modelos Estadísticos , Movimiento (Física) , Procesamiento de Señales Asistido por Computador , Factores de Tiempo

Spatial shape error concealment for object-based image and video coding.

Soares, Luis Ducla; Pereira, Fernando.

IEEE Trans Image Process ; 13(4): 586-99, 2004 Apr.

Artículo en Inglés | MEDLINE | ID: mdl-15376592

RESUMEN

In this paper, an original spatial shape error-concealment technique, to be used in the context of object-based image and video coding schemes, is proposed. In this technique, it is assumed that the shape of the corrupted object at hand is in the form of a binary alpha plane, in which some of the shape data is missing due to channel errors. From this alpha plane, a contour corresponding to the border of the object can be extracted. However, due to errors, some parts of the contour will be missing and, therefore, the contour will be broken. The proposed technique relies on the interpolation of the missing contours with Bézier curves, which is done based on the available surrounding contours. After all the missing parts of the contour have been interpolated, the concealed alpha plane can be easily reconstructed from the fully recovered contour and used instead of the erroneous one improving the final subjective impact.

Asunto(s)

Algoritmos , Compresión de Datos/métodos , Aumento de la Imagen/métodos , Interpretación de Imagen Asistida por Computador/métodos , Reconocimiento de Normas Patrones Automatizadas , Procesamiento de Señales Asistido por Computador , Grabación en Video/métodos , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Técnica de Sustracción

Refreshment need metrics for improved shape and texture object-based resilient video coding.

Soares, Luis Ducla; Pereira, Fernando.

IEEE Trans Image Process ; 12(3): 328-40, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-18237912

RESUMEN

Video encoders may use several techniques to improve error resilience. In particular, for video encoders that rely on predictive (inter) coding to remove temporal redundancy, intra coding refreshment is especially useful to stop error propagation when errors occur in the transmission or storage of the coded streams, which can cause the decoded quality to decay very rapidly. In the context of object-based video coding, the video encoder can apply intra coding refreshment to both the shape and the texture data. In this paper, shape refreshment need and texture refreshment need metrics are proposed which can be used by object-based video encoders, notably MPEG-4 video encoders, to determine when the shape and the texture of the various video objects in the scene should be refreshed in order to improve the decoded video quality, e.g., for a given bitrate.

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA