MRD-YOLO: A Multispectral Object Detection Algorithm for Complex Road Scenes.

Sun, Chaoyue; Chen, Yajun; Qiu, Xiaoyang; Li, Rongzhen; You, Longxiang

Sun, Chaoyue; Chen, Yajun; Qiu, Xiaoyang; Li, Rongzhen; You, Longxiang.

Afiliación

Sun C; School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.
Chen Y; School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.
Qiu X; School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.
Li R; School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.
You L; School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.

Sensors (Basel) ; 24(10)2024 May 18.

Article en En | MEDLINE | ID: mdl-38794076

ABSTRACT

ABSTRACT

Object detection is one of the core technologies for autonomous driving. Current road object detection mainly relies on visible light, which is prone to missed detections and false alarms in rainy, night-time, and foggy scenes. Multispectral object detection based on the fusion of RGB and infrared images can effectively address the challenges of complex and changing road scenes, improving the detection performance of current algorithms in complex scenarios. However, previous multispectral detection algorithms suffer from issues such as poor fusion of dual-mode information, poor detection performance for multi-scale objects, and inadequate utilization of semantic information. To address these challenges and enhance the detection performance in complex road scenes, this paper proposes a novel multispectral object detection algorithm called MRD-YOLO. In MRD-YOLO, we utilize interaction-based feature extraction to effectively fuse information and introduce the BIC-Fusion module with attention guidance to fuse different modal information. We also incorporate the SAConv module to improve the model's detection performance for multi-scale objects and utilize the AIFI structure to enhance the utilization of semantic information. Finally, we conduct experiments on two major public datasets, FLIR_Aligned and M3FD. The experimental results demonstrate that compared to other algorithms, the proposed algorithm achieves superior detection performance in complex road scenes.

Palabras clave

autonomous vehicle; computer vision; multi-modality fusion; object detection

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Sensors (Basel) Año: 2024 Tipo del documento: Article País de afiliación: China Pais de publicación: Suiza

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google