Night UAV Vehicle Detection Based on Infrared-Visible Fusion and Improved YOLO11
DOI:
https://doi.org/10.71204/ew979106Keywords:
Nighttime Vehicle Detection, Drone Aerial Photography, Multimodal Fusion, YOLO11, Feature Pyramid NetworkAbstract
To address the challenges of vehicle detection accuracy in nighttime UAV aerial photography scenarios caused by low light conditions, strong noise, and dense small object distribution, this paper proposes an improved YOLO11 vehicle detection method based on multi-modal fusion. First, a collaborative enhancement strategy combining CLAHE and Gamma correction is applied to preprocessing nighttime visible light images, effectively restoring vehicle texture details in dark areas. Second, the enhanced visible light images are fused with infrared images through fixed-weight weighted fusion (infrared weight α=0.7), fully leveraging the complementary advantages of both modalities. Finally, the MANet module is introduced on top of YOLO11n to enhance backbone multi-scale feature extraction capabilities, while the ADFPN-DASI module is designed to collaboratively optimize neck multi-scale feature representation. Experiments conducted on the DroneVehicle dataset demonstrate that the proposed method achieves an mAP50 of 80.53%, representing an improvement of 11.45 percentage points over the YOLO11n baseline. The method maintains lightweight advantages in parameter and computational resources, outperforming mainstream comparison methods such as YOLOv5n.
References
Feng, Y., Huang, J., Du, S., et al. (2024). Hyper-YOLO: When visual object detection meets hypergraph computation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(4), 2388–2401.
Khanam, R., & Hussain, M. (2024). YOLO11: An overview of the key architectural enhancements. arXiv preprint arXiv:2410.17725.
Li, X., Li, X., et al. (2024). A survey of object detection for UAVs based on deep learning. Remote Sensing, 16(1), 149.
Ma, J., Ma, Y., & Li, C. (2019). Infrared and visible image fusion methods and applications: A survey. Information Fusion, 45, 153–178.
Ma, M., Wang, H., & Wang, J. (2023). An underwater image enhancement algorithm based on improved MSRCR–CLAHE fusion. Infrared Technology, 45(1), 23–32.
Mittal, P., Singh, R., & Sharma, A. (2020). Deep learning-based object detection in low-altitude UAV datasets: A survey. Image and Vision Computing, 104, 104046.
Persiya, J., & Sasithradevi, A. (2025). Synergistic fusion: An integrated pipeline of CLAHE, YOLO models, and advanced super-resolution for enhanced thermal eye detection. PLOS ONE, 20(7), e0328227.
Redmon, J., & Farhadi, A. (2018). YOLOv3: An incremental improvement. arXiv preprint arXiv:1804.02767.
Sun, Y., Cao, B., Zhu, P., et al. (2022). Drone-based RGB-infrared cross-modality vehicle detection via uncertainty-aware learning. IEEE Transactions on Circuits and Systems for Video Technology, 32(10), 6700–6713.
Tian, D., Yan, X., Zhou, D., et al. (2024). IV-YOLO: A lightweight dual-branch object detection network. Sensors, 24(19), 6181.
Tian, Y., Ye, Q., & Doermann, D. (2025). YOLOv12: Attention-centric real-time object detectors. arXiv preprint arXiv:2502.12524.
Wang, G., Wang, G., et al. (2023). UAV-YOLOv8: A small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios. Sensors, 23(16), 7190.
Yuan, Z., Zeng, J., Wei, Z., et al. (2023). CLAHE-based low-light image enhancement for robust object detection in overhead power transmission system. IEEE Transactions on Power Delivery, 38(3), 2240–2243.
Zhang, Q., Qiu, L., Zhou, L., et al. (2024). ESM-YOLO: Enhanced small target detection based on visible and infrared multi-modal fusion. In Proceedings of the Asian Conference on Computer Vision (pp. 1454–1469).
Zhang, Y., Dai, Z., Pan, C., et al. (2025). NOC-YOLO: An exploration to enhance small-target vehicle detection accuracy in aerial infrared images. Infrared Physics & Technology, 149, 105905.
Zhu, P., Wen, L., Du, D., et al. (2021). Detection and tracking meet drones challenge. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), 7380–7399.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Hongxia Su (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in this journal are licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author(s) and source are properly credited. Authors retain copyright of their work, and readers are free to copy, share, adapt, and build upon the material for any purpose, including commercial use, as long as appropriate attribution is given.