-
CiteScore
1.08
Impact Factor
IECE Transactions on Intelligent Systematics, 2024, Volume 1, Issue 1: 30-39

Free Access | Research Article | 27 May 2024 | Cited: 5
1 School of Computer Science and Artificial Intelligence, Beijing Technology and Business University, Beijing 100048, China
* Corresponding author: Huijun Ma, email: [email protected]
Received: 16 February 2024, Accepted: 19 May 2024, Published: 27 May 2024  

Abstract
In recent years, deep learning techniques have been increasingly applied to the detection of remote sensing images. However, the substantial size variation and dense distribution of objects in these images present significant challenges to detection algorithms. Current methods often suffer from low efficiency, missed detections, and inaccurate bounding boxes. To address these issues, this paper presents an improved YOLO algorithm, YOLOv7-bw, designed for efficient remote sensing image detection, thereby advancing object detection applications in the remote sensing industry. YOLOv7-bw enhances the original SPPCSPC pooling pyramid network by incorporating a Bi-level Routing Attention module, which focuses on densely populated target areas to improve the network's feature extraction capabilities. Additionally, it introduces a dynamic non-monotonic WIoUv3 loss function to replace the original CIoU loss function. This substitution ensures that the loss function's gradient allocation strategy aligns more effectively with the current detection scenario, enhancing the network's focus on the detection object. Through comparative experiments on the DIOR remote sensing image dataset, we found that YOLOv7-bw achieved a high [email protected] of 85.63% and a high [email protected]:0.95 of 65.93%, surpassing the previous results of 83.7% and 63.9% by approximately 1.93% and 2.03%, respectively. Moreover, compared with commonly used algorithms, YOLOv7-bw demonstrated superior performance, thereby validating the feasibility and enhanced applicability of our proposed algorithm for remote sensing image detection.

Graphical Abstract
YOLOv7-Bw: A Dense Small Object Efficient Detector Based on Remote Sensing Image

Keywords
Remote Sensing Image
YOLO
Object Detection
mAP

References

[1]Nie, G. T., & Huang, H. (2021). A survey of object detection in optical remote sensing images. Acta Automatica Sinica, 47(8), 1749-1768.

[2]Liu, G., Sun, X., Fu, K., & Wang, H. (2012). Aircraft recognition in high-resolution satellite images using coarse-to-fine shape prior. IEEE Geoscience and Remote Sensing Letters, 10(3), 573-577.

[3]Liu, Q., Xiang, X., Wang, Y., Luo, Z., & Fang, F. (2020). Aircraft detection in remote sensing image based on corner clustering and deep learning. Engineering Applications of Artificial Intelligence, 87, 103333.

[4]Zhu, C., Zhou, H., Wang, R., & Guo, J. (2010). A novel hierarchical method of ship detection from spaceborne optical image based on shape and texture features. IEEE Transactions on geoscience and remote sensing, 48(9), 3446-3456.

[5]Bi, F., Zhu, B., Gao, L., & Bian, M. (2012). A visual search inspired computational model for ship detection in optical satellite images. IEEE Geoscience and Remote Sensing Letters, 9(4), 749-753.

[6]Ren, S., He, K., Girshick, R., & Sun, J. (2016). Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE transactions on pattern analysis and machine intelligence, 39(6), 1137-1149.

[7]Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W., & Lin, D. (2019). Libra r-cnn: Towards balanced learning for object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 821-830).

[8]Nie, X., Duan, M., Ding, H., Hu, B., & Wong, E. K. (2020). Attention mask R-CNN for ship detection and segmentation from remote sensing images. Ieee Access, 8, 9325-9334.

[9]Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779-788).

[10]Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016). Ssd: Single shot multibox detector. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14 (pp. 21-37). Springer International Publishing.

[11]Tian, Z., Shen, C., Chen, H., & He, T. (1904). FCOS: Fully convolutional one-stage object detection. arXiv 2019. arXiv preprint arXiv:1904.01355.

[12]Tong, Z., Chen, Y., Xu, Z., & Yu, R. (2023). Wise-IoU: bounding box regression loss with dynamic focusing mechanism. arXiv preprint arXiv:2301.10051.

[13]Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.

[14]Wang, C. Y., Bochkovskiy, A., & Liao, H. Y. M. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 7464-7475).

[15]Cai, W., Qian, P., Ding, Y., Bi, M., Ning, X., Hong, D., & Bai, X. (2023). Graph Structured Convolution-Guided Continuous Context Threshold-Aware Networks for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing.

[16]Cai, W., Gao, M., Ding, Y., Ning, X., Bai, X., & Qian, P. (2023). Stereo Attention Cross-Decoupling Fusion-Guided Federated Neural Learning for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing.

[17]Li, X., Ding, M., & Pižurica, A. (2021). Spectral feature fusion networks with dual attention for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 60, 1-14.

[18]Yang, X., Yang, X., Yang, J., Ming, Q., Wang, W., Tian, Q., & Yan, J. (2021). Learning high-precision bounding box for rotated object detection via kullback-leibler divergence. Advances in Neural Information Processing Systems, 34, 18381-18394.

[19]Zhang, M., Liu, T., Piao, Y., Yao, S., & Lu, H. (2021, October). Auto-msfnet: Search multi-scale fusion network for salient object detection. In Proceedings of the 29th ACM international conference on multimedia (pp. 667-676).

[20]Jiang, S., Zhang, J., Wang, W., & Wang, Y. (2023). Automatic inspection of bridge bolts using unmanned aerial vision and adaptive scale unification-based deep learning. Remote Sensing, 15(2), 328.

[21]Wang, Y., Wang, L., Wang, H., & Li, P. (2019). End-to-end image super-resolution via deep and shallow convolutional networks. IEEE Access, 7, 31959-31970.

[22]Yang, F., Li, W., Hu, H., Li, W., & Wang, P. (2020). Multi-scale feature integrated attention-based rotation network for object detection in VHR aerial images. Sensors, 20(6), 1686.

[23]Yao, H., Yu, W., Luo, W., Qiang, Z., Luo, D., & Zhang, X. (2023). Learning global-local correspondence with semantic bottleneck for logical anomaly detection. IEEE Transactions on Circuits and Systems for Video Technology.

[24]Yan, R., Yan, L., Cao, Y., Geng, G., & Zhou, P. (2024). One-stop multiscale reconciliation attention network with scribble supervision for salient object detection in optical remote sensing images. Applied Intelligence, 54(5), 3737-3755.

[25]Zhang, H., & Wu, Y. (2024). CSEF-Net: Cross-Scale SAR Ship Detection Network Based on Efficient Receptive Field and Enhanced Hierarchical Fusion. Remote Sensing, 16(4), 622.

[26]Roy, A. M., & Bhaduri, J. (2023). DenseSPH-YOLOv5: An automated damage detection model based on DenseNet and Swin-Transformer prediction head-enabled YOLOv5 with attention mechanism. Advanced Engineering Informatics, 56, 102007.

[27]Mahaadevan, V. C., Narayanamoorthi, R., Gono, R., & Moldrik, P. (2023). Automatic Identifier of Socket for Electrical Vehicles using SWIN-Transformer and SimAM Attention Mechanism-based EVS YOLO. IEEE Access.

[28]Liu, Q., Xia, T., Cheng, L., Van Eijk, M., Ozcelebi, T., & Mao, Y. (2021). Deep reinforcement learning for load-balancing aware network control in IoT edge systems. IEEE Transactions on Parallel and Distributed Systems, 33(6), 1491-1502.


Cite This Article
APA Style
Jin, X., Tong, A., Ge, X., Ma, H., Li, J., Fu, H., & Gao, L. (2024). YOLOv7-Bw: A Dense Small Object Efficient Detector Based on Remote Sensing Image. IECE Transactions on Intelligent Systematics, 1(1), 30-39. https://doi.org/10.62762/TIS.2024.137321

Article Metrics
Citations:

Crossref

1

Scopus

5

Web of Science

5
Article Access Statistics:
Views: 7981
PDF Downloads: 791

Publisher's Note
IECE stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions
IECE or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
IECE Transactions on Intelligent Systematics

IECE Transactions on Intelligent Systematics

ISSN: 2998-3355 (Online) | ISSN: 2998-3320 (Print)

Email: [email protected]

Portico

Portico

All published articles are preserved here permanently:
https://www.portico.org/publishers/iece/

Copyright © 2024 Institute of Emerging and Computer Engineers Inc.