Education, Science, Technology, Innovation and Life
Open Access
Sign In

Vehicle Target Detection Algorithm Based on Improved Faster R-CNN for Remote Sensing Images

Download as PDF

DOI: 10.23977/jaip.2024.070105 | Downloads: 21 | Views: 298

Author(s)

Yiran Yang 1

Affiliation(s)

1 Beijing University of Civil Engineering and Architecture, Beijing, 102616, China

Corresponding Author

Yiran Yang

ABSTRACT

Aiming at the problems that remote sensing image vehicle targets are susceptible to complex background interference, multi-scale differences, and difficulties in detecting small targets, this paper proposes a remote sensing image vehicle target detection algorithm based on improved Faster R-CNN. In this paper, based on the framework of Faster R-CNN, firstly, a multi-scale feature extraction network (EM-FPN) is designed by using the FPN structure and ResNet50 network, so that the network extracts rich target features; secondly, the ECA attention mechanism is introduced, so that the feature extraction network focuses on the target features, suppresses the interference of irrelevant background information, and constructs the multirate dilated convolution module (MDCM) to enhance the network's ability to perceive the contextual information of the target; finally, ROI Align is used instead of ROI Pooling to reduce the feature quantization error. The experimental results prove that the accuracy of the proposed algorithm reaches 88.6%, which can effectively detect vehicle targets in remote sensing images.

KEYWORDS

Remote sensing imagery; target detection; vehicles; feature extraction; attention mechanism

CITE THIS PAPER

Yiran Yang, Vehicle Target Detection Algorithm Based on Improved Faster R-CNN for Remote Sensing Images. Journal of Artificial Intelligence Practice (2024) Vol. 7: 27-33. DOI: http://dx.doi.org/10.23977/jaip.2024.070105.

REFERENCES

[1] Li K., Wan G., Cheng G., Meng L., Han J. (2020) Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark 2019. ISPRS J. Photogramm. Remote Sens. 159, 296–307.
[2] Lv Y., Zhu H., Meng J., Cui C., Song Q. (2022) A review and adaptability study of deep learning models for vehicle detection based on high-resolution remote sensing images. Remote Sensing for Natural Resources, 34(04):22-32.
[3] Liu W., Anguelov D., Erhan D., et al. (2016) Ssd: Single shot multibox detector. European conference on computer vision, 21-37.
[4] Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection [J]. arXiv preprint arXiv: 2004.10934, 2020.
[5] Zhu, X., Lyu, S., Wang, X., Zhao, Q. (2021) TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. Proceedings of the IEEE/CVF international conference on computer vision, 2778-2788.
[6] Girshick, R., Donahue, J., Darrell, T., Malik, J., Berkeley, UC. (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 580-587.
[7] Ren, S., He, K., Girshick, R., Sun, J. (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28.
[8] Qu H., Wang M., Chai R. (2023) Efficient Vehicle Detection in Remote Sensing Images with Bi-directional Multi-scale Feature Fusion. Computer Engineering and Applications, 1-13.
[9] Zhao Q., Yang Y. (2023) Small object detection algorithm for lightweight remote snesing vehicles with multiple pyramids. Electronic Measurement Technology, 46(13):88-94.
[10] Zhang C., Lam K M., Wang Q. (2023) Cof-net: A progressive coarse-to-fine framework for object detection in remote-sensing imagery. IEEE Transactions on Geoscience and Remote Sensing, 61: 1-17.
[11] Tian Z., Huang J., Yang Y., Nie W. (2023) KCFS-YOLOv5: A High-Precision Detection Method for Object Detection in Aerial Remote Sensing Images. Applied Sciences, 13(1): 649.
[12] Targ S., Almeida D., Lyman K. (2016) Resnet in resnet: Generalizing residual architectures. arXiv preprint arXiv:1603.08029. 
[13] Wang Q., Wu B., Zhu P., et al. (2020) ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11534-11542.
[14] Yu F., Koltun V. (2015) Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122.

Downloads: 6025
Visits: 181831

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.