Deep convolutional network design based on YOLO framework with efficiency enhancement method in target detection tasks

Tao Wang1, Yuming Xue1, Luoxin Wang1, Tianen Li2, Hongli Dai1
1Institute of New Energy Intelligence Equipment, Tianjin Key Laboratory of Film Electronic & Communication Devices, School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin, 300384, China
2Institute of Mechanical Engineering, Baoji University of Arts & Science, Baoji, Shaanxi, 721013, China

Abstract

Deep learning-based target detection algorithms outperform traditional methods by eliminating the need for manual feature design and improving accuracy and efficiency. This paper constructs a YOLOv5 target detection model using a deep convolutional neural network. To enhance accuracy, generalization, and detection speed, three data augmentation techniques—mosaic data enhancement, adaptive anchor frame, and adaptive image scaling—are applied. The model is further optimized with an attention mechanism and a modified YOLOv5 framework. A loss function and global average pooling enhance feature mapping for a fully convolutional network. Experimental results show that the improved YOLOv5n model achieves a 2.9979 percentage point increase in MAP, a 31% improvement in FPS, and a training time reduction of 10 minutes, completing 100 rounds in 20 minutes.

Keywords: YOLOv5 algorithm, data enhancement, attention mechanism, deep convolutional network, target detection task