Multi-level information extraction based on convolutional neural network and its optimal training method

Xinwen Chen1
1College of Information Engineering, Ezhou Vocational University, Ezhou, Hubei, 436000, China

Abstract

This paper focuses on the characteristics of multilevel information extraction, based on the convolutional neural network model (CNN), introduces the multi-scale feature fusion and multilevel feature fusion strategy to study the multilevel information extraction method, and proposes the full convolutional neural network based on the attention mechanism and residual connection to form the multilevel information extraction model. Aiming at the gradient disappearance and saddle point problem of convolutional neural network, an activation gradient (AG) algorithm is proposed to optimize its training, which is improved to a class of activation gradient convolutional neural network (AG-CNN). The practical application effect of the multilevel information extraction model in this paper is verified by the information extraction work of net-pen culture in river-type reservoirs. Compared with the classical models such as UNet and ResUNet, the intersection and integration ratio (IoU), recall rate, precision rate, and F1 score of this paper’s model reach the highest 80.28%, 91.02%, 87.18%, and 89.03% among all the models, which possesses a stronger extraction capability. And in the multilevel information extraction experiments on Cifar100 and Caltech256 datasets, when the number of batch training data is greater than 100, the accuracy rate and performance of the experimental group basically remain stable.

Keywords: convolutional neural network, multi-scale feature fusion, multilevel feature fusion, activation gradient algorithm, multilevel information extraction