Contents

Construction and analysis of a business English classroom teaching quality evaluation system based on multimodal neural network model assessment

Rui Chai1
1Department of Fundamental Courses, Tianjin Vocational Institute, Tianjin 300410, Tianjin, China.

Abstract

In this paper, we propose a method for effectively evaluating the quality of business English teaching in colleges and universities. The approach is based on a multimodal neural network model integrated with grey correlation analysis. By employing the optimal data clustering criterion, we identify teaching quality evaluation indices. Subsequently, we establish a teaching quality evaluation index system using a genetic algorithm (GA) optimized Radial Basis Function (RBF) neural network. Grey correlation analysis is then applied to assess the quality of business English teaching by considering the relationship between the correlation degree and the evaluation level. The results indicate a correlation degree exceeding 0.90, signifying excellent teaching quality. The reliability of the selected evaluation indicators, assessed through retesting, surpasses 0.700, validating the evaluation results.

Keywords: Business English, Teaching quality, Evaluation method, Neural network model, Grey correlation analysis, Genetic algorithm, RBF neural network

1. Introduction

As the reform of teaching materials, teaching modes, and teaching methods based on the Internet and multimedia has intensified, there are various teaching materials and methods for English at the university. Research results related to them are abundant, but their disadvantages should not be underestimated [1]. Currently, most schools still adopt traditional methods of section and term examinations to evaluate students’ academic performance. However, due to differences in teaching materials, schedules, student populations, and teacher standards, term examination papers are bound to have great variability, with some being irregular, leading to low reliability and validity of the papers’ quality. Moreover, the proportion of process evaluation methods is unreasonable, and the content and methods of assessment are irregular, resulting in subjective evaluations by teachers with obvious partiality and unfair grading [2]. The lack of an effective teacher teaching quality evaluation system to monitor and guide teachers in terms of content, quality, methods, and assessment further complicates ensuring consistent English teaching quality across geographical, professional, and student population differences [3].

Significant differences exist between high school and university English in terms of teaching aims, requirements, modes, methods, and assessment methods [4]. Despite these differences, there is a prevalence of repeated teaching content, such as the repetition of vocabulary and grammar teaching, which can lead to a loss of student interest and motivation to progress. Freshmen often approach learning with a passive mindset [5]. It is crucial to familiarize them with the university English course requirements, actively adjust their learning strategies, and help them make a smooth transition from high school English. Additionally, listening and speaking skills are generally uneven among university freshmen in ordinary colleges and universities [6]. This issue is primarily influenced by the rate of advancement in secondary and higher education examinations. Many regional secondary schools lack systematic training classes on listening and speaking. A considerable number of students report difficulties understanding college English classes, struggling to follow them, and feeling reluctant to speak. Implementing the course requirements for listening and leading classroom teaching in college English becomes challenging [7]. Furthermore, the English language teaching is not consistently implemented, leading many students to view English study at the university as solely for passing Level 4 and 6 exams. Once these exams are passed, they lose motivation to progress [8]. Presently, university students generally lack clear English learning goals, a positive attitude, and independent learning ability. The phenomenon where university students not only fail to use English fluently after four years of study but also regress in English during university requires sufficient attention [9].

There are various methods for evaluating teaching quality, including qualitative, quantitative, and integrated methods that combine both [10]. Each method has its unique advantages and scope of application. In the detailed application process, it is essential to choose a suitable method based on the assessment objectives and actual assessment needs [11]. The grey correlation analysis method is a qualitative and quantitative evaluation method that targets grey systems with incomplete and clear information. This method has a less complex structure, lower computational requirements, and is not strictly constrained by the number of evaluation samples, making it more applicable [12]. In this context, a study [8] constructed a distance learning quality evaluation model and implemented teaching quality evaluation based on support vector machines, achieving improved evaluation results. Another work [31] proposed an intelligent algorithm-based teaching quality evaluation method, using a genetic algorithm to enhance the BP neural network for teaching quality evaluation, resulting in faster convergence speed and improved evaluation accuracy.

This paper addresses the issue of evaluating the quality of English teaching in universities and proposes a method based on a genetic algorithm to enhance the RBF neural network for teaching quality evaluation. The method utilizes a genetic algorithm to optimize the initial weights and radial basis parameters of the RBF neural network, and the evaluation indexes are selected through principal component analysis.

2. A Method for Evaluating the Quality of Business English Teaching in Universities Based on Grey Correlation Analysis

2.1. Evaluation Index Selection Method Based on Optimal Data Clustering Criteria

The prerequisite for optimal data clustering is that the differences between the indicators of business English teaching quality evaluation in higher education are small [14]. The standard interval for setting the clustering type of the college business English teaching quality evaluation indicators is \(f\), which needs to be at the optimal value according to the optimal clustering requirement. \(f\) is usually set by experts so that the attributes of the clustering results can be maximized [15]. The attributes of the university business English teaching quality evaluation indicators are set to \(X\), the clustering categories of the university business English teaching quality evaluation indicators are set to \(c\), the value of each attribute among the university business English teaching quality evaluation indicators is set to \(x_{n}\), and the clustering deviation \(\delta^{2}(X)\) among each evaluation indicator is as follows: \[\label{GrindEQ__1_} \delta^{2}(X)=\frac{1}{n} \sum _{k=1}^{n}\left(x_{n} -\bar{x}\right)^{2},\tag{1}\] where \(\bar{x}\) is the mean value of the set of evaluation indicator attributes \(X\). In other words, this value is larger if the values of the evaluation indicator attributes in \(\bar{x}=\frac{1}{n} \sum _{k=1}^{n}x_{k}\) are more different from each other, and vice versa, indicating a higher degree of approximation between the values of the evaluation indicator attributes in \(X\). \(n\) is the number of indicators. The error \(\delta^{2}\left(X_{i}, \bar{X}_{i}\right)\) of the evaluation indicator clustering result \(i\) is Equation \ref{GrindEQ__2_}: \[\label{GrindEQ__2_} \delta^{2}\left(X_{i}, \bar{X}_{i}\right)=\frac{1}{n} \sum _{k=1}^{n}\left(x_{n} -\bar{X}_{i}\right)^{2}.\tag{2}\]

The mean of the \(c\) clustering errors is given by Equation \ref{GrindEQ__3_}, \[\label{GrindEQ__3_} \delta^{2}(X, \bar{X}) = \frac{\frac{1}{c} \sum_{i=1}^{n}\delta^{2}\left(X_{i}, \bar{X}_{i}\right)}{\xi^{2}(X)},\tag{3}\] where \(\bar{X}_{i}\) and \(\delta^{2}\left(X_{i}, \bar{X}_{i}\right)\) are the mean value of \(c\) clusters and the mean value of clustering close water, respectively. \(\delta^{2}\left(X_{i}, \bar{X}_{i}\right)\) A smaller value represents a smaller spacing of \(c\) evaluation indicators; a larger spacing of evaluation indicators results in a larger value, and this value can describe the clustering effect of the clustering evaluation indicators.

The mean value of the distance between the \(c\) clusters \(D(X, R)\) is given by Equation \ref{GrindEQ__4_} \[\label{GrindEQ__4_} D(X, R) = \frac{1}{2c} \sum_{i=1}^{c}\sum_{j=1}^{c}\left|r_{i} – r_{j}\right|,\tag{4}\] where the centers of clusters \(i\) and \(j\) are set to \(r_{i}\) and \(r_{j}\) in that order.

The clustering needs to take into account the inter-class closeness and the inter-class distribution [16]. Setting the weights to \(\alpha\) and \(\beta\), the evaluation index for obtaining the optimal clustering is given by Equation \ref{GrindEQ__5_} \[\label{GrindEQ__5_} S(X, R) = \frac{\delta^{2}(X, \bar{X})}{\beta /\alpha} + \frac{\alpha}{D(X, R)},\tag{5}\] where \(\delta^{2}(X, \bar{X})\) describes the high level of clustering closeness; \(\frac{\alpha}{D(X, R)}\) describes the correlation between the state of the cluster distribution and the geometric orientation of the cluster centers, which is used to balance the interference of the \(2\) evaluation indicators with the clustering criterion.

2.2. Quality Evaluation Index System for Business English Teaching in Higher Education Institutions

The process of constructing a quality assessment system for business English teaching and learning in higher education is illustrated in Figure 1.

According to Figure 1, the process of constructing the evaluation index system involves several key steps. Firstly, for the evaluation index system of the quality of business English teaching in colleges and universities, the application of the evaluation index screening method based on the optimal data clustering criterion is crucial to obtain the relevant indices. Secondly, a set of primary and secondary evaluation indicators is defined to comprehensively assess the teaching quality. Finally, employing the Telfer method is essential to evaluate the critical level of indicators. This leads to the removal of non-critical indicators based on their critical level, ensuring the retention of indicators with a high critical level (\ref{GrindEQ__1_}, \ref{GrindEQ__2_}, \ref{GrindEQ__3_}).

3. Genetic Algorithm Optimization of RBF Neural Networks

3.1. RBF Neural Networks

RBF neural networks, rooted in function approximation theory, represent feed-forward neural networks renowned for their robust global optimization capability. Widely applied in signal processing, image processing, and pattern recognition, they have proven effectiveness [17].

3.2. Genetic Algorithms

To determine the optimal initial parameters of the RBF neural network before training, a genetic algorithm was employed. This algorithm ascertains the optimization direction by randomly generating multiple starting points and executing adaptive fast parameter search through selection, crossover, and variation operations [18].

The RBF neural network involves a continuous parameter optimization process, and to enhance parameter optimization accuracy, floating-point coding is employed. This coding method avoids subsequent encoding and decoding in selection, crossover, and mutation processes, improving convergence efficiency. It also overcomes the limitations of traditional binary encoding word length, ensuring high parameter optimization accuracy [19].

For the optimization problem of RBF neural network parameters, the fitness function for the evolutionary operation of the genetic algorithm utilizes the inverse of the mean square error between the desired output and the actual output of all individuals in the population. Mathematically, it is expressed as follows: \[\label{GrindEQ__6_} E=\frac{1}{\sum _{k=1}^{N}\left(T_{k} -Y_{k} \right)^{2}},\tag{6}\] here, \(N\) denotes the number of chromosomes in the population, \(Y\) represents the actual output of the model, and \(T\) corresponds to the desired output of the model.

The classical proportional selection method, known as the roulette wheel method, is employed to compute the individual fitness function of each chromosome and rank them based on their fitness function values. This method establishes that the higher the fitness function value, the greater the probability of selecting the corresponding chromosome. For a chromosome \(b_{i}\) with a fitness function value of \(E_{b_{i}}\), the selection probability is given by \[\label{GrindEQ__7_} P\left(b_{i}\right)=\frac{E_{b_{i}}}{E}.\tag{7}\] Here, \(E\) represents the overall fitness function for all chromosomes in the population. Equation \ref{GrindEQ__7_} elucidates that the chromosome’s fitness determines its likelihood of selection. However, to enhance population diversity and prevent entrapment in a local optimum, it is essential to also select some of the less adapted chromosomes for inheritance in the next generation population.

Crossover is a process where the codes of two chromosomes are exchanged according to certain rules to enhance population diversity, resulting in the evolution of two new chromosomes. In genetic algorithms, most new chromosomes arise from crossover operations, which are crucial for genetic merit search [20]. In the early stages of genetic evolution, where the fitness function of individual chromosomes is small, high-probability crossover operations are needed to enhance the global merit search ability of the genetic algorithm. In the late stages, as crossover operations lead to larger fitness functions for chromosome individuals, operations should be performed with smaller probability to improve the local merit search ability. The crossover probability in genetic evolution is set as follows: \[\label{GrindEQ__8_} P_{c}^{\prime} =\left\{\begin{array}{c} {P_{c\max}, \quad E_{\max} < E_{\text{{mean}}}} \\ P_{c\max} -\frac{P_{c\max} -P_{c\min}}{{\text{{iter}}_{\max}}}\times {\text{{iter}}, \quad E_{\max} \ge E_{\text{{mean}}}} \end{array}\right.\tag{8}\] Here, \(E_{\max}\) represents the maximum fitness function value of two chromosomes to be crossed in the parent population, \(E_{\text{{mean}}}\) represents the average fitness function value of chromosomes in the parent population, \(\text{{iter}}\) represents the current iteration times of genetic evolution, \(\text{{iter}}_{\max}\) represents the maximum iteration times of genetic evolution, and \(P_{c\max}\) is the preset maximum crossover probability value.

The mutation operation involves disturbing and mutating certain gene loci of chromosomes to produce new chromosome individuals in the process of biological evolution. In the evolution of the genetic algorithm, mutation is a crucial step to update chromosome individuals and increase the optimization ability. The adaptive setting of mutation probability is carried out with reference to the crossover probability setting method. The mutation probability setting method is: \[\label{GrindEQ__9_} P_{m}^{\prime} =\left\{\begin{array}{c} {P_{m\max}, \quad E<E_{\text{{mean}}}} \\ P_{m\max} -\frac{P_{m\max} -P_{m\min}}{{\text{{iter}}_{\max}}}\times {\text{{iter}}, \quad E\ge E_{\text{{mean}}}} \end{array}\right.\tag{9}\] Here \(E\) represents the chromosome fitness function value to be mutated in the parent population, and \(\text{{iter}}\) and \(\text{{iter}}_{\max}\) represent the current iteration times and maximum iteration times, respectively. Equation \ref{GrindEQ__9_} indicates that when the fitness function value of a chromosome in the early stage of evolution is lower than the mean value, the variation probability setting is small to retain excellent chromosome individuals. With the continuous progress of genetic evolution, when the fitness function value of a chromosome is higher than the mean value, the mutation probability can be adjusted to increase the local optimization ability of the genetic algorithm.

3.3. Parameter Optimization Model of RBF Neural Network

The genetic algorithm can optimize the dynamic weight of the RBF neural network, including the center and width of the radial basis function. The structure of the RBF neural network optimized based on the genetic algorithm for evaluating English teaching quality is illustrated in Figure 2.

3.4. Teaching Quality Evaluation Model

The evaluation index system serves as the foundation for English teaching quality evaluation, and a scientifically sound index system is a crucial factor in ensuring the effectiveness of teaching quality evaluation. Currently, various principles govern the construction of index systems for evaluating teaching quality, such as those based on teaching content and students’ achievements. In the context of evaluating English teaching quality, this paper establishes a specific evaluation index system, depicted in Figure 3.

The English teaching quality evaluation process proposed in this paper unfolds as follows:

Step 1
Construct the evaluation index system and collect relevant data for teaching quality assessment.
Step 2
Perform data standardization to generate the evaluation index correlation coefficient matrix.
Step 3
Utilize principal component analysis to select English teaching quality evaluation indices, eliminating redundant ones and achieving feature dimensionality reduction.
Step 4
Employ the RBF neural network, optimized by the genetic algorithm, to assess teaching quality and produce evaluation results.

4. Model Testing and Analysis

Using three teachers A, B, and C from a university as examples, this paper’s methodology was applied to evaluate the teaching quality of these three business English teachers in a specific university. The evaluation index system established by this method is illustrated in Figure 2. The evaluation indices for the three business English teachers are presented in Figure 4.

Due to differences in dimension, order of magnitude, and performance mode among the evaluation indices in the teaching quality evaluation index system, a direct comparison is not feasible. Consequently, the experiment uses the secondary index of College Business English teaching quality evaluation, as depicted in Figure 2, as an example for normalization. The normalization results are illustrated in Figure 5. The weights of the 19 secondary indicators are presented in Figure 6.

The results of the correlation analysis for the teaching quality of the three business English teachers in the university are depicted in Figure 7. The highest correlation is 0.95, and the corresponding evaluation level for this maximum correlation is deemed “very good.” Consequently, the evaluation result for the teacher’s teaching quality is considered very good. For Teacher B, the correlations for teaching quality are 0.98, 0.88, 0.65, and 0.45, with the maximum correlation being 0.98. The corresponding evaluation level for this maximum correlation is “very good,” indicating an overall very good evaluation for Teacher B’s teaching quality. Similarly, for Teacher C, the correlations are 0.98, 0.88, 0.65, and 0.45, with the maximum correlation being 0.98, and the corresponding rating level is “very good.” Thus, the teaching quality for Teacher C is also evaluated as very good. Overall, the evaluation results for the quality of business English teaching in all three universities are very good.

3.5. Selection of Evaluation Indicators by Principal Component Analysis

Principal component analysis was employed to analyze the English teaching quality evaluation indicators in Figure 4. The indicators contributing the most to the evaluation of teaching quality were selected. The results of the principal component analysis are illustrated in Figure 8.

Figure 8 illustrates that the cumulative contribution of the first 8 principal component indicators to the evaluation of the quality of English language teaching has reached 94.3%, surpassing the predetermined threshold for cumulative contribution. This suggests that the first 8 indicators can effectively represent the crucial information contained in all the indicators, serving as a reliable basis for future evaluations of English teaching quality.

3.6. Analysis of Teaching Quality Evaluation Results

The genetic algorithm was applied to optimize the RBF neural network model for evaluating the quality of English teaching. As shown in Figure 9, the original RBF neural network converged in approximately 45 iterations. After optimization using the genetic algorithm, convergence occurred in about 30 iterations, indicating that the genetic algorithm significantly enhances the convergence speed of the RBF neural network and reduces the model’s training time. Additionally, the mean squared error of the RBF neural network optimized by the genetic algorithm consistently remains lower than that of the original RBF neural network, implying that the genetic algorithm contributes to improving the prediction accuracy of the model.

Figure 10 demonstrates that the accuracy of the original RBF neural network exceeds 81%, but the genetic algorithm-optimized RBF neural network exhibits superior accuracy. Statistical results indicate that the accuracy of the genetically optimized RBF neural network model surpasses 90% for 36 out of 40 test samples, with 30 of them exceeding 93

Conclusions

This paper presents an English teaching quality evaluation model based on a genetic algorithm-optimized RBF neural network. The model employs principal component analysis to refine evaluation indexes and utilizes a genetic algorithm to optimize the weight parameters of the RBF network, achieving high accuracy in English teaching quality evaluation. The research introduces a novel method for assessing teaching quality in universities. The method, tested for evaluating the quality of business English teaching in three universities, demonstrates significant stability and consistency in evaluation results over time.

References:

References

  1. Juan, D., and Hong Wei, Y., 2021. Particle swarm optimization neural network for research on artificial intelligence college English classroom teaching framework. Journal of Intelligent and Fuzzy Systems, 40(2), pp.3655-3667.[Google Scholor]
  2. Diao, L., and Hu, P. 2021., Deep learning and multimodal target recognition of complex and ambiguous words in automated English learning system. Journal of Intelligent and Fuzzy Systems, 40(4), pp.7147-7158.[Google Scholor]
  3. Zhang, Y., 2021. The development of an evaluation model to assess the effect of online English teaching based on fuzzy mathematics. International Journal of Emerging Technologies in Learning (iJET), 16(12), pp.186-200.[Google Scholor]
  4. Zawacki-Richter, O., Mar\’in, V. I., Bond, M., and Gouverneur, F., 2019. Systematic review of research on artificial intelligence applications in higher education-where are the educators?. International Journal of Educational Technology in Higher Education, 16(1), pp.1-27.[Google Scholor]
  5. Mostafa, M. M., and El-Masry, A. A., 2013. Citizens as consumers: Profiling e-government services’ users in Egypt via data mining techniques. International Journal of Information Management, 33(4), pp.627-641.[Google Scholor]
  6. Zhang, C., and Guo, Y., 2021. RETRACTED ARTICLE: Mountain rainfall estimation and online English teaching evaluation based on RBF neural network. Arabian Journal of Geosciences, 14(17), pp.1-11.[Google Scholor]
  7. Venkatesh, V., 2000. Determinants of perceived ease of use: Integrating control, intrinsic motivation, and emotion into the technology acceptance model. Information systems research, 11(4), pp.342-365.[Google Scholor]
  8. Venkatesh, V., and Davis, F. D., 2000. A theoretical extension of the technology acceptance model: Four longitudinal field studies. Management science, 46(2), pp.186-204.[Google Scholor]
  9. Philippe, S., Souchet, A. D., Lameras, P., Petridis, P., Caporal, J., Coldeboeuf, G., and Duzan, H., 2020. Multimodal teaching, learning and training in virtual reality: a review and case study. Virtual Reality and Intelligent Hardware, 2(5), pp.421-442.[Google Scholor]
  10. Lin, J., Li, J., and Chen, J., 2021. An analysis of English classroom behavior by intelligent image recognition in IoT. International Journal of System Assurance Engineering and Management, pp.1-9.[Google Scholor]
  11. Haixiang, G., Yijing, L., Shang, J., Mingyun, G., Yuanyue, H., and Bing, G., 2017. Learning from class-imbalanced data: Review of methods and applications. Expert systems with applications, 73, pp.220-239.[Google Scholor]
  12. Charitopoulos, A., Rangoussi, M., and Koulouriotis, D., 2020. On the use of soft computing methods in educational data mining and learning analytics research: A review of years 2010-2018. International Journal of Artificial Intelligence in Education, 30(3), pp.371-430.[Google Scholor]
  13. Yuan, Z., (2021). Interactive intelligent teaching and automatic composition scoring system based on linear regression machine learning algorithm. Journal of Intelligent and Fuzzy Systems, 40(2), pp.2069-2081.[Google Scholor]
  14. Moore, G. C., and Benbasat, I. 1991., Development of an instrument to measure the perceptions of adopting an information technology innovation. Information systems research, 2(3), pp.192-222.[Google Scholor]
  15. Poria, S., Cambria, E., Howard, N., Huang, G. B., and Hussain, A., 2016. Fusing audio, visual and textual clues for sentiment analysis from multimodal content. Neurocomputing, 174, pp.50-59.[Google Scholor]
  16. Davis, F. D., Bagozzi, R. P., and Warshaw, P. R., 1989. User acceptance of computer technology: A comparison of two theoretical models. Management science, 35(8), pp.982-1003.[Google Scholor]
  17. Li, L., 2021. Emotion Recognition of Foreign Language Teachers in College English Classroom Teaching. Frontiers in psychology, 12, pp.788552-788552.[Google Scholor]
  18. Poria, S., Cambria, E., Bajpai, R., and Hussain, A., 2017. A review of affective computing: From unimodal analysis to multimodal fusion. Information Fusion, 37, pp.98-125.[Google Scholor]
  19. Winters, P. R., 1960. Forecasting sales by exponentially weighted moving averages. Management science, 6(3), pp.324-342.[Google Scholor]
  20. DeSanctis, G., and Gallupe, R. B., 1987. A foundation for the study of group decision support systems. Management science, 33(5), pp.589-609.[Google Scholor]