The scientific knowledge graph is an emerging research method in this context. In the research of physical education teaching, the research and sorting out of the research results of physical education teaching in my country from the perspective of scientometrics and information visualization is still slightly insufficient. The similarity between the frontiers of physical education teaching research in China and the United States in the past five years is that both countries have paid more attention to research topics such as physical education teaching methods and physical education courses. This paper proposes a rough set knowledge reduction algorithm based on improved genetic algorithm. The support and importance of conditional attributes to decision attributes are introduced into the information system, which are added to the genetic algorithm as heuristic information, and the concepts of population dissimilarity and individual dissimilarity are proposed to improve the genetic algorithm. The research on school physical education in my country is biased towards problem research, while the research on physical education teaching methods in the United States is biased towards student health; In addition, starting from the national conditions, the hotspots in the field of physical education teaching in my country tend to be “Sports and Health Curriculum Standards”, physical education teachers, physical education ideas, educational theories and college sports, while the hotspots in the field of physical education teaching in the United States tend to be physical activity, children and adolescents , students, women, exercise education, physical education, self-determination theory, and the integration of psychological motivation and physical education. Experimental data analysis my country’s physical education curriculum research should appropriately increase the attention to the details of physical education curriculum, and my country’s physical education teaching practice research should appropriately increase the research on physical education from the perspective of public health.
At present, after more than ten years of development, evolution, and systematization, physical education teaching theory has begun to show the prototype of its discipline system [13,15]. It has developed its own discipline-specific terminology, research paradigms, and theoretical systems, and occupies an important position in the physical education discipline system. The research object of physical education theory is physical education [9,5], which is a subordinate concept of teaching [11]. Like other forms of teaching, physical education is an educational process that includes attributes such as imparting knowledge and skills.
With the rapid development of science and technology, the continuous introduction of new foreign physical education concepts and methods, and the country’s growing demand for educational reform, research in the field of physical education in China has become very active, yielding many important results. According to the analysis of foreign physical education research from 2004 to 2013 by scholars such as Gao Ming, it is evident that most of the current research in physical education globally comes from developed sports countries such as those in North America and Europe, where the research results dominate [11]. Therefore, the author believes that, at present, physical education in western countries, particularly the United States, has developed more successfully than in China, with more abundant and diverse research outcomes [3]. A comparative analysis of the physical education research fields in China and the United States reveals that there are many lessons to be learned from the development of physical education in the West [8,12].
Faced with these abundant research results, questions arise: What are the recent research hotspots in physical education teaching in China and the United States? What changes have been made in the research issues? What research results exist, and where will the research trends go? Addressing these questions will help grasp the direction of research in the field of physical education teaching in China and the United States, which is of great significance for the development of this field in China [12,1].
Currently, domestic research in the field of physical education teaching in China mainly focuses on the analysis of individual basic elements in physical education [10,2], while overall research in the field is relatively insufficient. Most of the research is centered around content analysis, research hotspots [4], status quo analysis [6], and trends [16], but comparative studies with foreign frontiers and hotspots remain scarce. As a result, it takes a lot of time for researchers to understand the existing research results both at home and abroad.
In today’s era of rapid informatization, where vast amounts of digital information are available, how to effectively mine and discover its characteristics and patterns has become one of the major research focuses in the field of scientometrics. The scientific knowledge graph is an emerging research method in this context. Based on the above situation, this paper proposes a visual analysis and comparison of physical education teaching in China and the United States from the perspective of a knowledge graph, hoping to provide some reference for scholars in the field of physical education teaching in the future.
Knowledge reduction is the core problem of rough set theory, and its essence is to eliminate irrelevant or unimportant redundant knowledge while maintaining the classification ability of the knowledge base. Typically, the knowledge reduction method of a knowledge base is not unique, and the complexity of computing knowledge reduction increases exponentially with the size of the decision table. Existing reduction algorithms mainly focus on the kernel of rough sets and use heuristic search methods to construct reductions with the least number of conditional attributes, known as minimum reductions. However, this algorithm is not applicable to all knowledge expression systems, and as the problem size increases, the complexity also increases. Genetic algorithms (GA) offer the advantages of global optimization and implicit parallelism, making them suitable for solving knowledge reduction problems [7,14].
Based on the analysis of the rough set knowledge reduction method, this paper proposes an improved genetic algorithm (IGA+RS) for rough set knowledge reduction. By incorporating the support and importance of conditional attributes to decision attributes as heuristic information, and introducing the concepts of population dissimilarity and individual dissimilarity, the proposed IGA+RS algorithm effectively improves the efficiency of knowledge reduction.
Rough set theory, introduced by Polish scholar Pawlak in 1982, is a mathematical framework for analyzing imprecise and uncertain information. It is widely used to extract decision or classification rules for various problems. Below, we provide an overview of rough set theory and its core concepts.
An information system (or information table) is denoted as
This quadruple is also referred to as a decision system or decision table.
For an information system
For a subset
Given an information system
Let
For
In the information system
The value of
–
–
A larger value of
In the information system
For the special case where
Thus, the importance of an attribute
Additionally, the relative kernel of the condition attribute set can be defined as:
The relative kernel represents the subset of attributes in
To accelerate the convergence of the genetic algorithm, this study proposes an improved genetic algorithm based on rough set theory to identify the minimal reduction in decision-making problems. Specifically, it aims to find the attribute set with the fewest conditional attributes among all relative reductions. The implementation and improvement of the genetic algorithm are carried out in the following aspects.
The key operating parameters in a genetic algorithm include
population size (
Another critical parameter is the termination condition, which determines whether the population has stabilized and no longer exhibits evolutionary progress. Two common criteria for this are:
The difference in the optimal fitness value between successive generations is less than a specified minimum threshold.
The variance of fitness values across the entire population is below a specified minimum threshold.
In the proposed
The fitness function serves as the sole deterministic metric for evaluating individual bit strings in the genetic algorithm. According to the definition of attribute reduction, the fitness of an individual is influenced by two factors:
1. The number of attributes it contains: The number of “1”s in the chromosome represents the number of attributes. Fewer attributes increase the likelihood of selection.
2. The attribute’s ability to distinguish: The greater the number of instances distinguished by the chromosome, the higher its likelihood of being selected.
The fitness function is defined as:
–
–
–
–
–
The goal of
For a given population of size
Population diversity is a critical measure of the evolutionary state
of a genetic algorithm. To optimize the crossover operation, both
population dissimilarity and individual dissimilarity are defined. The
degree of similarity between individuals
The dissimilarity between individuals
For a population of size
As the genetic algorithm evolves, population dissimilarity decreases. Random pairing may lead to invalid inheritance, where offspring are identical to parents. To address this, a dissimilarity-based pairing algorithm is employed:
Sort individuals by fitness values in descending order.
Select the individual with the highest fitness,
Remove
In this paper, the support and importance of condition attributes to decision attributes are introduced into the genetic algorithm as heuristic information. The algorithm flow of the IGA+RS algorithm is shown in Figure 1.
The IGA+RS algorithm first calculates the support and importance of
the condition attribute set
1. Compute the support
2. Calculate the importance
Let the relative core of
If
This paper uses CiteSpace V (version 5.1.R8) to analyze the
co-citation network knowledge graph of 946 documents from 2012 to 2016.
The parameters used include Years Per Slice in Time Slicing set to 1,
Top
A total of 39 clusters were formed in the clustered knowledge graph of the frontiers of physical education research in the United States. Among them, the 8 clusters with the largest areas are #0, #1, #2, #3, #4, #5, #9, and #10, as shown in Table 1. These clusters reflect the major research fronts in the field of physical education teaching research in the United States.
Cluster Number | Size | Outline | Clustering Keywords |
#0 | 39 | 0.941 | Practice-Based Model |
#1 | 34 | 0.902 | Physical Activity Items |
#2 | 32 | 0.765 | Sports Courses |
#3 | 32 | 0.857 | Quantitative Study |
#4 | 29 | 0.812 | Prospective Cross-Cutting Survey |
#5 | 18 | 0.961 | Preservice Teacher Education |
#9 | 6 | 0.973 | Role of Teachers |
#10 | 6 | 0.969 | Self Achievement Goal |
As shown in Figure 2, the cluster names of the 8 clusters reflecting the main research frontiers in the field of physical education teaching in the United States are:
#0: Models-Based Practice,
#1: Physical Activity Program,
#2: Adventure-Physical Education Lesson,
#3: Quantitative Findings,
#4: Prospective Cross-Domain Investigation,
#5: Initial Teacher Education,
#9: Teachers Support, and
#10: Self-Reported Achievement Goal.
The main research front area #0 includes 39 cited literatures
spanning 2006–2015. Eight key documents represented by nodes are listed
in Table 2. These documents cover topics such as physical
education teachers’ professional development, physical education models,
reforms, teaching methods, and cooperative learning in sports. Among the
19 citing documents, 11 exhibit a citation activity of
U | C1 | C2 | C3 | C4 | C5 | C6 | C7 | C8 | C9 | d |
U1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 |
U2 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 21 | 2 |
U3 | 1 | 1 | 2 | 1 | 1 | 2 | 2 | 2 | 2 | 2 |
U4 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 2 | 1 | 3 |
U5 | 1 | 2 | 2 | 2 | 2 | 1 | 3 | 2 | 2 | 2 |
U6 | 2 | 2 | 2 | 1 | 2 | 1 | 3 | 2 | 3 | 1 |
U7 | 1 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 |
U8 | 2 | 2 | 2 | 2 | 2 | 2 | 3 | 2 | 2 | 1 |
U9 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 2 |
U10 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 2 | 2 | 1 |
Take the population size
Algorithm | Minimum Convergence Algebra | Attribute Reduction Set | Average Operation Time |
General Reduction Genetic Algorithm | 15 |
|
1034ms |
IGA+RS Algorithm | 8 |
|
629ms |
The experimental results in Table 3 demonstrate that the IGA+RS algorithm is correct and effective. Compared with the general reduction genetic algorithm, it operates faster, converges better, and more effectively achieves the minimum reduction.
The scientific knowledge graph is an emerging research method that provides new insights into this context. This paper proposes a rough set knowledge reduction algorithm based on an improved genetic algorithm. The support and importance of conditional attributes to decision-making attributes are introduced into the information system and integrated into the genetic algorithm as heuristic information. Additionally, the concepts of population dissimilarity and individual dissimilarity are proposed to enhance the genetic algorithm. The study finds that physical education teaching research in China tends to focus on problem-oriented research, while research in the United States emphasizes student health. By starting from the basic concept of an information system, this paper introduces the support and importance of conditional attributes to decision-making attributes as heuristic information into the genetic algorithm and proposes the IGA+RS algorithm for knowledge reduction. The experimental results demonstrate that the IGA+RS algorithm is an effective knowledge reduction method. It can significantly improve the efficiency of rough set knowledge reduction, offering both speed and effectiveness. In future research, this algorithm could be applied to the reduction of mass production data, which would have certain guiding significance for optimizing production operations.
This work was sponsored in part by Hunan Provincial Department of Education Scientific Research Project (23B1032).
1970-2025 CP (Manitoba, Canada) unless otherwise stated.