Topological Indices (TIs) are quantitative measures derived from molecular geometry and are utilized to predict physicochemical properties. Although more than 3000 TIs have been documented in the published literature, only a limited number of TIs have been effectively employed owing to certain limitations. A significant drawback is the higher degeneracy resulting from the lower discriminative power. TIs utilize simple graphs in which atoms and bonds are conceptualized as the vertices and edges of mathematical graphs. As multiple edges are not supported in these graphs, double and triple bonds are considered single. Consequently, the molecular structure undergoes alterations during the conversion process, which ultimately affects the discriminative power. In this investigation, indices for double-bond incorporation were formulated to preserve structural integrity. This study addresses, demonstrates, and verifies a set of double-bonded indices. The indices demonstrated promising results, exhibiting enhanced discriminative power when validated for polycyclic aromatic hydrocarbons using regression analysis. These indices and their potential applications will significantly contribute to QSAR/QSPR studies.
Topological indices are interdisciplinary research that apply graph theory concepts (mathematics) to the molecular structures of chemical compounds. They have a wide range of applications in pharmaceutical sciences, material studies, nanostructures, and many others through quantitative structure-property/activity relationship (QSAR) [10]. Topological analysis of molecular structures yields numerical descriptors or polynomials that reflect atomic interconnectivity. These mathematical representations, derived from the chemical topographic structure, can be expressed as simple graphs. The underlying principle relies on the binary relationships between atoms within the molecular framework [35]. Simple graphs were constructed by depicting the atoms as vertices, and the bonds in the constitutional formulas were represented as edges [2]. In these simple chemical graphs, the type of bond is neglected, that is, all double and triple bonds are considered single bonds [36]. The resonance phenomenon constitutes the primary rationale for disregarding double and triple bonds [35]. Resonance, a fundamental chemical concept, describes the representation of a single molecular entity as a composite of multiple electronic structures, wherein the atomic arrangement remains constant while the electron distribution varies [55]. Therefore, to mitigate the uncertainty associated with bond positioning during the calculation of TIs, researchers have excluded double and triple bonds from consideration. For instance, the resonance phenomenon in Naphthalene, a compound comprising two benzene rings, exhibits two contributing structures, as in Figure 1.
Thus, the position of the bonds resonates within the chemical compound, as in Figure 2.
Topological indices (TIs) are primarily classified into two categories: those based on distance and those based on degree. The pioneering TI, known as the Wiener index W(G), falls under the distance-based category and was first introduced by Wiener in 1947. This index was formulated using a set of numerical values derived from the distances between vertices (atoms), which were calculated by determining the minimum number of edges (bonds) separating any two atoms [17]. It can be easily understood that the minimum distance between any two atoms remains the same irrespective of the nature (single/double) of the bond. This negligence does not affect the correlation ability of distance-based TIs.
The quantity of edges incident on any atom is known as the vertex degree, which corresponds to the valence of the atom in any molecular structure [3]. A wide range of degree-based indices, such as Sombar and Zagreb, have been developed to predict the physiochemical features of a group of chemical compounds [22], [50], [8]. As per survey, approximately 3000 indices have been formulated https://onlinelibrary.wiley.com/doi/toc/10.1155/1469.si.474120.
However, a few well-established TIs have been widely used because of a few drawbacks. One of these is the degeneracy created by the less discriminative power of TIs. Degeneracy occurs when the same TI value is obtained for two different molecular structures. For instance, the numerals obtained for cyclohexane and benzene were the same when calculated for any TI. This is because during the conversion of a chemical compound to a simple graph, the structure of the molecule is not retained because the double bonds are considered single bonds. The simple graph of both molecules possesses the same structure, as shown in Table 1.
To address this issue, a literature survey was performed on TIs, their new formulations, and the molecular structures used to validate these TIs. The survey ensured the necessity of formulating indices that incorporate double bonds. Further investigation of various other sets of chemical compounds is necessary to enable the practical use of TIs in the field of QSAR/QSPR. In this study, new indices incorporating double bonds were evaluated for polycyclic aromatic compounds (PAHs) using an edge partition technique. These proposed indices use the concept of duplicate graphs, which has never been explored previously in chemical graphs. Therefore, the process of edge partitioning using these simple and duplicate graphs is illustrated using the naphthalene structure. The results are used to envisage the properties specifically, Boiling Point (BP), Flash Point (FP), Enthalpy of Vaporization (EV), Index of Refraction (IR), Polarizability (Pol), Octanol water partition coefficient (Log P), Surface Tension (ST), Molar Mass (MM), Molar Volume (MV), Retention Index (RI), Molecular Weight (MW), X log P3-AA, Heavy Atom Count (HAC), and Complexity. Predictions were validated using linear and quadratic regression analyses. The analysis is portrayed visually, and the prospects of the study are discussed in the final section. The results of this study would make a substantial difference in QSAR/QSPR studies, thereby increasing the practical use of TIs in virtual screening of lead compounds.
The extant literature reveals a paucity of topological indices (TIs) capable of accommodating multiple bonds, with only a select few—namely Sze, X, J, w, and IDW functioning as edge-weighted and vertex-weighted indices [14]. These TIs encapsulate fundamental atomic properties, including atomic number, electronegativity, van der Waals area and volume, and covalent radius [30, 25, 26, 6]. Formulated using vertex-adjacency and atom-connectivity matrices of multigraphs, these indices necessitate an evaluation of their predictive efficacy to broaden their applicability [28]. Notably, a comprehensive analysis of multigraph indices for elucidating the general physicochemical characteristics of chemical compounds remains unexplored. The past decade and a half have witnessed a conspicuous absence of extensive research on chemical structures utilizing multigraph-based TIs [48, 27, 44, 39, 60, 58, 56, 41, 4, 47, 61, 16, 9, 29, 21, 12, 40, 31, 23, 32, 42, 43, 15, 57].
Current research methodologies fail to account for resonant structures, prompting the integration of novel formulations into established degree-based indices to encompass double bonds. These refined indices may prove instrumental in mitigating degeneracy in future studies involving diverse sets of chemical compounds.
In literature, new formulations are given for existing TIs [37, 1] aimed at enhancing their discriminative power [45]. These proposed TIs undergo validation through regression analysis and various chemical compound sets, with a focus on Polycyclic Aromatic Hydrocarbons (PAHs). Notable examples include the theta, Pi and Sadhana indices, eccentricity-based topological indices [50], Neighbourhood eccentricity-based indices [59], two novel temperature-based TIs [19], elliptic Sombar index and connection number-based Zagreb indices are validated using PAHs for QSAR/QSPR models [34]. A multitude of PAHs exists such as Dodeca-benzo-circumcoronene [52], Hexa-peri-hexabenzocoronene, Hexa-cata-hexabenzocoronene [50] with numerous others [51, 59, 34, 38, 5, 20] predicted using TIs.
In PAH assessment, the generalized Randić and sum-connectivity indices demonstrate significantly higher efficiency compared to other established degree-based indices [34]. However, a trade-off exists as efficient TIs often exhibit reduced discriminative power. This allows for a meaningful comparison between newly developed and existing indices for PAHs. Specific PAH compounds are chosen for validation, as detailed in subsequent sections.
E. Sampath Kumar first proposed the concept of a duplicate graph of the parent graph in 1974. It is possible to create a duplicate graph from every finite, undirected graph [46]. Such duplicate graphs are widely employed in graph labelling techniques [53, 13, 24]. Let 𝒢(V, 𝒳) be a simple graph with l number of vertices and a total of m edges. A duplicate graph 𝒢d(A, B) of 𝒢 contains two sets of vertices and two sets of edges satisfying: (i) A = V ∪ V′ and φ = V ∩ V′, where ξ : V → V′ is a bijective relation (ii) B = Ei / ei & ξ(e′i) ∈ B, where i = 1, 2,…, m [24]. An illustration of a duplicate graph of butane is portrayed in Figures 4 to 6.
Any molecular structure was converted into a simple graph to compute TIs. Duplicate graphs are employed along with simple graphs to formulate the double bond TIs in the following steps:(a) The duplicate graph of the considered molecular structure is drawn from the simple graph (b) The weights 1, 0, w are assigned to the edges of the duplicate graph according to their correspondence with the molecular structure, (c) The edge partitioning technique is used to duplicate graphs. The process of calculating the double-bond indices is shown in Figure 7.
These formulated indices incorporate double bonds when different weights are added to different categories of edges in the duplicated graph. As a result, both the actual structure of the chemical compound and the mathematical characteristics of the graph were maintained during the entire process. The formulation of the double-bond degree-based TIs and their symbolization are listed in Table 2.
In the formulation Wi, it takes three different values 1, 0, θi according to their correspondence with the molecular structure, where θi = τi / N, τi = Number of parallel edges in each edge partition, and N is the total edge cardinality. Because these indices have not been explored in detail, an illustration of the molecular structure of naphthalene is presented. The molecular structure of naphthalene and its corresponding simple graphs are shown in Figure 8.
S.No. | Symbol | TIs | Formulation |
---|---|---|---|
1 | Inverse Sum Indeg Index – Double | ||
2 | Min-Max Roder Index – Double | ||
3 | Max-Min Roder Index – Double | ||
4 | Symmetric Division Degree Index – Double | ||
5 | Sum Connectivity Index – Double | ||
6 | Atom Bond Connectivity Index – Double | ||
7 | First Zagreb Index – Double | ||
8 | Second Modified Zagreb Index – Double | ||
9 | Second Zagreb Index – Double | ||
10 | Shigehalli & Kanbur Index – Double | ||
11 | Geometric Arithmetic Index – Double | ||
12 | Randic Index – Double | ||
13 | Sombar Index – Double |
The figure shows that the simple graph of naphthalene contains ten vertices and 11 edges. Hence, the duplicate graph of naphthalene contains 20 (two sets of 10) vertices placed parallel and twenty-two (two sets of 11) edges connecting the vertices. The process of assigning weights to the edges while preserving the properties of the graph is as follows. The weights for the 22 edges in the duplicate graph are assigned according to the nature of the corresponding edges in the two generated graph components, as shown. The edges are presented in three colors to better understand the idea of assigning weights. In the naphthalene graph, edges of weight zero are added between vertices where single bonds are present. The above graph does not possess the properties of a simple connected graph.
However, it contains two graph components (named C1 and C2), which are simply connected graphs. The two sets of vertices ϑi and ϑ′i exist and ϑi = ϑ′i, for all i = 1, 2, … 10. By using the vertex relations and labelling theorems, the vertices in C1 and C2 can be alternatively named. The entire process is portrayed in Figure 9.
Figure 9 ensures the presence of two sets of edges ei and e′i, corresponding to every edge Ei of the simple graph. Thus, two sets of vertices and edges satisfying the conditions of duplicate graphs exist in the graph components. The edges of the duplicate graph correspond with the components’ edges which enable assigning weights.
Remaining edges which correspond to the double bond are assigned weight θi. The duplicate graph of the simple graph of Naphthalene is portrayed in Figure 10.
The vertices ϑ5, ϑ10, ϑ′5, ϑ′10 are of degree 3 according to the sum of edges incident on them. Remaining vertices occur with degree 2. The details of edges and weights are in Table 3.
From the Table 3, it is evident that the total number of edges with non-zero weights is 16. Hence N=16. The edge set of the duplicate graph (naphthalene) possesses the partitions:
The cardinality of each partition is listed in Table 4.
S.No | Cardinality of E | Cardinality of E’ |
---|---|---|
1 | ||
2 | ||
3 | ||
Sum |
Thus, sum of cardinalities of E is 11,and sum of cardinalities of E’ is 5 yielding N=16. Hence,
The physiochemical properties of the PAHs obtained from the recognized databases and the predicted values using double-bond TIs are presented in Tables 6 to 9.
S.No | Chemical/ Property |
BP ( |
EV (kJ/mol) |
FP ( |
IR | Pol |
MV ( |
ST (mN/ m) |
1 | Naphthalene | 221.5 | 43.9 | 78.9 | 1.632 | 17.5 | 123.5 | 40.2 |
2 | Acenaphthylene | 298.9 | 51.7 | 137.2 | 1.732 | 20.3 | 128.2 | 54.7 |
3 | Acenaphthene | 279.0 | 49.7 | 135.3 | 1.692 | 20.5 | 134.9 | 49.2 |
4 | Fluorene | 293.6 | 51.2 | 133.1 | 1.645 | 21.3 | 148.3 | 46.2 |
5 | Phenanthrene | 337.4 | 55.8 | 146.6 | 1.715 | 24.6 | 157.7 | 48.0 |
6 | Fluoranthene | 375.0 | 59.8 | 168.4 | 1.852 | 28.7 | 162.0 | 59.4 |
7 | Benz(a) anthracene | 436.7 | 66.7 | 209.1 | 1.771 | 31.6 | 191.8 | 53.5 |
8 | Benzo(b) fluoranthene | 467.5 | 70.2 | 228.6 | 1.887 | 35.8 | 196.1 | 63.5 |
9 | Indeno (1,2,3-cd)pyrene | 497.1 | 73.6 | 247.2 | 2.009 | 40.0 | 200.0 | 74.2 |
10 | Acephenan -thrylene | 405.7 | 63.2 | 188.6 | 1.796 | 27.4 | 162.3 | 60.4 |
11 | Benzo[j] fluoranthene | 467.5 | 70.2 | 228.6 | 1.887 | 35.8 | 196.1 | 63.5 |
12 | Benzo[ghi] fluoranthene | 406.0 | 63.2 | 189.9 | 1.997 | 32.9 | 166.3 | 72.0 |
13 | Benzo[b] triphenylene | 518.0 | 76.1 | 264.5 | 1.812 | 38.7 | 225.9 | 57.7 |
14 | Dibenz[a,h] acridine | 534.0 | 78.0 | 240.3 | 1.824 | 37.9 | 219.1 | 62.8 |
15 | Coronene | 525.6 | 77.0 | 265.2 | 2.140 | 44.1 | 204.7 | 85.8 |
16 | Dibenzo [b,def] chrysene | 552.3 | 80.2 | 282.0 | 1.913 | 42.9 | 230.2 | 66.5 |
S.No | Chemical/Property | MM | RI | MW | XlogP3-AA | HAC | Complexity | LogP |
---|---|---|---|---|---|---|---|---|
1 | Naphthalene | 128.062 | 200 | 128.17 | 3.3 | 10 | 80.6 | 3.35 |
2 | Acenaphthylene | 152.192 | 244 | 152.19 | 3.7 | 12 | 184 | 3.93 |
3 | Acenaphthene | 154.078 | 247.8 | 154.21 | 3.9 | 12 | 155 | 3.9 |
4 | Fluorene | 166.0782 | 270.8 | 166.22 | 4.2 | 13 | 165 | 4.8 |
5 | Phenanthrene | 178.078 | 300 | 178.23 | 4.5 | 14 | 174 | 4.46 |
6 | Fluoranthene | 202.078 | 300 | 202.26 | 5.2 | 16 | 243 | 5.16 |
7 | Benz(a)anthracene | 228.0939 | 400 | 228.3 | 5.8 | 18 | 294 | 5.76 |
8 | Benzo(b)fluoranthene | 252.094 | 442.1 | 252.3 | 6.4 | 20 | 372 | 5.78 |
9 | Indeno(1,2,3-cd)pyrene | 276.0939 | 495.3 | 276.3 | 7.0 | 22 | 453 | 6.58 |
10 | Acephenanthrylene | 202.078 | 348.14 | 202.25 | 4.9 | 16 | 303 | — |
11 | Benzo[j]fluoranthene | 252.0939 | 443 | 252.3 | 6.4 | 20 | 372 | — |
12 | Benzo[ghi]fluoranthene | 226.078 | 391.6 | 226.3 | 5.4 | 18 | 314 | 7.23 |
13 | Benzo[b]triphenylene | 278.1095 | 495.9 | 278.3 | 6.7 | 22 | 361 | — |
14 | Dibenz[a,h]acridine | 279.1047 | 488.55 | 279.3 | 6.0 | 22 | 405 | 5.73 |
15 | Coronene | 300.352 | 593.5 | 300.4 | 7.2 | 24 | 376 | — |
16 | Dibenzo[b,def]chrysene | 302.1095 | 559.9 | 302.4 | 7.0 | 24 | 436 | — |
S.No | Chemical/TI | ISD | M2RD | M2RD |
SDD |
SC |
M1 |
ABC |
---|---|---|---|---|---|---|---|---|
1 | Naphthalene | 13.74 | 10.51 | 12.196 | 26.322 | 5.323 | 51.2 | 7.926 |
2 | Acenaphthylene | 18.395 | 13.18 | 15.671 | 34.192 | 6.548 | 69.4 | 9.988 |
3 | Acenaphthene | 18.2211 | 13.14 | 15.635 | 34.123 | 6.529 | 69.2632 | 9.962 |
4 | Fluorene | 19.6429 | 14.15 | 16.677 | 36.246 | 7.041 | 73.3333 | 10.69 |
5 | Phenanthrene | 20.613 | 15.2 | 17.653 | 38.109 | 7.552 | 77.3913 | 11.4 |
6 | Fluoranthene | 25.437 | 18.18 | 20.661 | 44.16 | 8.77 | 95.4074 | 13.4 |
7 | Benzo(a)anthracene | 28.5667 | 19.45 | 23.562 | 51.828 | 9.745 | 103.4 | 14.9 |
8 | Benzo(b)fluoranthene | 31.2121 | 22.01 | 24.89 | 53.078 | 10.54 | 116.455 | 16.15 |
9 | Indeno(1,2,3-cd)pyrene | 37.1351 | 25.41 | 29.554 | 63.856 | 12.18 | 139.351 | 18.88 |
10 | Acephenanthrylene | 25.25556 | 17.815 | 21.1109 | 45.9938 | 8.7564 | 95.4074 | 13.439 |
11 | Benzo[j]fluoranthene | 32.37647 | 22.805 | 26.1185 | 56.0294 | 10.979 | 121.412 | 16.853 |
12 | Benzo[ghi]fluoranthene | 30.19355 | 20.799 | 24.1173 | 52.0323 | 9.9761 | 113.419 | 15.436 |
13 | Benzo[b]triphenylene | 33.17143 | 23.414 | 27.5781 | 59.9214 | 11.554 | 123.343 | 17.637 |
14 | Dibenz[a,h]acridine | 34.52432 | 24.075 | 29.0185 | 63.6937 | 11.954 | 129.405 | 18.35 |
15 | Coronene | 41.8286 | 28.08 | 32.983 | 71.571 | 13.4 | 157.429 | 20.92 |
16 | Dibenzo[b,def]chrysene | 39.09024 | 27.082 | 32.0006 | 69.6301 | 13.176 | 147.415 | 20.345 |
S.No | Chemical/TI | M2D |
SZD |
SKD |
GAD |
RD |
SOD |
|
---|---|---|---|---|---|---|---|---|
1 | Naphthalene | 2.333 | 58.33 | 25.6 | 11.18 | 5.08743 | 36.49327 | |
2 | Acenaphthylene | 2.643 | 84.65 | 34.7 | 14.2 | 6.08198 | 49.50032 | |
3 | Acenaphthene | 2.633 | 84.53 | 34.63158 | 14.1 | 6.06264 | 49.40392 | |
4 | Fluorene | 2.889 | 88.52 | 36.66667 | 15.2 | 6.57487 | 52.28794 | |
5 | Phenanthrene | 3.147 | 92.65 | 38.69565 | 16.2 | 7.08717 | 55.14402 | |
6 | Fluoranthene | 3.474 | 119.7 | 47.7037 | 19.2 | 8.0785 | 67.88852 | |
7 | Benzo(a)anthracene | 3.921 | 125.7 | 51.7 | 21.1 | 9.04859 | 73.81968 | |
8 | Benzo(b)fluoranthene | 4.114 | 147.8 | 58.22727 | 23.2 | 9.65521 | 82.84016 | |
9 | Indeno(1,2,3-cd)pyrene | 4.574 | 179.7 | 69.67568 | 27.06 | 11.02706 | 99.24781 | |
10 | Acephenanthrylene | 3.4465 | 118.7 | 47.7037 | 19.13 | 8.061659 | 68.02855 | |
11 | Benzo[j]fluoranthene | 4.2778 | 153.71 | 60.70588 | 24.13 | 10.05911 | 86.41946 | |
12 | Benzo[ghi]fluoranthene | 3.7751 | 145.74 | 56.70968 | 22.13 | 9.055223 | 80.76877 | |
13 | Benzo[b]triphenylene | 4.6381 | 151.54 | 61.67143 | 25.08 | 10.70699 | 87.93075 | |
14 | Dibenz[a,h]acridine | 4.7245 | 159.68 | 64.7027 | 26.05 | 11.0287 | 92.35122 | |
15 | Coronene | 4.885 | 205.9 | 78.71429 | 30 | 12.018 | 112.159 | |
16 | Dibenzo[b,def]chrysene | 5.0556 | 186.76 | 73.70732 | 29.05 | 12.02459 | 105.0815 |
Quantifying the relationship between the properties of any chemical compound and the corresponding TIs is an essential element of QSPR investigation [50]. Regression analysis and modelling is used as a tool in QSPR investigations to accurately predict or estimate the properties of any compound using TIs [49]. The predictions derived from the double-bond TIs (Tables 8 and 9) are correlated with the physicochemical property values (Tables 6 and 7) of the considered PAHs. The linear and quadratic models of regression further confirmed this association, as good correlation values were obtained. The TIs obtained for the molecular structures of PAHs were regarded as independent variables, and the physicochemical attributes were designated as dependent variables [33]. The quadratic and linear regression models are given by the following equation, respectively.
where pc denotes the physio-chemical property,
Linear regression models were created using the set of 13 TIs for each physicochemical property using Eq. (1). For the physiochemical characteristics described above that correlate with the double bond indices, the corresponding intercepts and constants were calculated. F-statistics were used to test linear regression models. The best prediction models were provided by the five recently proposed TIs. The following are the best predictions obtained using this model:
The statistical parameters of the best-fit linear regression model are listed in Table 10. The results obtained for the quadratic regression models are presented graphically in Figure 11 to Figure 25.
S.No | Property | TI | r |
r |
RMSE | F | p-Value |
1 | BP |
M |
0.9905 | 0.98115 | 14.7003 | 728.647 | 1.79E-13 |
2 | EV |
M |
0.992 | 0.98406 | 1.49266 | 864.5014 | 5.51E-14 |
3 | FP |
M |
0.9811 | 0.96256 | 11.77642 | 359.9385 | 2.2E-11 |
4 | IR |
SZ |
0.8985 | 0.8070 | 0.06318 | 58.67078 | 2.26E-06 |
5 | Log P |
M |
0.9678 | 0.9366 | 0.3317 | 206.843 | 8.87E-10 |
6 | PO |
M |
0.9965 | 0.99312 | 0.76674 | 937.955 | 8.82E-15 |
7 | ST |
SZ |
0.9058 | 0.82055 | 5.0692 | 64.01541 | 1.37E-06 |
8 | MV |
M |
0.9744 | 0.9494 | 8.0648 | 262.6252 | 1.82E-10 |
9 | MM |
R |
0.9983 | 0.99652 | 3.4463 | 4017.393 | 1.28E-18 |
10 | MP |
SZ |
0.8462 | 0.716 | 56.7147 | 27.7373 | 0.000266 |
11 | RI |
M |
0.9902 | 0.98049 | 17.4164 | 703.6756 | 2.27E-13 |
12 | MW |
R |
0.9982 | 0.99651 | 3.4568 | 3997.824 | 1.32E-18 |
13 | X Log P |
M |
0.97844 | 0.9573 | 0.2736 | 314.2205 | 5.48E-11 |
14 | HAC |
R |
0.99858 | 0.99717 | 0.2507 | 4941.147 | 3.01E-19 |
15 | Complexity |
SC |
0.9568 | 0.91556 | 34.04815 | 151.8049 | 6.66E-09 |
Quadratic models were also created with 13 double-bond indices for the aforementioned properties. Only three TIs provided the best predictions for physicochemical properties in this model. They are
The best predictive Quadratic regression models obtained from Eq. (2) are:
The best-fit quadratic regression model and parameters (statistical) are listed in Table 11, and their corresponding visualizations are shown in Figures 26 to 28.
S.No | Property | TI | r | r |
RMSE | F | p-Value |
---|---|---|---|---|---|---|---|
1 | BP | R |
0.98936 | 0.97884 | 16.1622 | 300.6883 | 1.31E-11 |
2 | FP | SZ |
0.9653 | 0.93173 | 16.50276 | 88.7102 | 2.65E-08 |
3 | EV | SDD |
0.98423 | 0.968705 | 2.1706 | 201.2 | 1.66E-10 |
Thus, best-fit linear and quadratic regression models were developed for PAHs using double bond indices. Widely used degree-based indices have been extended to double-bond indices. The new indices show stronger, and sometimes marginally weaker, correlations with various properties than the former ones. Hence, to ensure the efficiency of these indices, their results must be compared to those of existing indices. Therefore, an intercorrelation matrix and sensitivity analysis were used. The values of the existing TIs of the PAHs are presented in Tables 12 and 13.
S.No | Chemical/TI | IS | M2R | M2RD | SDD | SC | ABC | M1 |
---|---|---|---|---|---|---|---|---|
1 | Naphthalene | 12.3 | 10.266 | 11.899 | 25.667 | 5.197 | 7.737 | 50 |
2 | Acenaphthylene | 16.7 | 12.898 | 15.348 | 33.5 | 6.408 | 9.778 | 68 |
3 | Acenaphthene | 16.7 | 12.898 | 15.348 | 33.5 | 6.408 | 9.778 | 68 |
4 | Fluorene | 17.7 | 13.899 | 16.348 | 35.5 | 6.908 | 10.485 | 73.333 |
5 | Phenanthrene | 18.7 | 14.899 | 17.348 | 37.5 | 7.408 | 11.192 | 77.391 |
6 | Fluoranthene | 23.2 | 17.899 | 20.348 | 43.5 | 8.633 | 13.399 | 94 |
7 | Benzo(a)anthracene | 25 | 19.165 | 23.247 | 51.167 | 9.605 | 14.687 | 102 |
8 | Benzo(b)fluoranthene | 28.4 | 21.715 | 24.573 | 52.416 | 10.396 | 15.939 | 115 |
9 | Indeno(1,2,3-cd)pyrene | 34 | 25.165 | 29.247 | 63.167 | 12.054 | 18.687 | 138 |
10 | Acephenanthrylene | 23.1 | 17.532 | 20.798 | 45.333 | 8.618 | 13.233 | 94 |
11 | Benzo[j]fluoranthene | 29.6 | 22.532 | 25.798 | 55.333 | 10.844 | 16.647 | 120 |
12 | Benzo[ghi]fluoranthene | 27.6 | 20.532 | 23.798 | 51.333 | 9.844 | 15.233 | 112 |
13 | Benzo[b]triphenylene | 30 | 23.165 | 27.247 | 59.167 | 11.422 | 17.435 | 122 |
14 | Dibenz[a,h]acridine | 31.4 | 23.798 | 23.697 | 63 | 11.816 | 18.142 | 128 |
15 | Coronene | 38.4 | 27.798 | 32.697 | 71 | 12.265 | 20.728 | 156 |
16 | Dibenzo[b,def]chrysene | 35.9 | 26.798 | 31.697 | 69 | 13.041 | 20.142 | 146 |
S.No | Chemical/TI | SZ | M2 | SK | GA | R | SO |
---|---|---|---|---|---|---|---|
1 | Naphthalene | 2.277 | 57 | 25 | 10.919 | 4.966 | 35.635 |
2 | Acenaphthylene | 2.5833 | 83 | 34 | 13.878 | 5.949 | 48.503 |
3 | Acenaphthene | 2.5833 | 83 | 34 | 13.878 | 5.949 | 48.503 |
4 | Fluorene | 2.8333 | 87 | 36 | 14.878 | 6.449 | 51.332 |
5 | Phenanthrene | 3.0833 | 91 | 38 | 15.878 | 6.949 | 54.16 |
6 | Fluoranthene | 3.1466 | 118 | 47 | 18.878 | 7.949 | 66.888 |
7 | Benzo(a)anthracene | 3.8611 | 124 | 51 | 20.797 | 8.915 | 72.825 |
8 | Benzo(b)fluoranthene | 4.0555 | 146 | 57.5 | 22.858 | 9.524 | 81.807 |
9 | Indeno(1,2,3-cd)pyrene | 4.528 | 178 | 69 | 29.797 | 10.915 | 98.28 |
10 | Acephenanthrylene | 3.388 | 117 | 47 | 18.838 | 7.933 | 67.028 |
11 | Benzo[j]fluoranthene | 4.222 | 152 | 60 | 23.838 | 9.932 | 85.413 |
12 | Benzo[ghi]fluoranthene | 3.722 | 144 | 56 | 18.878 | 8.933 | 79.756 |
13 | Benzo[b]triphenylene | 4.583 | 150 | 61 | 24.797 | 10.852 | 86.967 |
14 | Dibenz[a,h]acridine | 4.667 | 158 | 64 | 25.757 | 10.898 | 91.349 |
15 | Coronene | 4.833 | 204 | 78 | 29.757 | 11.898 | 111.15 |
16 | Dibenzo[b,def]chrysene | 5 | 185 | 73 | 28.757 | 11.898 | 104.08 |
An intercorrelation matrix can be used to quantify the behavior of the indices. The correlation coefficient, r, was used to frame the matrix. It is evident from the table that all the r values lie between 1 ≥ r ≥ 0.96. There is a substantial connection between the pairs of TIs whose r ≥ 0.98. Table 15 presents an intercorrelation matrix framed between the suggested new indices and current indices. Consequently, the new indices that have been expanded show remarkable connections with the degree-based topological indices examined. Therefore, the use of current indices to anticipate the physicochemical features of double-bond TIs is equally likely.
The phenomenon in which two or more distinct chemical structures have the same TI value at times is known as TIs’ degeneracy. The predictive capacity of TIs is constrained by reduced discriminative power. The analysis introduced by Konstantinova was used to quantitatively measure the degeneracy sensitivity [61], which is defined as follows:
where,
S.No | TI-Double | Sensitivity Analysis | TI | Sensitivity Analysis | |
---|---|---|---|---|---|
1 | ISD |
1 | IS | 0.875 | |
2 | M2R |
1 | M2R | 0.875 | |
3 | M2RD |
1 | M2RD | 0.875 | |
4 | SDD |
1 | SDD | 0.875 | |
5 | SC |
1 | SC | 0.875 | |
6 | ABC |
1 | ABC | 0.875 | |
7 | M1 |
1 | M1 | 0.75 | |
8 | M2 |
1 | M2 | 0.875 | |
9 | SZ |
1 | SZ | 0.875 | |
10 | SK |
1 | SK | 0.875 | |
11 | GA |
1 | GA | 0.75 | |
12 | R |
1 | R | 0.75 | |
13 | SO |
1 | SO | 0.875 |
The tabular data unequivocally demonstrates that each double-bond
index possesses a 100
This investigation’s results provide a foundation for enhancing the accuracy of physicochemical property predictions in polycyclic aromatic hydrocarbons (PAHs). The naphthalene representation effectively maintains both the structural attributes of the chemical compound and the mathematical characteristics of the simple connected graph. The intercorrelation matrix and sensitivity analysis validate the extended double-bond indices’ predictive capacity and comprehensive discriminative power. Table 16 encapsulates the primary points evaluated within the same table.
TIs | ||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | ||||||||||||||||||||||||||
0.999 | 1 | |||||||||||||||||||||||||
0.998 | 0.998 | 1 | ||||||||||||||||||||||||
0.998 | 0.998 | 1 | 1 | |||||||||||||||||||||||
0.997 | 0.996 | 0.997 | 0.997 | 1 | ||||||||||||||||||||||
0.986 | 0.986 | 0.984 | 0.984 | 0.980 | 1 | |||||||||||||||||||||
0.995 | 0.993 | 0.994 | 0.994 | 0.999 | 0.977 | 1 | ||||||||||||||||||||
0.995 | 0.993 | 0.994 | 0.994 | 0.999 | 0.977 | 1 | 1 | |||||||||||||||||||
0.996 | 0.995 | 0.998 | 0.998 | 0.999 | 0.980 | 0.998 | 0.998 | 1 | ||||||||||||||||||
SC | 0.987 | 0.985 | 0.992 | 0.992 | 0.993 | 0.970 | 0.991 | 0.991 | 0.996 | 1 | ||||||||||||||||
0.998 | 0.997 | 0.999 | 0.999 | 1 | 0.982 | 0.998 | 0.998 | 1 | 0.994 | 1 | ||||||||||||||||
ABC | 0.998 | 0.997 | 0.999 | 0.999 | 0.999 | 0.982 | 0.998 | 0.998 | 1 | 0.994 | 1 | 1 | ||||||||||||||
0.999 | 1 | 0.998 | 0.998 | 0.996 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1 | ||||||||||||||
0.999 | 1 | 0.998 | 0.998 | 0.996 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1 | 1 | |||||||||||||
0.981 | 0.977 | 0.987 | 0.987 | 0.990 | 0.962 | 0.989 | 0.989 | 0.993 | 0.995 | 0.990 | 0.990 | 0.977 | 0.978 | 1 | ||||||||||||
0.978 | 0.974 | 0.983 | 0.983 | 0.988 | 0.960 | 0.988 | .988 | 0.991 | 0.992 | 0.988 | 0.987 | 0.975 | 0.975 | 0.997 | 1 | |||||||||||
0.997 | 0.998 | 0.994 | 0.994 | 0.990 | 0.984 | 0.986 | 0.986 | 0.988 | 0.976 | 0.991 | 0.991 | 0.998 | 0.998 | 0.964 | 0.960 | 1 | ||||||||||
SZ | 0.997 | 0.999 | 0.994 | 0.994 | 0.990 | 0.984 | 0.986 | 0.987 | 0.988 | 0.976 | 0.991 | 0.991 | 0.998 | 0.998 | 0.964 | 0.961 | 1 | 1 | ||||||||
0.999 | 1 | 0.998 | 0.998 | 0.996 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1 | 1 | 0.977 | 0.975 | 0.998 | 0.998 | 1 | ||||||||
SK | 0.999 | 1 | 0.998 | 0.998 | 0.996 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1.000 | 1 | 0.977 | 0.975 | 0.998 | 0.998 | 1 | 1 | ||||||
0.999 | 0.998 | 1 | 1 | 0.999 | 0.983 | 0.997 | 0.997 | 0.999 | 0.993 | 1.000 | 1 | 0.998 | 0.998 | 0.989 | 0.986 | 0.993 | 0.993 | 0.998 | 0.998 | 1 | ||||||
GA | 0.982 | 0.981 | 0.983 | 0.983 | 0983 | 0.969 | 0.981 | 0.981 | 0.984 | 0.979 | 0.984 | 0.984 | 0.981 | 0.981 | 0.974 | 0.972 | 0.975 | 0.975 | 0.981 | 0.981 | 0.984 | 1 | ||||
0.993 | 0.990 | 0.996 | 0.996 | 0.998 | 0.975 | 0.996 | 0.996 | 0.999 | 0.997 | 0.998 | 0.998 | 0.991 | 0.991 | 0.997 | 0.994 | 0.981 | 0.982 | 0.991 | 0.991 | 0.997 | 0.982 | 1 | ||||
0.991 | 0.988 | 0.995 | 0.995 | 0.996 | 0.975 | 0.995 | 0.995 | 0.998 | 0.996 | 0.997 | 0.997 | 0.989 | 0.989 | 0.998 | 0.995 | 0.979 | 0.979 | 0.989 | 0.989 | 0.996 | 0.981 | 1 | 1 | |||
0.999 | 1 | 0.998 | 0.998 | 0.997 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1 | 1 | 0.978 | 0.975 | 0.998 | 0.998 | 1 | 1 | 0.998 | 0.981 | 0.991 | 0.989 | 1 | ||
0.999 | 1 | 0.998 | 0.998 | 0.997 | 0.985 | 0.994 | 0.994 | 0.995 | 0.986 | 0.997 | 0.997 | 1 | 1 | 0.978 | 0.975 | 0.998 | 0.998 | 1 | 1 | 0.998 | 0.981 | 0.991 | 0.989 | 1 | 1 |
Topological Indices | Extended Topological Indices |
---|---|
Neglect to detect double bonding | Double bonds are recognized |
Less discriminating power | Higher discriminating power |
Chemical undergoes structural modification | Structure of the compound is not modified |
These promising results lead to many futures works especially,
More than 500,000 chemical entities are synthesized and characterized annually [11]. The prediction of biological activity in molecular structures is paramount for enhancing therapeutic efficacy and mitigating adverse effects, including toxicity. While traditional experimental methods for determining physicochemical properties are resource-intensive and costly, topological indices (TIs) offer a more economical approach. These indices serve as valuable tools in optimizing the therapeutic index of lead compounds, thereby streamlining the drug development process.
The indices examined in this study demonstrate robust predictive capabilities for physicochemical properties, significantly enhancing the virtual screening process of PAHs. Notable accomplishments of this research include improved discriminatory power and the preservation of double-bonded molecular structures. Expanding this concept to diverse molecular structures could substantially broaden the scope and impact of this investigation. The findings presented herein are expected to make significant contributions to QSAR/QSPR studies, ultimately increasing the practical utility of TIs in the virtual screening of lead compounds within pharmaceutical sciences.
The authors do not declare any conflicting interest. The writers alone are in charge of the composition and contents of this work.
1970-2025 CP (Manitoba, Canada) unless otherwise stated.