|
1.INTRODUCTIONWith the development of urbanization, cables are more and more widely used in power systems. Compared with primary equipment such as transformers, switches, and circuit breakers, which are more centralized in the power grid, cables have a large number, long mileage, and wide distribution, and it is not easy to realize comprehensive monitoring and exact perception of their status. Moreover, the operation and maintenance management of cables has not yet formed a system, especially for 10 kV distribution cables, which are mostly randomly placed in cable trenches, which are often full of water and have poor air circulation [1]. Such a harsh operating environment is likely to cause damage to the cables, accelerating the aging of the cable insulation and leading to failures. Therefore, it is necessary to accurately diagnose faults on faulty cables in order to formulate appropriate maintenance and disposal plans, especially for early faults, which need to be paid attention to and handled before they develop into permanent faults, so as to prevent the deterioration of cable insulation from causing serious faults [2]. For normal cables in service, it is necessary to accurately assess the state of their insulation and grasp their current state of health in order to provide a reasonable maintenance program [3]. Accurate diagnosis of cable faults and reliable assessment of cable insulation status, and comprehensive monitoring of cable status are of great significance for the timely detection of potential cable faults [4], slowing down the occurrence of permanent faults, reducing the risk of grid operation, and ensuring the reliability of power supply in the power system. Along with the development of artificial intelligence technology in recent years, some scholars have begun to apply data-driven ideas to the research of cable fault state diagnosis, and have also achieved certain results [5]. Southwest Jiaotong University realized the identification of early cable faults based on the probabilistic neural network [6] and support vector machine [7] respectively; Wang and Lu applied a restricted Boltzmann machine and stacked autoencoder to classify cable faults in distribution networks [8]; Professor Shin Yong-jun of Yonsei University in South Korea took fault waveforms and their reflection waveforms as the research object, and extracted the three features of fault waveforms, namely, the time delay, peak voltage, and time-frequency phase difference, as the inputs of generalized regression neural network as the input of the generalized regression neural network, so as to diagnose a variety of cable fault states [9]. From this type of research, it is not difficult to find that in the field of cable fault identification, artificial intelligence has a greater potential for development [10]. Compared with the traditional electrical threshold method and other state diagnosis methods, the data-driven cable fault state method is more adaptable and robust [11]. However, the current data-driven cable fault state diagnosis method still has shortcomings such as less research on cable weak fault diagnosis [12] and lack of generalization ability of the cable fault state diagnosis model [13]. Therefore, in this paper, the identification of weak cable faults is incorporated into the diagnostic model to achieve more comprehensive cable fault state identification; the data subset approximation method is used to reduce the size of the training set and improve the model training speed. At the same time, a hybrid Gaussian multi-classification algorithm is used to train the tunnel cable state fast identification model, and the correlation between cable states is deeply explored. Finally, an online updating mechanism of the model is proposed to improve the generalization performance of the cable fault state diagnosis model. 2.ACQUISITION OF MODEL TRAINING DATABy analyzing the mechanism of weak and permanent faults of cables, suitable models are selected to simulate weak and permanent faults of cables. Use MATLAB or PSCAD/EMTDC to establish the cable model [14] and build the cable weak fault [15], permanent fault and overcurrent [16], and overvoltage perturbation model [17]. By simulating different cable fault conditions, different grounding resistances and forms of perturbation, the fault current, grounding current, partial discharge and other data under different conditions are obtained, and at the same time, the cable network operation temperature is obtained through the cable modem and the cable trench modeling simulation, and the monitoring quantities of the fault current, grounding current, partial discharge, and the cable network operation temperature are used as the input variable x, and the cable state is used as the output variable m, and the value of m is taken as follows: At the same time, cable factory experimental data, online monitoring data, and power failure detection data can be added to the database. The data under each operating condition and fault condition form the original data set. K samples are uniformly selected from the original dataset as the new dataset by the data subset approximation method. 3.FAST IDENTIFICATION OF TUNNEL CABLE STATUS BASED ON SPARSE ONLINE HYBRID GAUSSIAN MULTI-CLASSIFICATION ALGORITHM3.1Gaussian mixture modelingThe input variable x and the output variable m are combined into a new vector of variables y = [xT mT], then the training set is Y = [y1, y2, …, yK]. A Gaussian mixture model of the joint probability density of the inputs and outputs is built [18]: where M is the number of unknown Gaussian components, j represents the jth Gaussian component, p(y) represents the probability density function of y, and p(j) represents the probability density function of j, which is calculated as follows: where 𝛾𝑗 is the probability weight of the jth Gaussian component, 𝛿𝑗 = {𝜂𝑗, Σ𝑗},𝑓(𝑗|𝛿𝑗) is calculated as follows: where e is the dimension of y, where 𝜂𝑗 is the mean vector of the jth Gaussian component, and Σ𝑗 is the covariance matrix of the jth Gaussian component. p(y/j) represents the conditional probability density function of y with respect to j: 3.2Acquisition of model parametersIt is first necessary to determine the range of M, i.e., to determine the maximum value, Mmax, and the minimum value, Mmin, of M. The absolute increasing log-likelihood criterion (AIL) is chosen to select the optimal Gaussian component [19]. The AIL criterion is defined as where log F and log F denote the maximum log-likelihood function of a Gaussian mixture model with M and M-1 Gaussian components, respectively, and the maximum log-likelihood function of a Gaussian mixture model with M Gaussian components is computed as follows: The maximum log-likelihood function for a Gaussian mixture model with M-1 Gaussian components is similar. where K is the number of samples and is the total parameters in the complete Gaussian mixture model with M components [20]. Let M be equal to Mmax and use the K-means algorithm to initialize the parameters of the Gaussian mixture model with M Gaussian components, denoted as . 3.2.1EM algorithm [21] for obtaining estimated model parametersE step, calculate the posterior probability of the ith training sample for the jth Gaussian component at the kth iteration: M step, the probability weight , mean and covariance of the jth Gaussian component at the (k+1)st iteration are computed: Let , denote the (k+1)th total model parameter. The total parameters obtained from each iteration are combined to obtain the sequence {α}, where each element is α. Maximize the log-likelihood function to find the model parameters : 3.2.2Calculate the corresponding absolute increasing log-likelihood criterion AILRemove the least probable Gaussian component and merge it to the closest Gaussian component. Select the least likely component by finding the component with the smallest mixing probability: Then the component s closest to the rth component is chosen. where the Bs function is the symmetric KL scatter between the rth Gaussian component and the other components, and the symmetric KL scatter is a common measure of the difference between probability density functions: The rth and sth Gaussian components are combined into a single component whose mixed probability, mean, and covariance are updated as Obtain a Gaussian mixture model with M-1 Gaussian components with initial parameters . Let M = M-1 and repeat EM algorithm flow if M ≥ Mmin, otherwise proceed to the next step. Obtain the optimal Mopt by maximizing the AIL. 3.3Computational model outputAssume that the new input data is xnew and the true value of the corresponding output is mnew. divide the mean vector and covariance matrix of each Gaussian component into input and output parts as follows: Estimate the posterior probability of each Gaussian component of the new input data: Compute the conditional distribution of the output of each Gaussian component of the new input data with respect to the input: where and are the mean and covariance parameters of the jth conditional Gaussian distribution, calculated as follows: The conditional distributions of the test set output with respect to the inputs are combined into a single Gaussian distribution. Finally, each component is weighted and combined to obtain the final prediction of the output: Set the threshold function to Tunnel cable equipment fault state diagnosis, online monitoring equipment, and inspection robots collected cable equipment running current, grounding current, partial discharge, cable network temperature, and other monitoring quantities as the input to the cable fault state diagnosis model, to get the model output value ŷ. When ŷ=0, the cable is in the normal state, when ŷ=1, the cable is in a weak fault 1 state, when ŷ=2, the cable is in a weak fault 2 state, when ŷ=n1, the cable is in a permanent fault 1 state, when ŷ=n2, the cable is in a permanent fault 2 state, and so on. The model is updated online when new training points are added, and the process can be regarded as a process of adding new training points one by one. When the number of samples in the training set is increased to K+1, let the added samples be yK+1=(x+1, mK+1), then the mean and covariance parameter of the jth conditional Gaussian distribution are updated as: where is Gaussian white noise with zero variance. The overall flow of the algorithm is shown in Fig. 1. 4.SIMULATION ANALYSIS4.1Evaluation indicatorsThe common metrics of the recognition model are accuracy, precision, recall, F1 comprehensive evaluation metrics, etc., among which accuracy is the most commonly used evaluation metric in fault recognition, which is calculated as shown below: where Nt is the number of samples correctly predicted by the recognition model and Nf is the number of samples incorrectly predicted by the recognition model. The accuracy can only describe the recognition results of the recognition model for all test samples, but cannot reflect the recognition performance of the model in specific categories. In order to comprehensively assess the performance of the recognition model, the F1 comprehensive evaluation index is introduced for comprehensive evaluation. The accuracy is calculated as follows: where NTk is the number of samples correctly predicted by the discriminative model in the kth category sample, and NFk is the number of samples incorrectly predicted by the discriminative model in the kth category sample. The F1 composite evaluation index is calculated as follows: The larger the value of the F1 comprehensive evaluation index, the stronger the comprehensive performance of the identification model. 4.2Experimental setupThere are 8 cable operation states set up, which are coded as shown in Table 1. The data obtained through the cable and fault modeling simulation consists of the original dataset, which has 12000 samples, of which 2000 samples obtained by changing the states other than the eight cable operation states in the table are used as test set 2, 8000 samples are selected from the remaining 1000 samples as the training set, and the remaining 2000 samples are used as test set 1. Table 1.Cable operation status code
SVM, DT, RF, and KNN diagnostic models are used as a comparison of the proposed method. The simulation is carried out on MATLAB and the processor of the simulation device is 5800X with 32G RAM. 4.3Simulation results and analysisFive different methods of tunnel cable equipment fault state diagnosis models were trained using the training set, and the diagnostic results using test set 1 as the test set are shown in Fig. 2 and Table 2. Table 2.Failure evaluation indicator values for each method
In order to evaluate the generalization ability of the tunnel cable equipment fault state diagnosis models trained by each method, test set 2 is used as the test set. The diagnostic results are shown in Fig. 3 and Table 3. Table 3.Failure evaluation indicator values for each method
The simulation results of each method show that the proposed sparse online hybrid Gaussian multi-classification algorithm-based tunnel cable state fast identification method has higher identification accuracy than the SVM, DT, RF, and KNN-based tunnel cable state identification methods, especially in the new operating conditions, the cable fault identification algorithms based on MLRM, SVM, and ANN produce larger measurement errors, but the proposed recognition algorithm has a smaller increase in the cable fault state recognition error, which indicates that it is more capable of adapting to new operating conditions, i.e., the generalization ability of this soft sensor is stronger. The hybrid Gaussian process is the union of multiple Gaussian components, and the number of Gaussian components is the number of different operating conditions of the system, so different from the general global identification, the hybrid Gaussian will automatically identify the number of operating conditions, and judge the current operating conditions during the actual measurement, so as to select the corresponding Gaussian components. Due to the huge difference between different operating conditions, global regression methods often cannot maintain high prediction accuracy over the whole operating domain, and the hybrid Gaussian process divides the whole operating domain and maintains good prediction performance in each operating condition. Since it is generally not possible for the training set to contain all cable states, the online updating capability of the model becomes more important. In this paper, we introduce an online updating mechanism for the model, which can introduce new training data and update the model faster, without the need to start the whole model training process from scratch. Therefore, the proposed method can still maintain a good recognition accuracy under new operating conditions. The model training time of each method is shown in Table 4: Table 4.Model training time for each method (unit: s)
From the above table, it can be seen that the model training time of the proposed method is the shortest. Since the training time of the hybrid Gaussian multi-classification algorithm has a linear relationship with the third power of the size of the training samples, the introduction of the sparse technique in the proposed method in this paper can greatly reduce the training time of the model in the case of a small increase in the error, which provides the possibility of online identification of the cable fault status. 5.SUMMARYIn this paper, a fast identification method of tunnel cable status based on a sparse online hybrid Gaussian multi-classification algorithm is proposed. The main novelty of the method is the use of a sparse online hybrid Gaussian multi-classification algorithm for modeling the fast tunnel cable condition identification model. The results show that for the tunnel cable state diagnosis problem with complex operating conditions and high dimensionality of variables, the method has high accuracy and efficiency, and can quickly obtain the current state of the tunnel cable. 6.ACKNOWLEDGEMENTThis paper was completed with the financial support from the fund of State Grid Beijing Electric Power Company’s scientific and technological project “Research and Application of Key Technology for Intelligent Inspection of Large Urban Cable Tunnels Based on Panoramic Lidar Technology” (520246230002). 7.7.REFERENCESZhou, Chengke, Huajie Yi, and **ang Dong,
“Review of recent research towards power cable life cycle management,”
High voltage, 2
(3), 179
–187
(2017). https://doi.org/10.1049/hve2.v2.3 Google Scholar
Chen, **aolong, et al.,
“BN-RA: a hybrid model for risk analysis of overload-induced early cable fires,”
Applied Sciences, 11
(19), 8922
(2021). https://doi.org/10.3390/app11198922 Google Scholar
Song, Yuxuan, et al.,
“Online multi-parameter sensing and condition assessment technology for power cables: A review,”
Electric Power Systems Research, 210 108140
(2022). https://doi.org/10.1016/j.epsr.2022.108140 Google Scholar
Benesl, Lukas, et al.,
“Cable monitoring using broadband power line communication,”
Sensors, 22
(8), 3019
(2022). https://doi.org/10.3390/s22083019 Google Scholar
Huo, Yinjia, et al.,
“Advanced smart grid monitoring: Intelligent cable diagnostics using neural networks,”
in 2020 IEEE International Symposium on Power Line Communications and its Applications (ISPLC),
(2020). Google Scholar
Chi, Peng, et al.,
“A CNN recognition method for early stage of 10 kV single core cable based on sheath current,”
Electric Power Systems Research, 184 106292
(2020). https://doi.org/10.1016/j.epsr.2020.106292 Google Scholar
Zhu, Wenwei, et al.,
“Anchor Fault Identification Method for High-Voltage DC Submarine Cable Based on VMD- Volterra-SVM,”
Energies, 16
(7), 3053
(2023). https://doi.org/10.3390/en16073053 Google Scholar
Wang, Ying, et al.,
“Cable incipient fault identification using restricted Boltzmann machine and stacked autoencoder,”
IET Generation, Transmission & Distribution, 14
(7), 1242
–1250
(2020). https://doi.org/10.1049/gtd2.v14.7 Google Scholar
Kwon G Y, Lee C K, Shin Y J.,
“Diagnosis of shielded cable faults via regression-based reflectometry[J],”
IEEE Transactions on Industrial Electronics, 66
(3), 2122
–2131
(2018). https://doi.org/10.1109/TIE.41 Google Scholar
Kumar, Haresh, et al.,
“A Review on the Classification of Partial Discharges in Medium-Voltage Cables: Detection, Feature Extraction, Artificial Intelligence-Based Classification, and Optimization Techniques,”
Energies, 17
(5), 1142
(2024). https://doi.org/10.3390/en17051142 Google Scholar
Tang, Zehua, et al.,
“Multi-source data-cooperated neutral low-resistance grounding cable grid faulty segment identification,”
IEEE Transactions on Power Systems, 37
(2), 1413
–1424
(2021). https://doi.org/10.1109/TPWRS.2021.3106691 Google Scholar
Laib, Abderrzak, et al.,
“Enhanced artificial intelligence technique for soft fault localization and identification in complex aircraft microgrids,”
Engineering Applications of Artificial Intelligence, 127 107289
(2024). https://doi.org/10.1016/j.engappai.2023.107289 Google Scholar
Liu, Qing, et al.,
“Health Assessment of HV Cable Based on Data Correlation Analysis and Fuzzy Rules Joint Algorithm,”
in 2022 6th International Conference on Power and Energy Engineering (ICPEE),
(2022). Google Scholar
Smith, Dominic M., Lee S. Cunningham, and Lujia Chen,
“Efficient finite element modelling of helical strand cables utilising periodicity,”
International Journal of Mechanical Sciences, 263 108792
(2024). https://doi.org/10.1016/j.ijmecsci.2023.108792 Google Scholar
Lyu, Kehong, et al.,
“Mechanism analysis and dynamic model construction of intermittent fault of weak-current aviation cable under vibration stress,”
Advances in Mechanical Engineering, 14
(3),
(2022). https://doi.org/10.1177/16878140221074820 Google Scholar
Wang, Xudong, et al.,
“Overcurrent tests and numerical simulations on a 66-kV-class RE123 high-temperature superconducting model cable,”
IEEE transactions on applied superconductivity, 22
(3), 5800904
–5800904
(2011). https://doi.org/10.1109/TASC.2011.2178973 Google Scholar
Ren, Hongtao, and Ying Zhang,
“Analysis on switching overvoltage and suppression method of cable joint in 500 kV cable line,”
Energy Reports, 7 567
–575
(2021). https://doi.org/10.1016/j.egyr.2021.08.004 Google Scholar
Yuan, Ziyun, et al.,
“Knowledge-informed Variational Bayesian Gaussian mixture regression model for predicting mixed oil length,”
Energy, 285 129248
(2023). https://doi.org/10.1016/j.energy.2023.129248 Google Scholar
Saves, Paul, et al.,
“A mixed-categorical correlation kernel for Gaussian process,”
Neurocomputing, 550 126472
(2023). https://doi.org/10.1016/j.neucom.2023.126472 Google Scholar
Ghahroodi, Z. Rezaei, R. Aliakbari Saba, and T. Baghfalaki,
“Gaussian copula–based regression models for the analysis of mixed outcomes: an application on household’s utilization of health services data,”
Journal of Statistical Theory and Applications, 18
(3), 182
–197
(2019). Google Scholar
Wu, Di, and **wen Ma.,
“A two-layer mixture model of Gaussian process functional regressions and its MCMC EM algorithm,”
IEEE Transactions on Neural Networks and Learning Systems, 29
(10), 4894
–4904
(2018). https://doi.org/10.1109/TNNLS.2017.2782711 Google Scholar
|