Abstract

Traditional onefold data-driven methods for fault detection in complex process industrial systems with high-dimensional, linear, nonlinear, Gaussian, and non-Gaussian coexistence often have less than satisfactory monitoring performance because only a single distribution of process variables is considered. To address this problem, a hybrid fault detection model based on PCA-KPCA-ICA-KICA-BI (Bayesian inference) is proposed, taking into account the advantages of principal component analysis (PCA), kernel principal component analysis (KPCA), independent component analysis (ICA), and kernel independent component analysis (KICA) in terms of dimensionality reduction and feature extraction. Foremost, this paper proposed a nonlinear evaluation method and divided the feature variables into Gaussian linear blocks, Gaussian nonlinear blocks, non-Gaussian linear blocks, and non-Gaussian nonlinear blocks by using the Jarque–Bera (JB) test and nonlinear discrimination method. Each division was monitored by the PCA-KPCA-ICA-KICA model, and finally the Bayesian fusion strategy proposed in this study is used to synthesize the detection results for each block. The hybrid model helps in evaluating variable features and bettering detection performance. Ultimately, the superiority of this hybrid model was verified through the Tennessee Eastman (TE) process and the Continuous Stirred Tank Reactor (CSTR) process, and the fault monitoring results showed an average accuracy of 85.91% for this hybrid model.

1. Introduction

People now set a higher threshold for the production quality, system performance, economical efficiency, and increasingly complicated grow process industry systems at the structure and automation level, amid the rapid advance of artificial intelligence and sensor detection technology. Reliability and safety of complex industrial processes now jostle for increasing attention and have to be safeguarded urgently. Industrial process fault detection shines in effectively improving product quality, safe operation, and continuous production. Condition monitoring can be performed by knowledge- and model-based or data-driven approaches [15]. The first two kinds face great challenges in dealing with complex process industrial system as they require a large number of model parameters and prior knowledge. Operation data can be recorded, transmitted, and stored; these features lay the foundation for data-driven fault detection techniques [68].

Data-driven fault detection methods have been intensively studied over the past years. Great progress was seen in the Multivariate Statistical Process Monitoring (MSPM). For example, principal component analysis (PCA), partial least squares (PLS), canonical correlation analysis (CCA), independent component analysis (ICA), and other methods have been widely studied and applied [3, 912]. MSPM dimensionally reduces high-dimensional process variables to low-dimensional space by projection under normal circumstances, thus outputting a process monitoring model. To address the issue of early fault detection, because the memory monitoring charts are sensitive to incipient anomalies in the process mean, the advantages of PCA and multivariate memory monitoring schemes are used for early abnormality detection, and the results showed that the scheme could detect early anomalies in multivariate data [13]. Given complex industrial process systems, varying basic MSPM improvement schemes have been proposed. For example, to reduce computational cost and time while ensuring that important information is not lost or deleted, a dynamic simplification method which enhances the nonlinear process monitoring capability of KPLS is proposed [14]. In addition, an iterative robust kernel principal component analysis is brought forth to improve the robustness of the fault detection model by iteratively optimizing the function index and the kernel method [15]. The KPCA was revised to deal with nonlinear optimization problems. The newly improved approach greatly enhanced the fault detection performance of the original kernel principal component analysis [16]. Facts prove that not all variables of complex industrial processes completely obey the Gaussian distribution. ICA outperforms PLS as it saves the trouble of determining the input/output relationship of the system. Therefore, a new kernel independent component analysis method was driven by the aforementioned non-Gaussian nonlinear distribution. The proposed method effectively captures the nonlinear relationship in the process variables [17]. The above methods have improved the fault detection performance and model stability to a certain extent. However, the above methods all have certain assumptions, such as variables obeying a Gaussian or non-Gaussian distribution and linear or nonlinear correlation between variables. Such assumptions are unrealistic in complex industrial processes. Therefore, complex industrial processes require further research.

Given the characteristic that the process variables do not fully obey the Gaussian distribution, [18] an approach integrating ICA and PCA was proposed, which monitors non-Gaussian distributed variables and Gaussian distributed variables making use of ICA and PCA, respectively. Reference [19] proposed Gaussian and non-Gaussian dual subspace statistical process monitoring. Foremost, the D test identified the normality of the process variables, then divided the process variables into Gaussian and non-Gaussian subspaces, and finally classified the PCA and ICA models for fault detection in Gaussian and non-Gaussian subspaces, respectively. Reference [20] proposed a dynamic non-Gaussian mixture serial modeling method for industrial process monitoring, using the multivariate non-Gaussian evaluation method to divide the industrial process variables into Gaussian variable subspace and non-Gaussian variable subspace. Then, using DICA and DPCA, the two subspaces were monitored, and the monitoring performance was improved to a certain extent. Aiming at the characteristics of incomplete linear correlation of process variables, [21] proposed a new hybrid linear-nonlinear statistical model (SPCA) for nonlinear process monitoring, using PCA to extract the characteristics of linear features to monitor the linear subspace. Using KPCA can extract the characteristics of nonlinear features to monitor the nonlinear subspace, which makes better use of the underlying process to improve the monitoring performance. Reference [22] proposed a parallel PCA-KPCA (P-PCA-KPCA) model and monitoring scheme combining stochastic algorithm (RA) and genetic algorithm (GA). The GA-based optimization method was used to determine whether the parallel principal component analysis model (P-PCA) and parallel KPCA models (P-KPCA) contain variables, and the proposed method can effectively handle nonlinear processes. In addition, some studies have adopted layering, subspace division, and Bayesian decision fusion mechanisms for process monitoring, and the monitoring performance has been improved to a certain extent [2325].

Although the aforementioned research has been successful, the detection of faults in complex industrial processes where multiple features coexist remains a challenge. Complex industrial process variables always present high-dimensional, linear, nonlinear, Gaussian, and non-Gaussian coexistence. The proposed hybrid model provides a new direction for complex industrial process fault detection; however, the lack of in-depth research on nonlinear evaluation methods for process variables has led to the poor performance of existing hybrid models for complex industrial process fault detection with linear, nonlinear, Gaussian, and non-Gaussian feature coexistence. For example, the PCA-ICA model has good performance in industrial process monitoring where linear, Gaussian, and non-Gaussian features coexist, but there are always false positives and false negatives in nonlinear industrial processes. Therefore, it is important for theoretical research and engineering practice to carry out nonlinear evaluation of process variables and to build models for monitoring complex industrial processes.

In this study, a PCA-KPCA-ICA-KICA-BI mixed model is proposed for the coexistence of high-dimensional, linear, nonlinear, Gaussian, and non-Gaussian variables in complex industrial processes. First, a multivariate feature evaluation method is proposed. The original variables are classified using the Jarque–Bera test and the nonlinear discriminant method. This method calculates the kurtosis and skewness statistics of the variables. According to the normality test results, the process variables are divided into Gaussian blocks and non-Gaussian blocks; then we use Pearson correlation coefficient, maximum mutual information coefficient, and nonlinear evaluation function to divide Gaussian blocks and non-Gaussian blocks into Gaussian linear blocks, Gaussian nonlinear blocks, non-Gaussian linear blocks, and non-Gaussian blocks nonlinear blocks; then we apply the PCA-KPCA-ICA-KICA model to monitor the blocking process; and we finally propose a Bayesian inference fusion strategy to comprehensively decide the detection results of each block.

Section 2 briefly reviews PCA, ICA, kernel methods, and Jarque–Bera tests. Section 3 proposes nonlinear discriminant methods and Bayesian inference fusion strategies. Section 4 describes in detail the PCA-KPCA-ICA-KICA-BI method and its construction. Section 5 presents the application of PCA-KPCA-ICA-KICA-BI in the TE process and CSTR process and analyzes the corresponding monitoring performance. Section 6 concludes the paper.

2. Preliminaries

2.1. Principal Component Analysis and Kernel Principal Component Analysis

Principal component analysis constitutes one of the common data dimensionality reduction methods in linear correlation, Gaussian, and high-dimensional process monitoring. It maps industrial process data to a low-dimensional space and then performs the analysis according to the data characteristics of the decomposed main subspace and residual subspace. For a given process dataset , n represents the number of samples and m represents the number of variables. After mapping, X is decomposed intoin which X is the normalized data matrix, is the loading matrix obtained by eigenvalue decomposition of the covariance matrix C, is the score matrix, E is the residual subspace, and k is the number of the principal components obtained from the cumulative percent variance (CPV):in which is the eigenvalues of the covariance matrix C.

When a new sample is obtained, according to the aforementioned decomposition model, the score vector and residual subspace vector e of the new sample are as follows:

In process monitoring, PCA algorithms typically adopt residual squared prediction error (SPE) and T2 statistic to monitor operational status. T2 statistic measures the PCA main subspace, and the SPE statistic considers the residual subspace. The loading matrix contains a lot of variance variation information. Since T2 statistic is modeled based on a loading matrix with large singular values, it is sensitive to low singular value inaccuracies, which can be addressed by the SPE statistic [26, 27]. Similarly, the calculation formulas of the T2 statistics and SPE statistics of the new sample are as follows [26]:in which is the diagonal matrix containing the eigenvalues of the data variance matrix.

T 2 and SPE control limits are calculated from normal operating data. Since the T2 statistic is in line with the F distribution, given a certain degree of confidence (usually 99%), the statistical control limits and are as follows:in which the freedom of is ; is the sum of the residual eigenvalues to the power of 1, 2, and 3; and is the intermediate outcome variable, .

2.2. Independent Component Analysis

Unlike PCA, ICA can monitor the non-Gaussian processes by extracting independent non-Gaussian features from industrial process variables. The given process data vector can be represented as a linear combination of statistically independent d non-Gaussian sources , in which . After normalization and preprocessing, the relationship between the original industrial process data and the independent components is expressed as follows:in which is the mixture matrix and is the residual matrix. The basic problem of ICA is to estimate the mixture matrix and the independent component vector . Because the FastICA algorithm is characterized in fast calculation speed, this study uses the FastICA method [28] to calculate the decomposition matrix and reconstruct as follows:

To eliminate the correlation between the process data and facilitate the calculation, normalization and whitening are required. This paper adopts the commonly used PCA whitening method to whiten the process data. Through singular value decomposition, the obtained covariance matrix and whitened data are as follows:in which represents the expectation, is the covariance matrix, is the whitening matrix, and is the whitened data. Orthogonal matrix B can be calculated by the following formula:

From the above formula, can be expressed as follows:

The relationship between the decomposition matrix and can be expressed by the following formula:

When is determined, process monitoring can be performed by establishing and statistical magnitude . The specific process is as follows:

The kernel density estimation (KDE) is used to calculate the thresholds of the above two statistics [28].

2.3. Kernel Method

KPCA and KICA models map nonlinear process variables into high-dimensional space for processing through kernel methods. This section introduces the kernel method at length since the monitoring schemes of PCA and ICA models have been explicated in Sections 2.1 and 2.2.

Assume the process data is a normalized 0-mean dataset. Through nonlinear mapping , the covariance matrix of the mapped data in the feature space is as follows [28]: can be diagonalized by eigenvalue decomposition as follows:in which represents the eigenvalue and satisfies , and represents the eigenvector. Formulas (13) and (14) suggest the following:

Since the eigenvectors of the covariance matrix are spanned by samples, there must be a coefficient ():

Formulas (15) and (16) suggest the following:

The inner product of the reconstructed variables in the feature space can be described by a kernel function. This study adopts a Gaussian kernel function, which is defined as follows:

The kernel matrix is [28], and (17) can be further simplified as follows:in which and are the eigenvalues and eigenvectors of the kernel matrix . According to the above formula, and can be calculated.

2.4. Jarque–Bera Test

MSPM method usually performs parameter evaluation and feature extraction given that variables obey normal distribution. Although it is reasonable to assume that some industrial processes obey normality, this assumption is often questionable when faced with complex industrial processes. Therefore, the Jarque–Bera (JB) test for normality distribution of complex process data is proposed. In statistics, the Jarque–Bera test examines whether the sample data shows the skewness and kurtosis of a normal distribution and goodness.

Assume to be a dataset of n independent random variables; the sample skewness and kurtosis of are calculated as follows:in which ; if the process variable follows a Gaussian distribution, the skewness and kurtosis of are close to 0 and 3, respectively. The normality of the sample can be tested by the skewness and kurtosis deviation of the expected value. The formula for calculating the JB statistic is as follows:

The threshold of the statistical magnitude is calculated from the significance level and the number of samples . If , this means that the process variable obeys the normality assumption.

3. Nonlinear Discriminant Method and Bayesian Inference Fusion Strategy

3.1. Nonlinear Discriminant Method

It is of great significance to divide the linearly correlated variables and the nonlinearly correlated variables into two subblocks for separate monitoring since the complex industrial process variables may have both linear and nonlinear correlations. This section proposes a nonlinear discriminant method for this problem, which is described at length as follows.

Mutual information (MI), a nonlinear evaluation method based on communication entropy theory, quantitatively describes the nonlinear correlation between two random variables [29, 30]. If there is a strong correlation between these two variables, their MI values will become relatively large. On the contrary, if the two variables are approximately independent, the MI value will become very small. The proposal of mutual information provides great convenience for characterizing the relationship between two variables [30], but the computational complexity of the joint probability in mutual information is relatively large. Therefore, this study selects the maximum mutual information coefficient (MIC) with low computational complexity to indicate the degree of correlation between variables.

Although the maximum mutual information coefficient has a strong ability to represent linear and nonlinear correlations, it cannot clearly indicate whether the variables are linear or nonlinear. The Pearson correlation coefficient can just make up for the lack of MIC. It is obtained by dividing the covariance of the two variables by the product of the standard deviation. The Pearson correlation coefficient value is located in [−1, 1]: a value equal to 1 indicates that the two variables are perfectly linearly correlated, a value equal to −1 indicates that the two variables are perfectly linearly negatively correlated, and a value equal to 0 indicates that there is no linear relationship between the two variables. The calculation process of the MIC and Pearson correlation coefficient is as follows:in which represents the maximum mutual information coefficient of the two variables, and represents the Pearson correlation coefficient between the two variables. is the expected value, and is the variance value.

Although MIC and Pearson correlation coefficients shine in characterizing the correlation between variables, deficiencies are seen in determining whether the variables are linearly or nonlinearly correlated. Therefore, we introduce a nonlinear discriminant coefficient, which combines the strengths of MIC and Pearson’s correlation coefficient to determine whether a variable is nonlinear. Here is the calculation:in which represents the value of nonlinear discrimination, which represents the degree of nonlinear correlation of process variables. can be inferred from (23). If represents a complete nonlinear correlation between variables and represents a completely linear relationship between variables, to illustrate other cases of the value of , a threshold of is set. If means that the variables are nonlinearly correlated, this paper takes , represents the weight coefficient, and in this paper [30].

3.2. Bayesian Inference Fusion Strategy

Since the process variables are divided into multiple subspaces for separate monitoring, how to integrate the monitoring results becomes a problem. Reference [22] proposed a decision logic to determine the process running state. To improve the robustness of the model, this study adopts a Bayesian inference fusion strategy to determine the final monitoring results of the process. Bayesian inference is based on a probabilistic approach, which is similar to that in recent studies [31, 32]. Jarque–Bera and the nonlinear discriminant method divided the process dataset into Gaussian linear subspace, Gaussian nonlinear subspace, non-Gaussian linear subspace, and non-Gaussian nonlinear subspace. The corresponding monitoring statistics of each subspace are established. The failure probability of in a Gaussian linear subspace is defined as follows:in which N and F represent normal operating conditions and abnormal operating conditions, respectively. Given a confidence level , , and . The calculation process of and is as follows:in which is control limit.

Similarly, definition of the failure probability of the remaining three subspaces shares similarities with the calculation method of the Gaussian linear subspace given above. Combination of the statistical magnitude of and of Gaussian linear subspace and Gaussian nonlinear subspace and that of non-Gaussian linear subspace and non-Gaussian nonlinear subspace is as follows:

Following weighted statistic formula is established according to the above formula to facilitate the monitoring:

If the calculation result of the above formula exceeds the confidence level , it indicates possible failures of the industrial process; otherwise, the industrial process is in a normal operating state.

4. Process Monitoring Program

The flowchart of PCA-KPCA-ICA-KICA-BI is shown in Figure 1, and the specific description is as follows.

4.1. Spatial Decomposition

Step 1 normalized the historical dataset X under normal operating conditions.Step 2 calculated the value of each column of the dataset X.Step 3 divided the dataset X into Gaussian subspace and non-Gaussian subspace by JB value and JBα.Step 4 divided the Gaussian subspace and the non-Gaussian subspace into Gaussian linear subspace, Gaussian nonlinear subspace, non-Gaussian linear subspace, and non-Gaussian nonlinear subspace through the nonlinear discrimination method proposed in Section 3.1.

4.2. Fault Detection
4.2.1. Offline Model

Step 1 adopted the historical dataset X under normal operating conditions for training; Step 2 performed space decomposition; Step 3 detected the Gaussian linear subspace, Gaussian nonlinear subspace, non-Gaussian linear subspace, and non-Gaussian nonlinear subspace making use of PCA, KPCA, ICA, and KICA models; Step 4 calculated the confidence limits for each subspace.

4.2.2. Online Monitoring

Step 1 collected the currently monitored industrial process system dataset and normalized it.Step 2 divided into Gaussian linear subspaces, Gaussian nonlinear subspaces, non-Gaussian linear subspaces, and non-Gaussian nonlinear subspaces using spatial decomposition.Step 3 calculated the monitoring statistics of each word space.Step 4 learned the statistics in Step 3 and the confidence limits in Step 4 of the offline model. The final statistic BIC is calculated using the Bayesian inference fusion strategy of (27) in Section 3.2, where indicates a fault, and the opposite indicates that no fault has occurred.

5. Case Study

To verify the effectiveness of the PCA-KPCA-ICA-KICA-BI hybrid model proposed in this study, two cases were adopted to verify the performance of the model.

5.1. Tennessee Eastman (TE) Chemical Process

The Tennessee Eastman (TE) chemical process described by Downs and Vogel consists of five main units: reactor, product condenser, vapor-liquid separator, recycle compressor, and product stripper. The reactor includes four reactive substances (A, C, D, and E) and an inert substance (B). After the reactants are fed into the reactor, products G and H and by-product F are generated through a series of chemical reactions. The product stream is condensed in a condenser and then separated by a vapor-liquid separator. The uncondensed product is sent back to the reactor by the centrifugal compressor for re-reaction, and the condensed product is sent to the stripper for stripping [33]. The flowchart of the TE chemical industry is shown in Figure 2. The TE process contains 41 measured variables (22 continuous process variables, 19 component measured variables) and 12 manipulated variables. The dataset adopted for this study was downloaded from https://web.mit.edu/braatzgroup/links.html. Table 1 presents all the variables included in the TE process, and the dataset contains 52 variables and 960 samples. For the convenience of subsequent experimental analysis, Table 2 lists 21 different faults (faults are introduced after the 161st sample). In the following datasets of faults 11, 16, and 19, the PCA-KPCA-ICA-KICA-BI hybrid model proposed in this study is compared with the traditional single model to verify the effectiveness of the hybrid model.

5.1.1. Subspace Division

Based on the offline modeling in Section 4.2, the Jarque–Bera test divided the variable space into Gaussian subspaces and non-Gaussian subspaces, and the nonlinear discriminant method classified the variable space into Gaussian linear subspaces, Gaussian nonlinear subspaces, non-Gaussian linear subspace, and non-Gaussian nonlinear subspace. The results are shown in Table 3.

5.1.2. TE Process Fault Detection

Fault 11 is caused by random changes to reactor cooling water inlet temperature. PCA, KPCA, ICA, and KICA models are monitored in four subspaces. Figure 3 shows the fault detection results of fault 11 by traditional single PCA, KPCA, and ICA models and the hybrid model proposed in this study. As shown in Figures 3(a)3(c), traditional single PCA, KPCA, and ICA models can detect fault 11; the SPE and T2 fault detection accuracy of the PCA model are 0.755 and 0.529, respectively; the SPE and T2 fault detection accuracy of the KPCA model are 0.205 and 0.327, respectively; and the SPE andI2 fault detection accuracy of the ICA model are 0.229 and 0.372, respectively. As shown in Figure 3(d), the hybrid model proposed in this study can detect more fault samples than the single models, and the fault detection accuracy is 0.924, far exceeding the number of failure detection samples for a traditional single model.

To further verify the efficient performance of the PCA-KPCA-ICA-KICA hybrid model, the unknown fault 16 was selected for testing. The fault detection results shown in Figure 4(d) revealed that the fault detection accuracy of the hybrid model reached a very ideal result, 0.928. Furthermore, tests on a single model were carried out to compare and illustrate the superiority of this hybrid model. Figures 4(a)4(c) show that the SPE and T2 fault detection accuracy of the PCA model were 0.273 and 0.592, respectively; the SPE and T2 fault detection accuracy of the KPCA model were 0.535 and 0.275, respectively; and the SPE and I2 fault detection accuracy of the ICA model were 0.348 and 0.802, respectively.

The detection results of fault 11 and 16 in Figures 3 and 4 show that TE process variables have linear, nonlinear, Gaussian, and non-Gaussian distribution characteristics. The traditional single model only considers one distribution case and can only detect TE process faults caused by linearly distributed process variables, nonlinearly distributed process variables, or non-Gaussian distributed process variables, so the fault detection accuracy is relatively low. The hybrid model proposed in this study fully considers the multi-characteristic distribution of complex industrial process variables and divides the process variables into multiple subspaces using the JB test and the nonlinear evaluation method proposed in this study, and then a model with targeted distribution is used for each subspace for fault detection. Finally, the fault detection results are given using a Bayesian inference fusion strategy, and the results show that the proposed hybrid PCA-KPCA-ICA-KICA-BI model has higher fault detection accuracy for multifeatured complex industrial process fault detection.

The method proposed in this study considers the advantages of a single model and gives full play to the fault detection advantages of the single model using the JB test and nonlinear evaluation methods. Table 4 gives the TE process 21 faults for the traditional PCA model, KPCA model, ICA model, PCA-ICA hybrid model, and PCA-KPCA-ICA-KICA-BI hybrid model proposed in this study. The proposed hybrid model and the existing PCA-ICA hybrid model have higher fault detection accuracy than the existing single PCA, KPCA, and ICA models, but the PCA-ICA hybrid model does not consider the distribution characteristics of the nonlinear process variables, and the decision logic of the hybrid model is insufficient, resulting in lower fault detection. The accuracy of the existing hybrid model is lower than that of the hybrid model proposed in this study. The existing single fault detection model, the PCA-ICA hybrid model, and the hybrid model proposed in this study have a fault detection accuracy close to 1 for faults 1, 2, 6, 7, 12, 14, 17, and 18 of the TE process; however, the fault detection accuracy of faults 3, 9, and 15 is not very satisfactory. This confirms that the magnitude and obviousness of faults can lead to a relatively large difference in fault detection accuracy. Therefore, to address the problem of small fault magnitude not being easily detected, due to the ability of exponentially weighted moving average (EWMA) control scheme to detect small changes, a strategy that combines the exponentially weighted sliding average control scheme with the PCA model is proposed to detect small faults in industrial processes [34]; how to detect initial faults or faults with a low signal-to-noise ratio will be a very interesting problem. Figures 3 and 4 show that a single model is inferior to the hybrid model in fault detection performance despite its ability to detect certain fault samples, indicating its deficiency, that is, only considering one distribution of variables. Simply put, the hybrid model shines in dealing with the increasingly complex industrial processes.

TE process failure simulation experiments demonstrate the considerable advantages of the hybrid model proposed in this study. To further validate the performance of the model, the continuous stirred reactor process is used in Section 5.2.

5.2. CSTR Case

Figure 5 shows the simple flowchart of the CSTR process consisting of 9 variables. Here is the vector form of the dataset:where denotes the cooling water temperature, denotes the inlet temperature, and denote the inlet concentrations, denotes the solvent flow, denotes the cooling water flow, denotes the outlet concentration, denotes the temperature, and denotes the reactant flow [22, 33].

In this study, using the same simulation conditions as Yoon and Macgregor [33], 960 normal operating CSTR data samples were collected through the established CSTR process simulation platform as the training set, and another 160 normal samples and 800 fault samples were collected as the test set. The test set was used to verify the fault detection performance of the hybrid model proposed in this study.

5.2.1. Subspace Division

In view of the offline modeling in Section 4.2, first, Jarque–Bera test divided the CSTR process variable space into Gaussian subspaces and non-Gaussian subspaces, and the nonlinear discriminant method classified the variable space into Gaussian linear subspaces, Gaussian nonlinear subspaces, non-Gaussian linear subspace, and non-Gaussian nonlinear subspace, as shown in Table 5.

5.2.2. CSTR Process Fault Detection

Figure 6 shows the fault detection results of the CSTR process. Figures 6(a)6(d) show that the hybrid model proposed in this paper integrates the distribution characteristics of the process variables and uses the BI method to make a comprehensive decision on the fault detection results with significant advantages, where the fault detection rate of the PCA-KPCA-ICA-KICA-BI hybrid model is 1 and is lower than the FPR of the single model. Since the CSTR process is a typical nonlinear process [22, 33], the PCA model and the ICA model cannot handle the nonlinear distributed characteristic variables well, resulting in lower fault detection rates, with 0.83 for the PCA model and 0.08 for the ICA model. KPCA can map the nonlinear data to a high-dimensional space to become linear, so the KPCA model has good applicability in nonlinear process fault detection, which is verified in Figure 6(b). Comparison of Figures 6(b) and 6(d) shows that the fault detection rate of the KPCA model is comparable to that of the hybrid model proposed in this paper, which indicates that full consideration of the distribution of process variables and the use of a targeted fault detection model is helpful in improving the fault detection accuracy. The fault detection results of the CSTR process show that the hybrid PCA-KPCA-ICA-KICA-BI model can handle complex multi-characteristic process variables well and improve the fault detection accuracy of complex industrial processes.

6. Conclusion

A hybrid fault detection method based on principal component analysis, kernel principal component analysis, independent component analysis, kernel independent component analysis, and Bayesian inference (PCA-KPCA-ICA-KICA-BI) is proposed for complex industrial processes. First, the Jarque–Bera test and the nonlinear discriminant method divided the feature variables into Gaussian linear blocks, Gaussian nonlinear blocks, non-Gaussian linear blocks, and non-Gaussian nonlinear blocks. Each division was monitored by the PCA-KPCA-ICA-KICA-BI model, and the corresponding results were given comprehensively by the Bayesian inference. Finally, the efficient performance is verified by the Tennessee Eastman (TE) and Continuous Stirred Tank Reactor (CSTR) processes. The traditional single model has a fault detection accuracy of 0.68, and the hybrid PCA-ICA model has a detection accuracy of 0.72, which means that more faults are considered normal, while the hybrid model proposed in this paper has a fault detection accuracy of 0.86, which is a significant improvement.

Complex industrial processes are characterized by strong coupling of variables, and any small fault can cause huge safety hazards, which makes it difficult to collect data on actual industrial process faults. Therefore, this study used TE and CSTR processes with complex characteristics, such as high-dimensional, linear, nonlinear, Gaussian, and non-Gaussian features, to verify the fault detection performance of the hybrid model proposed in this paper. The analysis of the fault detection results shows that the PCA-KPCA-ICA-KICA-BI hybrid model can fully consider the variable distribution characteristics of TE and CSTR processes and has a higher fault detection accuracy than the traditional single models and the PCA-ICA hybrid model. The application of the hybrid model proposed in this study to the TE process and the CSTR process shows that the hybrid model has good prospects for securing complex industrial processes. To further optimize and improve the PCA-KPCA-ICA-KICA-BI hybrid model and to enhance the applicability of the model, further research directions could be suggested in the following areas:(1)Initial failures with insignificant changes in complex industrial process data could be detected.(2)Compound failures could be detected.(3)Most current fault detection models require a large number of data samples. However, in the pursuit of a safe and sound industrial process, intriguing is how to realize fault detection based on a small number of samples.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (52065065) and Xinjiang University Doctoral Research Start-Up Fund Project (BS190216).