Monitoring of Multimode Processes Based on Subspace

Apr 9, 2015 - In this paper, an effective monitoring method based on subspace ...... and its application in fault detection of multimode processes Che...
0 downloads 0 Views 2MB Size
Article pubs.acs.org/IECR

Monitoring of Multimode Processes Based on Subspace Decomposition Shuai Li,*,†,‡ Xiaofeng Zhou,†,‡ Haibo Shi,†,‡ Zhi Qiao,§,∥ and Zeyu Zheng†,‡ †

Shenyang Institute of Automation, and ‡Key Laboratory of Network Control System, Chinese Academy of Sciences, Shenyang, 110016, PR China § NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, Singapore, 117456, Republic of Singapore ∥ Department of Physics and Centre for Computational Science and Engineering, National University of Singapore, Singapore, 117542, Republic of Singapore ABSTRACT: This paper presents a new monitoring method for multimode processes based on subspace decomposition. In the proposed method, the influence of quality variables and multimode information are considered in multimode processes modeling, which is crucially important to ensure industrial production safety and quality stabilization. Process data are decomposed into the global common subspace and the local specific subspace and monitoring is performed in each subspace to simplify the model structure. Two experiments: penicillin fermentation processes and practical foods industrial production processes, have been used to demonstrate the excellent performance of the proposed method.

1. INTRODUCTION Process monitoring is significant and attractive for industrial production safety and quality stabilization.1−5 Multimode is one of the most significant features in the modern industrial production processes.6 Some industrial production processes frequently change because of various factors, such as alterations of raw materials and compositions, applications of different manufacturing strategies, etc.7 To ensure industrial production safety and quality stabilization of products, it is vital to develop effective monitoring methods for multimode processes.6 Multivariate statistical process monitoring (MSPM) methods, such as principal component analysis (PCA), partial leastsquares (PLS) etc., may have unsatisfactory performance when directly used for multimode processes.8 To track multimode characteristics, multiple models methods,9 model library-based methods,10 localized Fisher discriminant analysis,11 and Gaussian mixture model approach6,12,13 have been proposed. Multiple models methods9 were developed by building different monitoring models for different modes that needed complete process data of each mode.14 Although it may be effective for each mode, when there are numerous modes, it is clumsy.15 Moreover, the correlations between different modes that possess useful information in modeling and monitoring are neglected. Ge et al.16 proposed an adaptive local model for multimode online monitoring, which introduced the just-intime-learning (JITL) strategy to solve the multimode problem. Rashid et al.17 presented a hidden Markov model (HMM) for multimode processes monitoring, which utilized local ICA models to characterize various modes adaptively. The abovementioned methods mainly consider the local characteristic in each mode, but the global characteristic is also significant. We15 considered the global characteristic and presented a subspace separation method based on manifold learning and kernel principal component analysis (KPCA), which reveals that each mode has part of the same process characteristics as in © 2015 American Chemical Society

multimode processes. Then the relationships of input and output data sets were introduced, and a novel subspace separation method was presented in the context of manifold learning and kernel partial least-squares (KPLS), which reveals that each mode can be divided into two subspaces.14 It should be noted that these methods14,15 are only appropriate for multimode processes with equal mode length because each specific subspace is separated by subtracting an identical common subspace from the global space in each mode, which cannot be suitable for practical industrial production processes. Moreover, multimode information is not considered in extracting common subspace, which may lead to the loss of partial valuable information in multimode modeling. In this paper, an effective monitoring method based on subspace decomposition is developed for multimode processes. To fulfill this purpose, each mode of the proposed method is decomposed into the global common subspace and the local specific subspace, which does not require each mode to have equal mode length. Multimode information and the influence of quality variables are considered. Moreover, global and local monitoring strategy and partial derivative contribution analysis are developed. The main contributions of this paper lie in the following aspects: (1) it provides a potential solution to the issue of nonlinear multimode processes monitoring in an efficient data-driven manner; (2) local and global characteristics are used for multimode processes modeling; (3) it is unnecessary that each mode has equal mode length for subspace decomposition; (4) the correlations between different modes, multimode information, and the influence of quality Received: Revised: Accepted: Published: 3855

December 2, 2014 March 17, 2015 April 1, 2015 April 9, 2015 DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research

ships to decompose the process data into the global common subspace and the local specific subspace. In general, the structure information on data can be considered from the following sides: variable space and sample space.26 The global and local structures are crucial for process monitoring since the global structure defines the outer shape of the process data set and the local structure presents inner organization.1 It can decompose global and local characteristics and provide more meaningful interpretation for multimode processes. The global common subspace is defined as the part with global structure information on all modes, revealing neighborhood relationships of overall input data set, multimode information, and influences of output quality variables. The local specific subspace is defined as the part with local structure information on each mode. In conclusion, multiscale methods have the capabilities for noise-reduction and multiscale modeling and monitoring. However, the proposed method has the capabilities for multimode modeling and monitoring. 2.2. Subspace Separation. To overcome the disadvantages of multiple models methods, we considered the global characteristic and presented a subspace separation method.14 The common subspace is obtained according to the neighborhood relationships of input and output data sets. After the common subspace Φ(Xg) is obtained, the specific subspace Φ(Xs,m) is calculated:

variables are considered. Related analyses and discussions are stated to better illustrate the proposed approach. The paper is organized as follows. In section 2, we briefly describe multimode, subspace, and the subspace separation method. In section 3, the proposed method is presented in details. Then, two experiments including penicillin fermentation processes and practical foods industrial production processes are presented in section 4. Finally, some conclusions are presented in section 5.

2. PRELIMINARIES 2.1. Multimode and Subspace. The operating conditions, such as temperature, pressure, etc., have to be adjusted to meet the production specifications, which causes various operation modes.18 In multimode processes, different modes have their respective specific, similar characteristics and duration time.14,15,19 A mode is defined as the long duration process with the same statistical characteristics, which is one part of the whole multimode processes.7 A simple tutorial example of mode is stated. For data sets Xm×n = {Xm1 × b1,Xm2 × b2,···,XmB × bB}, Xm1 × b1 denotes mode 1, where n = (ΣiB= 1bi), m is the number of variables, n is the number of samples, B denotes the number of modes, bi is the number of samples in each mode. However, mode is different from scale. A scale is defined as a deterministic or stochastic component of data. A simple tutorial example of scale is presented. For data sets Xm×n = {Xm1 × n···XmL × n}, Xm1 × n denotes scale 1, where L is the number of scales. Multiscale process monitoring usually utilizes wavelet transform to decompose the original process measurements into their multiscale components according to time and frequency characteristics. It enables a separation of deterministic and stochastic features and offers a more meaningful physical interpretation of process phenomena.20−25 In this paper, as shown in Figure 1, the main idea of subspace decomposition is decomposing the processes into multiple subspaces for monitoring. Compared with multiscale method, the proposed multimode monitoring utilizes local neighborhood relationships, multimode information and quality relation-

Φ(X s , m) = Φ(X m) − Φ(Xg ),

(m = 1, ..., B)

(1)

where m denotes different modes, Φ(Xg) is the identical common subspace for each mode, Φ(Xm) is the global space of mode m, and B denotes the number of modes. The problem of weight matrix stems from its inability to take similar data in each mode separately into account, as its only focus is on preserving local neighborhood relationships of all multimode data. Hence, it is necessary to construct a more efficient weight matrix by utilizing multimode information to improve the tightness of similar data in the same mode and push data in different modes far apart. Moreover, the subspace separation method14 is only appropriate for multimode processes with equal mode length because each specific subspace is obtained by subtracting an identical common subspace from the global space in each mode. In this paper, our subspace decomposition method considering multimode information is presented, which does not need each mode to have equal mode length and can be suitable for practical industrial production processes.

3. THE PROPOSED METHOD The complex nonlinear multimode problem turns into a combination of a few simple subspace problems by subspace decomposition. The proposed method consists of two parts: (1) in the offline modeling phase, a subspace decomposition model is trained by normal samples, and monitoring statistics and control limits are obtained in each subspace; (2) in the online monitoring phase, the statistics of each sample are computed in each subspace, while the fault and faulty variable are recognized. In this paper, it is considered that each mode can be decomposed into two subspaces. The influence of quality variables and multimode information are considered, which is crucial to ensure quality stabilization. The global structure information on all modes is preserved in the global common subspace and the local structure information on each mode is

Figure 1. Proposed approach. 3856

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research preserved in the local specific subspace. A list including the indices and the symbols used in the proposed method is described in Table 1.

n

min e(W 0) =

2

n

∑ || Φ(x i) − ∑ W ij0Φ(x j)|| i=1

j=1

n

notation

description

X Y n m q B bi Φ Φ(X) k Φ(Xgc) Φ(Xls) P K W0 Winij Wbetween ij L tr() δ, β In Q H U λi xnew Φgc,new ΛGMS T2gc, SPEls T2gc,α, Qils,α

input data set output quality variables number of samples number of input variables number of output quality variables number of modes number of samples in each mode nonlinear mapping nonlinear mapping of input data set number of nearest neighbors global common subspace of Φ(X) local specific subspace of Φ(X) projection matrix kernel function weight matrix without multimode information multimode weights in the same mode multimode weights between different mode mode information trace operator trade-off parameters identity matrix score matrix of output quality variables load matrix of output quality variables combination coefficients matrix lagrange coefficients new sample global common subspace of xnew covariance matrix of Φ(Xg) monitoring statistics control limits

(3)

where the weight Wij0 only reflects the local distance relationships of Φ(xj) and Φ(xi), which neglects multimode information. In this paper, the geometrical interpretations based on the marginal perspective27 are applied to solve the problem. The weights Winij in the same mode and Wbetween between ij different modes in the proposed approach are defined as ⎧ W in = 1, if j ∈ J , L(x ) = L(x ) i j i ⎪ ij ⎪ between ⎨ W ij = −1, if j ∈ Ji , L(x i) ≠ L(x j) ⎪ ⎪ W in = W between = 0, else ij ⎩ ij

(4)

where L(xi) denotes mode information. Wijin and Wijbetween consider the tightness of similar data in the same mode and the separability in different modes, respectively. In the proposed subspace decomposition method, local neighborhood relationships with multimode information are preserved in the global common subspace. In eq 3, when the weight Wij0 increases, the distance must be reduced in order to minimize the summation. Therefore, the tightness of similar data in the same mode can be improved by increasing the weights, and data in different modes can be pushed far apart by decreasing the weights. Thus, the loss function with multimode information can be presented as follows: min e(Φ(X gc)) n

=∑ || Φ(x gc, i) − i=1

n

∑ (δ W ij0 + (1 − δ) j=1

× (W ijin + W ijbetween))Φ(x gc, j)||2 = tr(Φ(X gc)M mΦT (X gc)) s.t. ΦT (X gc)Φ(X gc) = In

(5)

where tr() is trace operator, δ ∈ [0,1] is a trade-off parameter for balancing the two terms in the numerator of eq 5, In denotes an identity matrix, and Mm is symmetric and positive semidefinite: M m = MTm = (In − (δ W ij0 + (1 − δ)(W ijin + W ijbetween)))T · (In − (δ W ij0 + (1 − δ)(W ijin + W ijbetween)))

|| Φ(x i) − Φ(x j)|| (Φ(x i) − Φ(x j))T (Φ(x i) − Φ(x j)) = K(x i , x i) − 2K(x i , x j) + K(x j, x j)

(W ij0 = 0, j ∉ Ji , i = 1, ..., n)

j=1

Input data set X = {x1,x2,···,xn} ∈ Rm and quality variables Y = {y1,y2,···,yn} ∈ Rq are obtained, where n = (ΣiB= 1bi), m and q are the number of input and quality variables, respectively, n is the number of samples, B denotes the number of modes, bi is the number of samples in each mode. To improve the ability to handle the nonlinear characteristic data, X is mapped to highdimensional feature space Φ(X) = [Φ(x1),Φ(x2),···,Φ(xn)]. Subsequently, k nearest neighbors of each data point in highdimensional feature space are constructed on the basis of distance in high-dimensional feature space:

=

∑ W ij0 = 1

s.t.

Table 1. Descriptions of Indexes and Symbols Used in the Proposed Method

(6)

Quality stabilization of products in a practical industrial production process is one of the one of the greatest problems of concern, especially in multimode processes with alterations of raw materials and compositions, different manufacturing strategies, etc.7 Hence, to ensure the quality stabilization, the relationships of the global common subspace and output quality variables are introduced. Q = [q1,...,qd] ∈ Rn denotes a score matrix of output quality variables YT, Q = YTH, H = [h1,...,hd] ∈ Rq is the load matrix of YT, ||hi|| = 1. Thus, the

(2)

where K(xi,xj) = ΦT(xi)Φ(xj). For each sample Φ(xi) in highdimensional feature space, Ji denotes the neighbors of Φ(xi) and the weight matrix without multimode information W0 can be calculated by solving the optimization problem with constraints: 3857

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research global common subspace Φ(Xgc) [Φ(xgc,1);Φ(xgc,2);···;Φ(xgc,d)] ∈ Rn can be calculated as

where Φ(Xls) = [Φ(Xls,1),...,Φ(Xls,B)] ∈ Rbi is the local specific subspace of each mode and bi is the number of samples in each mode. In the proposed method, mode information, two statistics and B + 1 control limits can be calculated for monitoring in multimode processes.30 When a new sample xnew is obtained, its mode information should be first recognized on the basis of weight matrix W = δW0ij + (1 − δ)(Winij + Wbetween ). W can be ij denoted as

=

min βtr(Φ(x gc, j)M mΦT (x gc, j)) − (1 − β)Φ(x gc, j)YT hi ⎧ ΦT (x )Φ(x ) = I ⎪ gc, j gc, j n s.t. ⎨ T ⎪h h = 1 ⎩ i i (7)

⎡ W11 W12 b1× b2 ⎢ b1× b1 ⎢ 21 W b × b W b222 × b2 W=⎢ 2 1 ⎢ ⋮ ⋮ ⎢ B 1 B 2 ⎢W ⎣ bB × b1 WbB × b2

where β ∈ [0,1] is also a trade-off parameter, j = 1,...,d. It is considered that the problem of only calculating the global common subspace is unable to learn out-of-sample in monitoring. Then a projection matrix is calculated to tackle the out-of-sample learning problem. P = [p1,...,pd] denotes the projection matrix. Thus, each data point in high-dimensional feature space is mapped to the global common subspace, that is Φ(X gc) = PT Φ(X)

min e(L(x new) = i)

min βtr(pTi Φ(X)M mΦT (X)pi) + (1 − β)pTi Φ(X)YT hi

= ||[|| Φ(x new) − Φ(x Bi)||2 , ...,

⎧ pT p = 1 ⎪ i i s.t.⎨ ⎪ hT h = 1 ⎩ i i

|| Φ(x new) − Φ(x Bi + bi − 1)||2 ]W biii × bi ||

(i = 1, ..., B) (15)

where (Bi = Σii−1 = 1bi + 1(B1 = 1,i = 2,···,B)) is the first number of each mode in weight matrix W and bi is the number of samples in each mode. After determining mode information on a new sample, the global common subspace Φgc,new of xnew can be computed as

(9)

However, Φ(X) in high-dimensional feature space is unknown on the basis of kernel function theory.28 To solve the out-ofsample learning problem in monitoring, it is assumed that there exists combination coefficients matrix U = [u1,...,ud] ∈ Rn, such that (i = 1, ..., d)

Φgc,new = PT Φ(x new) = UTΦT (X)Φ(x new)

(10)

= UTKT(x new , X)

The optimization problem with constraints of eq 9 can be expressed as min

(14)

The mode information L(xnew) of xnew can be determined by obtaining its minimum reconstructed weight error in corresponding mode:

(8)

The projection matrix P can be calculated by the following optimization problem with constraints:

pi = Φ(X)u i

⋯ W1b1B× bB ⎤ ⎥ 2B ⎥ ⋯ W b2 × bB ⎥ ⋱ ⋮ ⎥⎥ ⎥ ⋯ WbBB B × bB ⎦

(16)

Then two statistic indices T2gc (Hotelling’s T2) of the global common subspace and SPEls (squared prediction error, SPE) of the local specific subspace are calculated as

βtr(uTi Φ(X)T Φ(X)M mΦT (X)Φ(X)u i)

+(1 − β)uTi Φ(X)T Φ(X)YT hi

Tgc2 = ΦTgc,newΛ−GC1Φgc,new

=βtr(uTi KM mKu i) + (1 − β)uTi KYT hi

= K(x new , X)UΛ−GC1UTKT(x new , X)

T ⎧ ⎪ ui ui = 1 s.t.⎨ ⎪ T ⎩ hi hi = 1

2 SPE ls = || Φ̃ (x new)||

(11)

2

= ||(ΦT (x new) − ΦTgc,newPT )||

The Lagrange multiplier method is used to solve the above problem, and the following eq 12 can be obtained: 2

T

(β KM mK + (1 − β) /4λ 2 KY YK)u i = λ1u i

= K(x new , x new) − 2K(x new , X)UUTKT(x new , X)

(12)

+ K(x new , X)UUTKUUTKT(x new , X)

In conclusion, the combination coefficients vectors ui (i = 1,...,d) are the eigenvectors corresponding to the smallest d eigenvalues of (βKMmK + (1 − β)2KYTYK/4λ2).29 The global common subspace Φ(Xgc) and the local specific subspace Φ(Xls) can be decomposed as below:

(18)

where ΛGC is the covariance matrix of the global common subspace and it can be expressed as ΛGC =

ΦT (X) = ΦT (X gc)PT + ΦT (X ls) = ΦT (X gc)PT + [ΦT (X ls,1); ...; ΦT (X ls, B)]

(17)

=

(13) 3858

Φ(X gc)ΦT (X gc) n−1

=

PT Φ(X)ΦT (X)P n−1

UTΦT (X)Φ(X)ΦT (X)Φ(X)U UTK2U = n−1 n−1

(19)

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research K(x new , X) = ΦT (x new)Φ(X) = [K(x new , x1), K(x new , x2), ..., K(x new , x n)] (20)

Finally, B+1 control limits of each subspace can be calculated by a F-distribution with confidence coefficient α31 and a weighted Chi-squared distribution,32 respectively. 2 Tgc, α =

d(n2 − 1) Fd , n − d , α n(n − d)

Q ls,i α = giχh2, α i

(i = 1, ..., B)

(21) (22)

μ2i /Si,

where gi = Si/2 μi, hi = 2 μi and Si are the estimated mean and variance of the SPEls in each local specific subspace from the normal multimode data, respectively. Contribution analysis is widely used for fault identification, which indicates that the variable with the largest contribution to the monitoring indices is the fault variable.33 For nonlinear methods, variable contribution can be characterized by the partial derivative and larger partial derivative means larger contribution.33−35 To recognize the fault variable, contribution analysis of multimode processes in this paper can be obtained by partial derivatives. The partial derivative of the kernel function for the lth variable vl is calculated as35

Figure 2. Flowchart of the proposed method.

∂K(x new , X) 2 = − [(x new, l − x1, l)2 K(x new , x1), ..., vl c

Kl =

(x new, l − x n , l)2 K(x new , x n)]

(2) (3)

(23)

where vl is the lth variable, xi,l denotes value of the lth variable in xi. For a new sample, contributions of the lth variable to the global common subspace and the local specific subspace are calculated as T2

cont l gc = +

∂Tgc2 ∂vl

(5) (6)

1 = |K lUΛ−GMS UTKT(x new , X)

1 K(x new , X)UΛ−GMS UTKTl |

ls contSPE = l

(4)

(7) (24)

(8)

∂SPE ls = |K lUUTKUUTKT(x new , X) ∂vl

− 2K(x new , X)UUTKTl |

standardized input data set X and quality variables Y are obtained. k nearest neighbors of each data point in highdimensional feature space are first constructed according to eq 2. The weight matrix with multimode information is calculated according to eqs 3 and 4. The global common subspace and the local specific subspace are obtained according to eqs 7−13. Two statistics and B+1 control limits are calculated for monitoring according to eqs 16−25. For online monitoring, statistic indices of faulty data are calculated according to eqs 17−20 and 23−25. Mode information on faulty data is determined according to eqs 14 and 15. Faults and faulty variables can be obtained according to the control limits of offline modeling.”

4. EXPERIMENT RESULTS The purpose of this section is to validate the effectiveness of the proposed method by the experiments of penicillin fermentation processes and practical foods industrial production processes.

(25)

After the contributions of all variables are obtained, each control limit of the lth variable is Cl = m(contl) + 2.5758s(contl), where m(contl) and s(contl) are the mean and standard deviation of contl, respectively. The number 2.5758 in the contribution relation is commonly used.35,36 The fault variable is considered to be responsible for the current fault.37 The flowchart of the proposed method is described in Figure 2. It is obvious that the proposed approach can decompose nonlinear multimode processes, and monitor and identify fault variables. In addition, multimode information is reflected directly in process modeling. The out-of-sample learning problem in monitoring is solved by computing a projection matrix. A simple tutorial example of the proposed method is as follows: (1) Normal history data are used for modeling and faulty data are used for monitoring. For offline modeling,

Figure 3. Flowchart of penicillin fermentation processes. 3859

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research

Figure 5. Monitoring result of fault in penicillin fermentation processes using the proposed method: (a) monitoring, (b) contribution analysis. In panel a, red lines denote control limits of different subspaces, black lines denote statistic indices. In panel b, different colors denote nine different variables. In monitoring, there is one control limit in the global common subspace according to eq 21. The control limits of different modes in the local specific subspace is different according to eq 22. In contribution analysis, the control limits of different variables are different according to eqs 24 and 25.

4.1. Penicillin Fermentation Processes. The penicillin fermentation processes have the characteristics of nonlinear and multimode. Most of the necessary cell mass is generated in the initial preculture mode. Cells continue to grow to be penicillin in the fed-batch mode.38 Compared with the initial preculture mode, the trajectories of various variables are stable in the fedbatch mode. The multimode information can be used for multimode modeling. The data can be obtained using PenSimv2.0 simulation software, which provides a standard platform for penicillin fermentation processes monitoring and fault identification. The flowchart of the penicillin fermentation processes and the trajectories of various variables in nominal conditions are illustrated in Figures 3 and 4.38 The reaction time of each penicillin fermentation process is 400 h, which contains a preculture mode of about 45 h and a fed-batch mode of about 355 h. The sampling interval is 1 h.38 Nine process variables and one quality variable of the processes are shown in Table 2. In this paper, six runs are generated from the simulator for multimode modeling under normal operating conditions. A faulty run in which instantaneous flow at the entrance is suddenly decreased at 46 h and maintained all the time is used for monitoring and discovering the fault variable. When a fault occurs, partial derivative contribution analysis can be used to identify the fault variable.

Figure 4. Trajectories of each variable in nominal conditions.

Table 2. Input and Quality Variables in Penicillin Fermentation Processes input variables 1 2 3 4 5 6 7 8 9

aeration rate (L/h) agitator power (W) substrate feed speed (L/h) generated heat (kcal/L) culture volume (L/h) carbon dioxide concentration (mmol/L) PH fermentor temperature (K) dissolved oxygen saturability (%)

quality variables 1 penicillin concentration (g/h)

3860

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research

Figure 6. Flowchart of the foods industrial production processes.

Table 3. Input and Quality Variables in Foods Production Processes input variables 1 instantaneous flow at the entrance (kg/h) 2 moisture at the entrance (%) 3 return air temperature (°C) 4 tank temperature (°C) 5 actual moisture at the first exit (%) 6 actual temperature at the first exit (°C) 7 high-speed return air temperature (°C) 8 high-speed loose steam flow (kg/h) 9 furnace temperature (°C) 10 hybrid wind temperature (°C) 11 actual temperature at the exit (°C) 12 actual moisture at the exit (%)

quality variables 1 2 3 4

moisture (%) padding value (g/cm3) entire product rate (%) broken product rate (%)

The standardized matrix is obtained from the original data matrix. The global common subspace and the local specific subspace of two modes are decomposed by the proposed method. The control limits are calculated. Monitoring and fault identification of each subspace are shown in Figure 5. The statistic indices of samples which are greater than the corresponding control limits are faulty. It is shown that fault occurs from the 46th to the 400th sample. Different control limits denote corresponding identified mode information. The variable contribution which is greater than the corresponding control limit denotes a faulty variable. Different control limits in specific subspace in Figure 5 indicate that mode information is exactly recognized. Moreover, it is proved that instantaneous flow at the entrance is the fault variable, which shows that the proposed approach is suitable for multimode processes monitoring. 4.2. Foods Industrial Production Processes. The flowchart of the foods industrial production processes is illustrated in Figure 6. The products with various specifications are manufactured continuously and require monitoring to ensure production safety and quality stabilization. Multimode information for multimode modeling can be obtained according to the products with various specifications. Different modes may have unequal duration in practical foods industrial production processes. Input and quality variables are selected, as shown in Table 3. All variables are measured at practical foods production processes, providing abundant information for modeling and monitoring. The history normal data and multimode information are used for offline modeling. Each mode is decomposed and the intrinsic multimode structure can be efficiently preserved in the global common subspace. The faulty data are used to verify the accuracy of monitoring. The sampling interval is 1 min. Two products with different

Figure 7. Monitoring result of Fault 1 using the proposed method: (a) monitoring, (b) contribution analysis. In panel a, red lines denote control limits of different subspaces, black and green lines denote statistic indices. In panel b, different colors denote 12 different variables. In monitoring, there is one control limit in the global common subspace according to eq 21. The control limits of different modes in the local specific subspace are different according to eq 22. In contribution analysis, the control limits of different variables are different according to eqs 24 and 25.

specifications in the same production line have similar characteristics in a common perspective and also specific characteristics and effects on multimode processes, which are decomposed based on the proposed method. It should be noted that the proposed approach is also fit for the conditions of multimode processes with more than two modes. Three fault data sets are used to validate the effectiveness of monitoring by the proposed method. Fault 1: return air temperature is in the abnormal condition from 20th to 80th sample. Fault 2: tank temperature is deviant from 25th to 80th sample. Fault 3: there is a failure of drying machine, which 3861

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research

Figure 8. Monitoring result of Fault 2 using the proposed method: (a) monitoring, (b) contribution analysis. In panel a, red lines denote control limits of different subspaces, black and green lines denote statistic indices. In panel b, different colors denote 12 different variables. In monitoring, there is one control limit in the global common subspace according to eq 21. The control limits of different modes in the local specific subspace are different according to eq 22. In contribution analysis, the control limits of different variables are different according to eqs 24 and 25.

Figure 9. Monitoring result of Fault 3 using the proposed method: (a) monitoring, (b) contribution analysis. In panel a, red lines denote control limits of different subspaces, black and green lines denote statistic indices. In panel b, different colors denote 12 different variables. In monitoring, there is one control limit in the global common subspace according to eq 21. The control limits of different modes in the local specific subspace are different according to eq 22. In contribution analysis, the control limits of different variables are different according to eqs 24 and 25.

results in disordered actual temperature at the exit from the 60th sample to the 80th sample. In this case, all samples are obtained at equal sampling intervals. Monitoring and fault identification of each subspace by using the proposed approach for Faults 1−3 are shown in Figures 7−9. In Figure 7, it is obviously shown that Fault 1 occurs from the 20th to 80th samples and mode information can be exactly recognized. Moreover, it is evident that return air temperature is a fault variable, which shows that accurate fault identification result can be obtained by the proposed approach. Similarly, as shown in Figure 8, Fault 2 occurs from the 25th to 80th samples and mode information can be exactly recognized. In addition, tank temperature is a fault variable. Mode information and fault variable of Fault 3 are detected accurately in Figure 9. The results explain that the proposed monitoring approach for multimode processes can detect failure accurately and which mode is faulty. The proposed approach provides a strong clue to determine which variables are responsible for the output products quality. When faulty variables are recognized, quality control can be realized by adjusting faulty variables, which can improve industrial production safety and quality stabilization. For comparing purpose, one conventional multiple model method that builds a kernel partial least-squares (KPLS) model for each mode is performed for the foods industrial production

Figure 10. Monitoring result of Fault 3 using the conventional multimode processes monitoring method. Red lines denote control limits of different subspaces, black and green lines denote statistic indices. In the multimodel method, the control limits of different modes are different.

processes. The proposed method considers global and local characteristics of multimode processes. When there are various modes, the multiple model method is ineffective and needs more complete process data of each mode. Therefore, the proposed approach is more suitable for multimode processes 3862

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research

(10) Tenenhaus, M.; Vinzi, V. E. PLS regression, PLS path modeling and generalized Procrustean analysis: A combined approach for multiblock analysis. J. Chemom. 2005, 19, 145−153. (11) Chiang, L. H.; Kotanchek, M. E.; Kordon, A. K. Fault diagnosis based on Fisher discriminant analysis and support vector machines. Comput. Chem. Eng. 2004, 28, 1389−1401. (12) Yu, J. A new fault diagnosis method of multimode processes using Bayesian inference based Gaussian mixture contribution decomposition. Eng. Appl. Artif. Intell. 2013, 26, 456−466. (13) Yu, J.; Qin, S. J. Multimode process monitoring with Bayesian inference-based finite gaussian mixture models. AIChE J. 2008, 54, 1811−1829. (14) Zhang, Y. W.; Li, S. Modeling and monitoring of nonlinear multi-mode processes. Control Eng. Pract. 2014, 22, 194−204. (15) Zhang, Y. W.; Li, S. Modeling and monitoring between-mode transition of multimode processes. IEEE Trans. Ind. Inform. 2013, 9, 2248−2255. (16) Ge, Z. Q.; Song, Z. H. Online monitoring of nonlinear multiple mode processes based on adaptive local model approach. Control Eng. Pract. 2008, 16, 1427−1437. (17) Rashid, M. M.; Yu, J. Hidden Markov model based adaptive independent component analysis for chemical process monitoring. Ind. Eng. Chem. Res. 2012, 51, 5506−5514. (18) Tan, S.; Wang, F. L.; Peng, J.; Chang, Y. Q.; Wang, S. Multimode process monitoring based on mode identification. Ind. Eng. Chem. Res. 2012, 51, 374−388. (19) Zhang, Y. W.; Chai, T. Y.; Li, Z. M.; Yang, C. Y. Modeling and monitoring of dynamic processes. IEEE Trans. Neur. Net. Learn. 2012, 23, 277−284. (20) Bakshi, B. R. Multiscale PCA with application to multivariate statistical process monitoring. AIChE J. 1998, 44 (7), 1596−1610. (21) Yoon, S.; MacGregor, J. F. Principal-component analysis of multiscale data for process monitoring and fault diagnosis. AIChE J. 2004, 50 (11), 2891−2903. (22) Zhang, Y. W.; Ma, C. Fault diagnosis of nonlinear processes using multiscale KPCA and multiscale KPLS. Chem. Eng. Sci. 2011, 66, 64−72. (23) Zhang, Y. W.; Li, S.; Hu, Z. Y. Improved multi-scale kernel principal component analysis and its application for fault detection. Chem. Eng. Res. Des. 2012, 90, 1271−1280. (24) Wen, C. L.; Zhou, F. N.; Wen, C. B.; Chen, Z. G. An extended multi-scale principal component analysis method and application in anomaly detection. Chin. J. Electron. 2012, 21, 471−476. (25) Lau, C. K.; Ghosh, K.; Hussain, M. A.; Hassan, C. Fault diagnosis of Tennessee Eastman process with multi-scale PCA and ANFIS. Chemom. Intell. Lab. 2013, 120, 1−14. (26) Zhang, M. G.; Ge, Z. Q.; Song, Z. H.; Fu, R. W. Global-local structure analysis model and its application for fault detection and identification. Ind. Eng. Chem. Res. 2011, 50 (11), 6837−6848. (27) Zhang, Z.; Chow, T. W.; Zhao, M. B. Trace ratio optimizationbased semi-supervised nonlinear dimensionality reduction for marginal manifold visualization. IEEE Trans. Knowl. Data Eng. 2013, 25, 1148− 1161. (28) Zhang, X.; Yan, W. W.; Shao, H. H. Nonlinear multivariate quality estimation and prediction based on kernel partial least squares. Ind. Eng. Chem. Res. 2008, 47, 1120−1131. (29) Xiang, S. M.; Nie, F. P.; Pan, C. H.; Zhang, C. S. Regression reformulations of LLE and LTSA with locally linear transformation. IEEE Trans. Syst. Man Cybernetics. B 2011, 41, 1250−1262. (30) Ge, Z. Q.; Song, Z. H. Multimode process monitoring based on Bayesian method. J. Chemometr. 2009, 23, 636−650. (31) Lowry, C. A.; Montgomery, D. C. A review of multivariate control charts. IIE Trans. 1995, 27, 800−810. (32) Nomikos, P.; MacGregor, J. F. Multivariate SPC charts for monitoring batch processes. Technometrics 1995, 37, 41−59. (33) Fan, J. C.; Qin, S. J.; Wang, Y. Q. Online monitoring of nonlinear multivariate industrial processes using filtering KICA-PCA. Control Eng. Pract. 2014, 22, 205−216.

monitoring, which is shown in experimental results for Fault 3 in Figures 9 and 10 based on the false negative rate and false positive rate.

5. CONCLUSIONS A new multimode processes monitoring method based on subspace decomposition is proposed and its experiments for monitoring and fault identification are illustrated. The proposed method has attractive features, which improve the disadvantages of multiple model methods. Moreover, the multimode information is considered, which could be useful in multimode processes modeling and monitoring. The proposed subspace decomposition method is building an integrated model for multimode monitoring and can be suitable for practical industrial production processes. The influence of quality variables and multimode information are used for multimode modeling to improve the accuracy of monitoring. The experimental studies on penicillin fermentation processes and practical foods industrial production processes demonstrate that the proposed approach is effective and feasible. It should be noted that the proposed approach is also fit for the conditions of multimode processes with more than two modes.



AUTHOR INFORMATION

Corresponding Author

*E-mail: [email protected]. Notes

The authors declare no competing financial interest.



ACKNOWLEDGMENTS This work is supported by the National High Technology Research and Development Program of China (2013AA040705-1).



REFERENCES

(1) Tong, C. D.; Yan, X. F. Statistical process monitoring based on a multi-manifold projection algorithm. Chemom. Intell. Lab. 2014, 130, 20−28. (2) McClure, K. S.; Gopaluni, R. B.; Chmelyk, T.; Marshman, D.; Shah, S. L. Nonlinear process monitoring using supervised locally linear embedding projection. Ind. Eng. Chem. Res. 2014, 53, 5205− 5216. (3) Stubbs, S.; Zhang, J.; Morris, J. Multiway interval partial least squares for batch process performance monitoring. Ind. Eng. Chem. Res. 2013, 52, 12399−12407. (4) Elshenawy, L. M.; Yin, S.; Naik, A. S.; Ding, S. X. Efficient recursive principal component analysis algorithms for process monitoring. Ind. Eng. Chem. Res. 2010, 49, 252−259. (5) Zhu, J. L.; Zhang, Z. D.; Pan, F. Fault identification for missing data systems with a dynamic Bayesian network approach. Inf. Control 2013, 42 (4), 499−505. (6) Xie, X.; Shi, H. B. Dynamic multimode process modeling and monitoring using adaptive Gaussian mixture models. Ind. Eng. Chem. Res. 2012, 51, 5497−5505. (7) Wang, F. L.; Tan, S.; Peng, J.; Chang, Y. Q. Process monitoring based on mode identification for multi-mode process with transitions. Chemom. Intell. Lab. 2012, 110, 144−155. (8) Ma, H. H.; Hu, Y.; Shi, H. B. A novel local neighborhood standardization strategy and its application in fault detection of multimode processes. Chemom. Intell. Lab. 2012, 118, 287−300. (9) Zhao, S. J.; Zhang, J.; Xu, Y. M. Monitoring of processes with multiple operating modes through multiple principle component analysis models. Ind. Eng. Chem. Res. 2004, 43, 7025−7035. 3863

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864

Article

Industrial & Engineering Chemistry Research (34) Cho, J. H.; Lee, J. M.; Choi, S. W.; Lee, D.; Lee, I. B. Fault identification for process monitoring using kernel principal component analysis. Chem. Eng. Sci. 2005, 60, 279−288. (35) Zhang, Y. W.; Zhang, L. J.; Lu, R. Q. Fault identification of nonlinear processes. Ind. Eng. Chem. Res. 2013, 52, 12072−12081. (36) Choi, S. W.; Lee, I. B. Multiblock PLS-based localized process diagnosis. J. Process Control 2005, 15, 295−306. (37) Kariwala, V.; Odiowei, P. E.; Cao, Y.; Chen, T. A branch and bound method for isolation of faulty variables through missing variable analysis. J. Process Control 2010, 20, 1198−1206. (38) Zhang, Y. W.; Li, S.; Hu, Z. Y.; Song, C. H. Dynamical process monitoring using dynamical hierarchical kernel partial least squares. Chemom. Intell. Lab. 2012, 118, 150−158.

3864

DOI: 10.1021/ie504730x Ind. Eng. Chem. Res. 2015, 54, 3855−3864