Exploiting Historical Databases to Design the Target Quality Profile for

May 13, 2013 - The feasibility of the proposed methodology is tested in a pharmaceutical product development case study, where the product is obtained...
0 downloads 0 Views 1MB Size
Article pubs.acs.org/IECR

Exploiting Historical Databases to Design the Target Quality Profile for a New Product Emanuele Tomba,† Pierantonio Facco,† Fabrizio Bezzo,† and Salvador García-Muñoz‡,* †

CAPE-LabComputer-Aided Process Engineering Laboratory, Dipartimento di Ingegneria Industriale, Università di Padova, via Marzolo 9, 35131 Padova PD, Italy ‡ Pfizer Worldwide R&D, 445 Eastern Point Road, Groton, Connecticut 06340, United States ABSTRACT: Latent variable regression model (LVRM) inversion has been demonstrated to be a valid tool to support the design of new products and processes and for process control. One of the basic assumptions of LVRM inversion is that the set of desired characteristics specified for the inversion must adhere to the covariance structure of the historical data used to build the model. When developing a new product, it is not unusual that the desired product characteristics are assigned or allowed to vary slightly so as to satisfy the end-customer or the downstream processing requirements. If these values do not obey the covariance structure described by the LVRM, a mismatch between the model estimates obtained from the inversion solution and the desired product properties is observed. In this paper we address the above-mentioned issue: starting from a set product characteristics that does not comply with the LVRM structure, we propose a strategy to assist the selection of the new product quality profile, which is most suitable for LVRM inversion. Two approaches exploiting the desired values for the product characteristics and the LVRM parameters are proposed to reconstruct a new product profile in order to minimize the mismatch between model estimates and desired product properties. The feasibility of the proposed methodology is tested in a pharmaceutical product development case study, where the product is obtained through high-shear wet granulation.

1. INTRODUCTION Stemming from the original work of Jaeckle and MacGregor,1,2 the use of the inverse of a latent variable regression model (LVRM) has been demonstrated to be a useful tool to support decision making processes in several applications, ranging from the design of operating conditions,1−4 to LVM-based predictive control,5−7 product optimization8 and product formulation design.9 In general, the objective of LVRM inversion is to estimate a set of input variables xNEW ensuring the achievement of a desired set of output variables yDES. In this paper, we consider the problem of the development of a new product, whose quality profile is characterized by a set yDES of properties, some of which may be assigned a priori, whereas some other may be allowed to vary slightly. The studies on LVRM inversion3−10 are built on an optimization framework aiming at minimizing the difference between the desired yDES and the one estimated through model inversion. This difference can either be forced through a hard equality constraint, or added as a weighted term to the objective function, which we will refer to as a “soft constraint”. Any difference resulting between the optimal solution (ŷNEW) and the desired one (assuming no other constraint is active) will be proportional to the orthogonal distance between the assigned values of yDES and its projection onto the model hyperplane of the latent variables. Thus, to obtain a solution close to the target, the desired set of quality specifications yDES should adhere to the covariance structure of the historical data used to build the underlying model.1 This, however, may become an issue when the end customer assigns specific values to some of the elements of yDES: in this case, it is necessary to iterate between the customer requirements and a feasible yDES complying with the covariance structure of the Y matrix used to train the model. © XXXX American Chemical Society

Clearly, there is a potential conflict between the importance assigned by the end user to the elements of yDES, and the ability of the model to represent them. If the desired values of yDES assigned by the user were set as hard rather than soft constraints (i.e., a weighted term in the objective function) in the optimization framework, then the possibility arises that the assigned values of yDES will not lie on the model hyperplane. Therefore, the optimization step may fail due to the possible infeasibility of the hard equality constraints, since it may be numerically impossible to obtain the desired values for the elements of yDES and simultaneously satisfy the covariance structure of Y described by the model loadings. In this paper we address the above-mentioned challenges by proposing a structured approach to guide the reassessment of a preliminarily defined target attribute profile yDES in order to obtain a new product quality profile with the same covariance structure as the matrix of historical response variables used to build the model. Given such a vector for yDES, it is then possible to use hard rather than soft constraints into the optimization formulation for the LVRM inversion, thus making the solution less computationally expensive. Two scenarios are taken into account: (i) the end-user wishes to specify some elements of yDES only, and (ii) the end-user wishes to specify all elements of yDES. The paper is organized as follows: in section 2 we briefly discuss the inversion of latent variable regression models and further elaborate on the mathematics of the problem addressed here; in section 3 we describe the proposed methods to select yDES; Received: November 28, 2012 Revised: April 2, 2013 Accepted: May 13, 2013

A

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

Assigning soft constraints in the objective function offers the advantage that the equality constraints specified in either xNEW or yDES do not necessarily lie in the model subspace for the optimizer to find a solution. There are two main downsides in using soft constraints in the model inversion optimization problem. The first one is the inherent need for the user to define weights for each of the terms of the objective function and for each of the elements in xNEW and yDES. Second, soft constraints add additional degrees of freedom to the optimizer, thus making the optimization exercise harder. Conversely, the establishment of hard equality constraints for the desired values of elements in xNEW and yDES reduces the number of iterations in the application of LVRM inversion procedure to support the design problem, since once the solution is found there is no discrepancy between the obtained and the desired values of the response y. Moreover, the problem is easier to solve for the optimizer from a numerical point of view. The downside of using hard constraints in the LVRM inversion problem is that the set of constraints given for xNEW and yDES must be coherent with the covariance structure of the original matrices used to build the model. In this context, a methodology would be needed to allow a product developer to set as many hard constraints for the elements of the desired product profile yDES as possible, while adhering to the historical data covariance structure described by the model. Two possible solutions to address this problem are proposed in the following section. 2.3. Residual Space in a Latent Variable Model. As highlighted in step 3, if the specified values for yDES are outside the model subspace (described by the Q loadings matrix), the solution obtained from the inversion problem will not be able to provide the desired product quality profile. A metric that can be used to quantify the distance of yDES from the model space is the squared prediction error (SPEyDES) which is defined as:13

the applicability of the approach is demonstrated in section 4 through a case study; eventually some final remarks conclude the work.

2. INVERSION OF LATENT VARIABLE REGRESSION MODELS 2.1. Latent Variable Models. Latent variable regression models (such as partial least-squares regression, PLS11) are databased models that relate regressor and response matrices by projecting them onto a latent space of a much smaller rank (the model space) than the original matrices. Given a data set X [I × N] of I different input conditions (raw materials or process parameters) in which N variables are measured (the regressor space) and a data set Y [I × M] in which the M product properties for the I products are collected (the response space), after the appropriate pretreatment is applied to the data (e.g., meancentering and scaling them), the LVRM represents the matrices in terms of their latent variables: X = TPT + E X

(1)

Y = TQT + E Y

(2)

T = XW*

(3)

The data in X are therefore projected onto the model latent space through the weight matrix W* [N × A], giving the matrix T [I × A] of the projections (called scores) of the original variables in the latent space of the model. The projections in T are then related to the original matrices X and Y through the loading matrices P [N × A] and Q [M × A]. EX [I × N] and EY [I × M] are the residual matrices accounting for the model mismatch in the reconstruction of X and Y. A represents the number of significant latent variables (LVs) chosen to build the model; namely, it corresponds to the dimension of the latent space. In general, the LVs can be interpreted as the main driving forces explaining the systematic variability in X that is related to the variability in Y. For a detailed theoretical discussion on PLS, the user is referred to the papers from Höskuldsson11 and Burnham et al.12 2.2. Latent Variable Model Inversion Procedure. The objective of model inversion is that of using a model to estimate a new regressor vector xNEW corresponding to a desired target quality attribute profile of response variables (yDES). In general, a LVRM inversion exercise goes through the following steps: 1. build the LVRM between the preprocessed X and Y matrices; 2. set the possible constraints or physical bounds for yDES and xNEW; 3. if multiple equality constraints are given for yDES (or for xNEW), verify that these values are feasible within the model space; 4. invert LVRM solving the appropriate inversion problem. The central exercise in the inversion step (step no. 4) is that of identifying a score vector (τ) that will result, during prediction and reconstruction of xNEW and ŷNEW, in the desired values for the elements of yDES as well as fulfilling any constraints identified either in the elements of the target attribute product profile (yDES) or in the elements of the regressor (xNEW). Mathematically, there are two different ways to reach a solution for τ that satisfies the equality constraints in xNEW and yDES: (i) to set soft constraints in the formulation of the objective function using ad-hoc weights; (ii) to set hard constraints for these equalities (see eqs 7 and 8 in Tomba et al.10).

M

SPE y DES =

̂ )2 ∑ (yiDES − yiDES i=1

(4)

where ŷDES is the ith element of the vector ŷDES, which represents i the projection of yDES through the LVRM. In principle, it is desired that this distance be zero so that the hard constraint yDES = ŷDES = qiτ for the ith variable specified for yDES can be i i established, where qi represents the ith row of the Q matrix.10 Figure 1 illustrates a geometrical interpretation of the SPE metric.

Figure 1. Representation of the geometrical interpretation of the SPE metric.

Indeed, when the target profile yDES (or a subset of it) of a new product is identified by the product development team, it may happen that this profile does not entirely match the covariance B

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

space where multiple values of τ can provide the same predicted value for yDES while providing multiple possible values for the i M − 1 free elements of yDES, depending on the correlation structure of Y. This artificial null space will change depending on what element of yDES is being assigned and can be explicitly determined by the linear system of equations ŷDES = Qτ represented by:

structure of the historical samples in Y. In this case, SPEyDES is greater than zero and the product quality set ŷNEW corresponding to the LVRM inversion solution (section 2.2) will differ from the specified yDES if a soft constraint is assigned, or in an infeasible problem if a hard constraint on yDES is assigned.10 For SPEyDES to be approximately zero, ŷDES should be used instead of yDES for the model inversion. Two alternatives can then be considered: (i) estimate ŷDES by directly projecting and reconstructing yDES through the model; (ii) force some of the elements of ŷDES to be equal to those assigned in yDES. In the first case, ŷDES can be very different from the desired product quality set yDES, despite being its best reconstruction on the model space (minimum distance from yDES). Even if this allows the use of the above-mentioned hard constraint on yDES for the model inversion, none of the product quality variables in ŷDES will have the same value as that originally specified in yDES. As a consequence, the product quality set ŷNEW obtained from the LVRM inversion may be significantly different from yDES. In the second case, since the interest is to satisfy the constraints for the desired values of the elements of yDES as closely as possible, some of the elements of ŷDES are imposed to be equal to those assigned in yDES, while estimating a proper value for the others (e.g., the conditional mean14,15). In the following section we will illustrate two different strategies for the selection of ŷDES based on the second alternative described above. The strategies differ according to the way in which the specifications for the elements in yDES (namely, the equality constraints) are managed. In the first approach, one of the elements of yDES is assigned at a time, while the other elements are calculated through the model using a direct model inversion approach in order to obtain ŷDES belonging to the model space. In the second approach, ŷDES is calculated by assigning the largest number of elements of yDES still leading to obtain a ŷDES within the model space; the number and type of the elements are selected according to an optimal criterion. In both approaches it is assumed that the values for all the M elements of yDES have been assigned by the user, but the methods can be easily applied even if S < M elements have been specified. Additional details on the models to use for the reconstruction of ŷDES and on the difference in reconstructing ŷDES with a PCA or a PLS model are given in Appendix A. It is proper to mention that all the discussion and methodology presented in this work can also be applied to select equality constraints for xNEW, since it is important for any fixed elements in this vector to exhibit the same covariance structure as in the X matrix used to build the model.

qji in (5) being the element on the ith row and jth column of the Q matrix, and τj the jth element of the column vector τ, with i = 1, ..., M and j = 1, ..., A. For illustrative purposes, consider four scenarios: (a) the variables of Y are completely independent (Y is full rank) and each variable is represented by one LV; (b) Y is full rank, but the variables are explained in groups by different components; (c) Y is rank deficient and correlation is captured in A components, where A < M; (d) Y is rank deficient and correlation is captured in A components, where A ≤ M, but A > rank(Y). The appearance of these induced null spaces is obvious in case (a), where the right-hand of eq 5 is reduced to the main diagonal elements (eq 6). If an equality constraint is applied to one of the elements of yDES, the rest of the equations define the available null space in which the user can pick any value of the free scores.

Scenario (b) would imply that there are as many independent directions of variability in Y as columns in it; however, each direction of change affects multiple variables. Consider the scenario of a three-dimensional space with three LVs such that the representative loadings across all LVs are as in eq 7. Given the below situation and a constraint placed on the first element of yDES, one could isolate τ1 from the first row and replace it into the third row to end with a system of two equations with four unknowns (τ2,τ3,ŷDES and ŷDES 2 3 ). This system represents the twodimensional induced null space where any solution chosen for τ2 or τ3 can be used to estimate τ1 and satisfy the equality condition for the first element of yDES while also keeping the vector ŷDES in the latent space.

3. USING THE HISTORICAL KNOWLEDGE TO SELECT THE PRODUCT TARGET ATTRIBUTE PROFILE 3.1. Theoretical Considerations. Let us consider the vector of the desired product profile yDES autoscaled according to the mean and the standard deviation of the columns of the historical dataset of product properties (Y) used to build the model. Although the method outlined subsequently in this section (Method 1) is proposed to provide an estimate of the M − 1 free elements of yDES, given an equality constraint enforced on each ith element yDES of yDES, it is reasonable to think that such an i estimate is uniquely defined only if the M − 1 free elements of yDES have a strong correlation with the ith that is being assigned. This situation would imply that the effective rank of Y is 1 (only one LV is necessary to represent Y) and hence the reconstruction of ŷDES based on one element is reasonable. However, if the effective rank of Y is of higher order, assigning the value of the ith element of yDES would create an induced null

y1̂DES = q11τ1 y2DES ̂ =

+ q13τ3

q22τ2

y3DES ̂ = q31τ1

(7)

+ q33τ3

In scenario (c), the number of LVs is lower than the number of variables in Y and not all variables are represented in all latent spaces (e.g., equation system 8). Given the situation below, a hard constraint enforced on the first element will assign the value of the fourth and will result in a one-dimensional induced null C

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

space where the user can choose any value of τ1 (which in turn defines the values of the second and third element of ŷDES).

y1̂DES =

null space is presented due to the obvious reaction toward predicting the majority of the ŷDES vectors based on one element (in the worst of the cases, these predictions will be nothing but the unconditional means). In the following sections the algorithms on which the proposed methods are based are described. The first method (Method 1) offers a simple procedure to appreciate the trade-offs and understand the effects that assigning one element of the quality profile versus another one has on the product quality profile. Differently, the second method (Method 2) provides a procedure to find an optimal product quality profile as a trade-off between the historical data covariance structure and the userdefined values of the quality variables. 3.2. Method 1: Assigning One Quality Variable at a Time. In the first method, the selected product quality set ŷDES is = yDES calculated imposing that, for the ith element of ŷDES, ŷDES i i , while the values of the other elements in ŷDES are assumed to be missing. Several approaches have been proposed to deal with missing data when using multivariate statistical techniques like PCA or PLS.14,15 In all these contributions the objective was to use the model to estimate the scores corresponding to a new sample presented to the model characterized by missing measurements. In general this is concerned with multivariate statistical process control or process modeling applications, where missing measurements are in the input data (namely in the regressor side, if a PLS model is considered). Differently, in this application, in order to reconstruct the product quality profile we are considering as missing the measurements referred to the response set ŷDES, thus using the model inversion to reconstruct them on the basis of the available fixed values. The proposed method exploits the submodel constituted by the PLS Q loadings to reconstruct the new product target profile ŷDES through a direct inversion of the PLS model.1 For each variable i, the proposed procedure aims at estimating the scores τ̂(i) of ŷDES(i) on the basis of the ith element (yDES ), i which is assigned, by projecting it back to the model plane through a direct inversion of the model:

q12τ2

y2DES ̂ = q21τ1 (8)

y3DES ̂ = q31τ1 y4DES ̂ =

q42τ2

Finally, consider scenario (d). In this case the number of LVs is lower than the number of variables in Y, however it is greater than the effective rank of Y. This is a typical situation when building PLS models between a regressor data set X and a response data set Y, and the rank of the X matrix is larger than the rank of the Y matrix. In these cases, if the PLS model is built with A = rank(X) LVs, there will be k = A − rank(Y) directions in the latent space which have little (theoretically none) influence on the Y space, but that are needed to adequately describe the variability in the regressor space. These latent directions form the PLS null space1 which gives additional degrees of freedom in the estimation of the score vector τ, in addition to the induced null spaces which can be generated in situations similar to those described above. For example, assume the same case as in eq 8 in which the Y space is four-dimensional and rank(Y) = 2, but consider that three LVs were chosen to build the PLS model, as they were needed to represent adequately the X space (eq 9). In this case, a hard constraint imposed on the first element will result in the estimation of τ2, but the user can choose any value for τ1 and τ3. The system represents a two-dimensional null space, which however is formed by the combination of a one-dimensional induced null space and the PLS null space due to the differences in the ranks of X and Y.

y1̂DES =

q12τ2

y2DES ̂ = q21τ1 y3DES ̂ = y4DES ̂ =

T

q31τ1 q42τ2

+

q33τ3

+

q43τ3

T

τ (i) = (Q(i) Q(i))−1Q(i) yiDES

(9)

(10)

where Q(i) is the submatrix of the loadings Q in which only the row of Q corresponding to the element assigned in ŷDES(i) is considered. Namely, according to the procedure in Method 1, Q(i) is a row vector of [1 × A] dimensions. This method is applied to all the M variables specified for yDES, T giving then in output a matrix Ŷ DES , whose columns are the M different reconstructions for the product quality ŷDES(i) obtained assigning in turn each element i. The procedure goes through the following steps: 1. Assign the value of the ith element of ŷDES(i) in order for it to be equal to the corresponding element in yDES (ŷDES(i) = yDES i i ), DES(i) as missing data. considering the other elements in ŷ 2. Estimate the score vector τ̂(i) corresponding to ŷDES(i) through the direct model inversion in eq 10. 3. Reconstruct ŷDES(i) from τ̂(i) and the Q loadings of the PLS

Note that the case presented in scenario (d) (eq 9) could only occur when ŷDES is reconstructed through the PLS Q loadings, differently from the situations described in the previous scenarios which are valid also in the cases in which ŷDES is reconstructed based on the PCA loadings (Appendix A). In practice, it is common to have only desirable ranges for some quality attributes of the product while having specific assigned conditions for other quality descriptors. In this paper we suggest to handle the free elements of yDES as missing data for the sake of simplicity and to expedite the decision of a vector of quality properties that will result in the target overall performance for a new product. Other researchers have already presented and studied the behavior of the analytical estimators for each of the missing data methods and have already discussed whether the expected predicted value for the missing elements are close to the unconditional mean, the conditional mean, the least Mahalanobis distance, or the least squared prediction error.15,14 From the perspective of this application, we use similar approaches as a shortcut to the construction of potential vectors representing the target quality profile for a product. The discussion on the induced

model and store it in the matrix Ŷ DES : T

ŷ DES(i) = Qτ (̂ i)

(11)

4. Assign the next desired product property, until all the M properties in yDES have been considered. D

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

The element in yNEW with the highest contribution to SPEyNEW is selected and its constraint is removed. An optimization problem is then solved to update the design set yNEW:

Thus, from the different suggestions for the new product quality set ŷDES(i), the user can have an idea of the general impact that a change in each of the original variables will have to a change in the covariance of the desired quality profile so that it better resembles that of the historical data. The user would then select the candidate solution that better adheres to the end-customer needs. In some cases, reconstructing ŷDES(i) through the direct inversion of the model (eq 10) may lead to an unfeasible solution clashing against the physical limits some of the product quality variables may have. If that occurs, additional flexibility to the procedure described for Method 1 may be added by substituting the step 2 and 3 of the above procedure (i.e., eq 10 and eq 11) with the optimization problem described in the next section (solving eq 13 instead). That will be better clarified when discussing the case study results in section 4.2. 3.3. Method 2: Assigning More than One Quality Variable. The second proposed method is based on an approach that is somehow dual to Method 1. The method starts from the originally defined product quality vector yDES, and uses an iterative procedure to progressively find the assigned variables in yDES that contribute the most to the SPEyDES value, and to remove the corresponding equality constraints until a new estimated desired product quality yNEW is obtained that is as close as possible to the model space. Let us assume that the variables in yDES are completely (or for the most part) assigned (all equality constraints) and let us define a (small) threshold ε setting the acceptability limit for the value of SPEyNEW. To ensure that the new product quality profile in yNEW belongs to the model space, the value defined for ε should be significantly lower than the SPE values of the historical data used to build the model. A general schematic of the procedure is reported in Figure 2.

A

min(y NEW − Qτ̂)T Γ(y NEW − Qτ̂) + g · ∑ τ̂

ymNEW = ymDES

lbk < ykNEW < ubk

where bj represents the value of the inequality constraint assigned to the jth element of yNEW to account for the specification limit possibly specified for it, while lbk and ubk are respectively the lower and the upper physical boundaries for the k-th element of yNEW. Note that physical boundaries represent the variable domain in the optimization procedure. Conversely, inequality constraints represent the regions inside which the product properties are desired to fall, and are then subsets of the physical bounds. In the objective function, Γ is a diagonal matrix of weights defined by the user according to the relative importance he/she wants to give to the product quality variables (usually a good choice for the elements of Γ is represented by the explained variance per variable (Ry,pv2), calculated on the historical samples); g is a weight for the second term of the objective function where τ̂a is the ath element of τ̂ and s2a is the variance of the ath column of matrix T. From the optimization problem, a new set yNEW representing the new estimated target quality profile is obtained, which is again assessed against threshold ε. The procedure of progressively removing the equality constraints initially specified for the elements of yDES is repeated until SPEyNEW is found below the given threshold ε. Then, a new product quality set yNEW (= ŷDES) is obtained, which represents the best compromise between the set of the target quality profile initially defined by the user (yDES) and the model requirements. Conversely, if after removing all the equality constraints in yDES, SPEyNEW > ε still holds, then the problem is unfeasible and a revision on the constraints specified in eq 13 should be considered. The second term of the objective function in eq 13 represents the Hotelling’s T2 of the solution. This term is added to the objective function to consider the cases in which an induced null space is present due to the structure of the loadings Q. In these cases, the null space can be exploited to move the solution along it, in order to find a new set yNEW that belongs to the model space, but at the same time is inside (or close to) the range of the properties of the historical products (thus avoiding extrapolated solutions). To this end, we have g ≠ 0 and reliably g ≪ 1 in eq 13, in order to give more importance in the objective function to SPEyNEW rather than to the Hotelling’s T2 of the solution. In the case the null space is due to the differences in the ranks of the X and Y matrices (namely, the Q loading matrix is redundant), g can be set to zero, since the Hotelling’s T2 of the solution is considered in the subsequent PLS inversion problem, for the estimation of the regressors which provide the desired responses yNEW.10 Note also that the problem in eq 13 can be solved by introducing slack variables to soften the hard equality constraints assigned to the elements of yNEW.16

(12)

where f m is the residual for the mth element of y

(13)

yjNEW < bj

After setting yNEW = yDES, at each iteration the procedure verifies if the desired set yNEW belongs to the model space, by calculating SPEyNEW and comparing it to ε. If this is not verified, the contributions to SPEyNEW are calculated according to NEW

sa2

s.t.

Figure 2. Schematic of the algorithm implemented for Method 2.

SPECONT,m = fm ·fm

a=1

τa2̂

. E

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

In Table 3 the diagnostics for the PLS model between the data sets X and Y are reported in terms of explained variance and cumulative explained variance per LV both in model design (R2X, R2Y, R2XCUM, R2YCUM) and in cross-validation (P2, Q2, P2CUM, Q2CUM) for all the 7 LVs which can be estimated from the analysis. Note that the cross-validation diagnostics are reported not only for the response variables in Y (Q2, Q2CUM), but also for the regressors in X (P2, P2CUM).10 This is due to the fact that since the objective of model inversion is the estimation of the regressors starting from the response variables, the model must ensure an adequate representation not only of the Y, but also of the X space. For this reason, a comprehensive strategy for the selection of the number of LVs to design a LVRM for inversion should also consider a metric to diagnose the performances of the model in representing the X data.10 From Table 3, both R2X and P2 show a significant decrease in the amount of explained variance after the fourth LV. From the values of R2Y and especially of Q2 it can be seen that the amount of variance for Y explained by the LVs after the third does not seem to be significant. By considering the diagnostics for Y three LVs would then be sufficient to build the LVRM. However, R2X and P2 values show that the latent space for X is four dimensional. Accordingly, four LVs were selected for the model design. This means that there is a one-dimensional null space in the latent space of the model, which has to be considered in the inversion of the PLS model. This case study can thus potentially represent an example of Scenario (d) described in Section 3.1, as the effective rank of the Y space is lower than both the dimension of Y and the number of considered LVs. The presence of the PLS null space and of the possible induced null spaces generated in the application of the proposed procedures should then be considered in the reconstruction of ŷDES. In Figure 3 the loadings q (i.e., the columns of the Q matrix in eq 2) of the PLS model on the four considered LVs are reported. The loadings have been standardized by R2y,pv to get a better contrast and to allow for a “cross-component” analysis. The analysis of these plots allows assessing the main driving forces, which explain the variability in the Y data most related to X. These driving forces will be exploited by the proposed methodologies for the selection of the new target product quality profiles ŷDES. For the sake of conciseness, a detailed description and interpretation of the loading plots of Figure 3 is reported in Appendix B. Figure 4 reports the values of the SPE versus the values of the Hotelling’s T2 for the historical samples in Y (black dots), together with the relevant 95% (red short-dashed lines) and 99% (blue dashed lines) confidence limits. The squared dot (□) represents the values of T2yDES and SPEyDES. As can be seen, even if T2yDES is inside the 95% (= 10.97) confidence limit, meaning that the values specified in yDES are not far from the mean of the historical values, SPEyDES is above the 95% relevant limit (= 5.75), meaning that the historical correlation structure is not valid for yDES. Thus, the model is not really appropriate in representing yDES due to the high model mismatch, and it is not recommended to perform the inversion with yDES. The methods previously described can then be applied to exploit the historical covariance structure of the data in order to give suggestions on possible new product properties sets ŷDES, which can be feasibly used for the model inversion.

Finally, note that the solution of Method 2 coincides with that of Method 1 (namely, the direct inversion of the model) if an equality constraint for only one element of yDES is assigned in eq 13, g = 0 and no inequality constraints or boundaries are present in the inversion problem.

4. CASE STUDY: DEFINING THE TARGET PROFILE FOR A WET GRANULATED PRODUCT The proposed methodologies are applied to a particle engineering problem to design the quality profile of a wet-granulated product. The ultimate objective is that of using the historical knowledge available for different products to estimate the best input raw material properties to obtain granules with the desired characteristics in output from a wet granulation process. Before performing the LVRM inversion, it must be ensured that the model is able to adequately describe the desired product quality yDES, which is assigned by the user (all equality constraints on the elements of yDES). 4.1. Preliminary Data Analysis. In this example the data reported in the work of Vemavarapu et al.17 have been used. In the original work, the authors studied the influence of the raw material properties on the performances of a product under the assumption that it is obtained using a high-shear wet granulation process. Twenty-four different active pharmaceutical ingredients (APIs) were considered and processed under the same granulation conditions to study the influence of the raw material properties only. Each API was characterized by measuring seven physical properties, while the wet-granulated products were characterized by measuring seven different product characteristics. Table 1 reports the input material physical properties and Table 1. Measured Properties for the APIs (X) and for the Wet-Granulated Products (Y) API properties 1. 2. 3. 4. 5. 6. 7.

product properties

H2O solubility (mg/mL) contact angle (deg) H2O holding capacity (wt % gain) D[3,2] (μm) D90/D10 surface area (m2/g) pore volume (cm3/g)

1. 2. 3. 4. 5. 6. 7.

loss on drying (LOD) (%) oversize (%) ΔFlodex (mm) ΔCompactibility (KPa/MPa) D[3,2] (μm) D90/D10 growth ratio

the variables measured for the characterization of the products in each experimental run. Data are collected in two data sets: a data set X [24 × 7] including the properties of 24 APIs tested and a data set Y [24 × 7] of the properties of the corresponding products. Additional details on the variables and on the rationale to select them are reported in the original paper17 together with information on the measurement procedures and on the process conditions. For the purpose of this study, we assume that it is required to manufacture a granulated product with the characteristics yDES reported in Table 2. Note that these data do not correspond to a real product, but in general they represent an example of the combination of desirable properties for a wet granule.

Table 2. Desired Product Properties yDES for a Wet-Granulated Product

yDES

LOD (%)

oversize (%)

ΔFlodex (mm)

ΔCompactibility (KPa/MPa)

D[3,2] (μm)

D90/D10

growth ratio

3

0

20

5

400

2.5

8

F

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

Table 3. Diagnostics of the PLS Model between X and Y LV

R2X

R2XCUM

P2

P2CUM

R2Y

R2YCUM

Q2

Q2CUM

1 2 3 4 5

40.84 18.69 15.99 17.45 2.49

40.84 59.54 75.53 92.98 95.47

35.71 18.26 15.38 22.03 3.66

35.71 53.97 69.35 91.38 95.04

32.34 23.34 8.30 1.58 2.77

32.35 55.69 63.99 65.57 68.34

24.68 24.31 11.72 2.30 2.88

24.68 48.99 60.71 63.02 65.90

coincide with the original values of yDES in Table 2. For the sake of comparison, the last row reports the set ŷDES obtained by directly projecting yDES onto the model space. In the two last columns of the table, the values of the T2 and of the SPE are also reported for each calculated set. As it can be seen, for all the suggested new product quality sets ŷDES(i), the calculated values for the assigned elements of ŷDES(i) are equal to the corresponding equality constraint yDES (reported between parentheses). Moreover, from the values of the SPE, the suggested variable combinations result to be all lying onto the model space. It is interesting to note that for each of the calculated sets ŷDES(i), the covariance structure of the historical samples in Y (which is basically represented by the plots in Figure 3) is optimally used in the proposed approach to estimate the new sets ŷDES(i), even if only one of the seven variables is constrained in each case, as will be discussed below. Prior to such discussion, the reader is referred to Table 5, where the mean and standard deviation values of the variables in Y are provided for a clear comparison with the results of Table 4. Let us consider the case in which the percentage of oversize granules was assigned to 0 (ŷDES(2)), which is quite different from the historical mean in Table 5. From Figure 3, it can be seen that this variable is related to D[3,2] and inversely related to D90/D10 on LV1; furthermore, it scarcely affects the other LVs. In fact, given that in ŷDES(2) the oversize percentage is lower than the historical mean, the suggested set is characterized by a low value of D[3,2] and a high value of D90/D10 (even if within one standard deviation as from the values in Table 5). The other variables are more similar to their unconditional mean (Table 5), because they do not show a strong relation with the oversize percentage (Figure 3). This kind of analysis can be repeated for the other sets in Table 4. In particular, in the case of ŷDES(4), a value of Δcompactibility out of the range of the historical product data was assigned. In this case, it can be observed that in order to obtain such a value of ΔCompactibility, the product needs to exhibit a low and broad PSD, and higher-than-mean oversize percentage and ΔFlodex. Furthermore, a negative (and physically meaningless) value of loss on drying (LOD) is obtained. This can be explained because Method 1 does not allow for the inclusion of physical boundaries for the variables and, as a consequence, unfeasible design outputs may be achieved sometimes (especially when extreme values are desired for some other variables). To account for this issue, as anticipated in the end of section 3.2, the problem has been solved using the procedure described for Method 1 by substituting the direct model inversion to reconstruct ŷDES(4) (eq 10 and eq 11) with the optimization framework formalized in eq 13, which allows binding the variables to physically sound values. In particular, the assigned value for ΔCompactibility (ΔCompactibility = 5) has been set as a hard constraint for yNEW in eq 13. Using an optimization framework in this case has also another advantage. By analyzing the loading plots in Figure 3, it can be seen that ΔCompactibility is mainly described by the first two LVs (Figure 3), while it does not contribute significantly on the

Figure 3. Loadings Q of the PLS model on the X and Y data sets.

Figure 4. Plot of SPE versus T2 values for the historical products in Y (●) and the new desired product quality set yDES (□). The lines represent respectively the 95% (short-dashed red) and 99% (dashed blue) confidence limits.

4.2. Method 1: Results. In Table 4 the results obtained after applying Method 1 using the proposed direct inversion approach are reported. Each row represents a vector ŷDES(i) suggested by the model, obtained fixing one at a time each one of the seven properties of yDES in Table 2. The assigned properties for each of the seven ŷDES(i) sets are those bold in brackets in Table 4, and G

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

Table 4. Method 1. New sets of product properties ŷDES(i) for a wet-granulated product suggested on the basis of the historical data modela LOD (%)

oversize (%)

ΔFlodex (mm)

ΔCompactibility (KPa/MPa)

D[3,2] (μm)

D90/D10

growth ratio

T2

SPE

3 (3) 2.6 1.4 −2.1 1.1 1.8 1.8 2.3

13.5 0 (0) 22.9 34.7 41.1 38.2 21.1 33.3

10.2 11.2 20 (20) 21.6 9.4 8.4 6.3 16.7

−0.58 0.22 1.47 5.00 (5) 0.28 −0.49 0.03 1.94

137 68.4 147 111 400 (400) 411 166 90.4

8.7 15.0 12.3 22.6 4.5 2.5 (2.5) 9.6 15.7

12.2 11.7 30.4 35.4 17.5 15.1 8.0 (8) 19.8

0.90 1.00 1.33 7.64 0.44 0.54 0.35 4.01

∼0 0 0 ∼0 ∼0 ∼0 ∼0 0

DES(1)

ŷ ŷDES(2) ŷDES(3) ŷDES(4) ŷDES(5) ŷDES(6) ŷDES(7) ŷDES a

The bold values represent the assigned element in each set (the original values of yDES are reported in brackets). The proposed approach has been used to calculate the others. T2 and SPE statistics are also given.

Table 5. Mean and standard deviation (st. dev.) values for the properties of the wet granulated products in the historical database Y LOD (%) mean st. dev.

oversize (%)

1.55 1.41

ΔFlodex (mm)

24.5 29.1

ΔCompactibility (KPa/MPa)

10.6 9.6

D[3,2] (μm)

0.5 1.9

D90/D10

181 410

growth ratio

9.8 13.1

15.8 15.1

Table 6. Method 1. New sets of product properties calculated using the optimization framework in eq 13 instead of the direct model inversion in the case ΔCompactibility of the granulated product is assigned (ŷDES(4)), considering or not the soft constraints (SC) on T2.a 2

No SC on T SC on T2 a

LOD (%)

oversize (%)

ΔFlodex (mm)

ΔCompactibility (KPa/MPa)

D[3,2] (μm)

D90/D10

growth ratio

T2

SPE

0 0

0 0

12.4 28.5

5 (5) 5 (5)

11 22

36.7 31.5

7.1 37.5

11.8 8.4

0 1.8 × 10−6

The bold values in brackets represent the values assigned to ΔCompactibility. T2 and SPE statistics are also given.

Table 7. Method 2. New set of product properties for a wet-granulated product suggested on the basis of the historical data model.a 2

No SC on T SC on T2

LOD (%)

oversize (%)

#

#

3 0.1

0 16.1

ΔFlodex (mm)

ΔCompactibility (KPa/MPa)

#

20 17.6

5.4 3

D[3,2] (μm) 3 75

D90/D10 42.7 20.5

growth ratio #

8 25.1

T2

SPE

49.9 2.1

0 0

The values with the # superscript represent the variable values equal to the ones in yDES. T2 and SPE statistics are also given. SC stands for “soft constraint”. a

solution obtained without considering the soft constraint on T2 is above the 95% historical confidence limit represented in Figure 3. Including the soft constraint on the Hotelling’s T2 aids the optimizer to find a solution closer to the origin of the model space (and thus to the historical product profiles), and approximately lying on the model space (SPEyDES ≈ 10−6, second row of Table 6). Two important considerations on the solution with the soft constraint on T2 should be remarked. First, it can be seen that the addition of the soft constraint into the objective function penalizes the minimization of SPEyDES, which is slightly different from zero. This is due to the fact that in this case the induced null space is actually a pseudonull space,3 that is, moving the solution along it (while keeping fixed ΔCompactibility) does not ensure that a solution belonging to the model space is obtained, and slight deviations may therefore occur (SPEyDES ≠ 0). As can be seen, the decrease in T2 due to the presence of the soft constraint is limited, as the boundaries specified for some of the variables do not allow moving the solution further toward the origin of the historical data model plane. Second, note that in general the largest differences between the two solutions reported in Table 6 are mainly due to the growth ratio and to ΔFlodex. As can be seen from the first two plots of Figure 3, these variables scarcely affect the first two LVs of the model, which instead are those that better describe ΔCompactibility, while they are the most significant on the third LV.

other LVs. This means that by assigning this property, it is possible to isolate the score on LV1 or on LV2, and to replace it for the reconstruction of the other variables (see eq 7 above). The scores on the other LVs can be selected independently, thus generating an induced null space, which intersects with the PLS model null space generated by the redundancy in the Q loadings due to the differences in the ranks of the X and Y matrices. The soft constraint on the Hotelling’s T2 proposed in eq 13 allows for the optimizer to move the solution along this null space toward the origin of the latent space so as to find a solution that satisfies the given constraints in the range of the historical data. In Table 6 the solution obtained applying the procedure described for Method 1 (Section 3.2) by substituting the direct model inversion with the optimization framework in eq 13 is shown, when the equality constraint is set for ΔCompactibility (ŷDES(4)). Physical boundaries were specified for LOD and oversize percentage which were assigned to vary between 0 and 100, while D90/D10 and the growth ratio were set to be greater than 1. The solution is presented for the cases in which the soft constraint (SC) on the Hotelling’s T2 is considered (g = 10−4) or not (g = 0) in the optimization formulation. As can be seen, both of the obtained solutions satisfy the equality constraint on ΔCompactibility but are completely different from ŷDES(4) in Table 4, in particular with respect to LOD and oversize percentage, which are found to be at their relevant boundaries. Note that the Hotelling’s T2 statistic for the H

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

Table 8. Method 2. Estimations of the Product Profiles yNEW Obtained at Each Iteration of the Procedure Described in section 3.3 LOD (%) NEW(1)

y yNEW(2) yNEW(3) yNEW(4)

#

3 3# 3# 3#

oversize (%) #

0 0# 0# 0#

ΔFlodex (mm)

ΔCompactibility (KPa/MPa)

#

20 20# 20# 20#

#

5 3 3 5.4

D[3,2] (μm) #

400 400# 29 3

D90/D10 #

2.5 2.5# 2.5# 42.7

growth ratio #

8 8# 8# 8#

T2

SPE

8.8 3.0 6.7 49.9

7.3 4.4 2.9 0

completely new product quality set yNEW, in which only the inequality constraint on ΔCompactibility is satisfied. To clarify the iterative procedure on which Method 2 is based, in Table 8 the estimations of the product profiles per iteration are reported together with the corresponding values of T2 and SPE for the first case presented in Table 7, in which the soft constraint on T2 is not considered. In Figure 5 the plots of the contributions of each variable to SPE (SPECONT) are shown for each iteration. In Table 8, the variables for which the corresponding constraint is removed at each iteration are underlined, while those which the procedure keeps equal to the ones specified in the original desired set yDES are indicated with the # superscript. From the combined analysis of the plots of Figure 5 and the results in Table 8, a deeper insight on the algorithmic procedure can be obtained. From the projection at the first iteration of yNEW(1) = yDES onto the Q loadings of the PLS model, the contributions SPE(1) CONT are calculated (Figure 5a). It can be observed that ΔCompactibility is the property most contributing to SPE. Therefore, the corresponding equality constraint is removed, and the optimization problem in eq 13 is solved to find the new set yNEW (yNEW(2) in Table 8), in which the value of ΔCompactibility satisfies the specified inequality constraint for it. It results that SPEyNEW(2) = 4.4, which is still above the threshold ε. In Figure 5b the contributions to SPEyNEW(2) are reported (SPE(2) CONT). It can be noted that the highest SPE(2) CONT is still due to ΔCompactibility. However, the value of ΔCompactibility, whose equality constraint has already been removed, is kept fixed by the inequality constraint. Given that this inequality constraint cannot be relaxed due to the product requirements, in the next iteration, the optimization problem in eq 13 is solved removing the equality constraints on both ΔCompactibility and D[3,2], which is the variable with the second highest contribution to SPEyNEW(2). The new set yNEW(3) (Table 8) presents SPEyNEW(3) = 2.9, still (3) above ε; SPECONT (Figure 5c) highlights that the model mismatch is mainly due to D90/D10. Accordingly, the optimization problem is solved again, removing the constraint on this variable, too. Finally, solution yNEW(4) exhibits SPEyNEW(4) ≈ 0 and thus it represents the optimal solution (Table 7; case without soft constraint on T2). By analyzing the iterative solution through Figure 5 and the values of SPE at each iteration, it can be noted that in this case a solution inside the 95% SPE confidence limit (Figure 4) is obtained by simply removing the equality constraint on ΔCompactibility (yNEW(2) in Table 8). The procedure could therefore have stopped at the first iteration. However, to allow for the inclusion of the hard constraints for ŷDES in the LVRM inversion procedures (namely ŷDES = Qτ), the methods were asked to find a solution very close to the model space (ε → 0). Note that in general Method 1 and Method 2 return different information. The first method provides a general perspective on how the assignment of a product specification affects the other variables according to the historical knowledge. The second method provides an optimal solution as a trade-off between the desired product quality variables and the need to fulfill the

This means that they are the most important variables on the induced null space, namely those which undergo the highest variations by moving the solution along it. 4.3. Method 2: Results. In Table 7 the results obtained from the application of Method 2 (eq 13) are shown in terms of suggested new product quality sets yNEW. The procedure has been applied specifying an additional constraint for ΔCompactibility, which was asked to be greater than 3 kPa/MPa (which represents a limit condition considering the available historical data set), while LOD and oversize percentage were assigned to vary between 0 and 100, and D90/D10 and the growth ratio were set to be greater than 1 (physical boundaries). Furthermore, with reference to section 3.3, ε was set to 10−6 in order to obtain a new product quality set yNEW that feasibly lies onto the model space. Results are reported for two cases: in the first case the soft constraint on the Hotelling’s T2 in eq 13 was not considered (i.e., g = 0), while in the second it was included (g = 10−4). For both cases in Table 7, the values of yNEW (which according to the procedure can be kept equal to the ones specified in the original desired set yDES) are indicated with the hashmark (#) superscript. In the last two columns, the values of the T2 and SPE statistics for yNEW are reported, too. As can be seen from the first row of Table 7 (no soft constraint on T2), four of the seven product properties are maintained equal to the ones specified in yDES, namely LOD, the percentage of oversize granules, ΔFlodex, and the growth ratio, while the solution yNEW can be considered lying onto the model space (SPEyNEW ≈ 0). This means that the user can at most keep these values equal to the corresponding ones in yDES to obtain a set yNEW which belongs to the model space. In other words, Table 7 shows that to obtain a product that has the four indicated values assigned, the values of the other properties have to be those reported in the first row of Table 7 for the product to adhere to the historical product property covariance structure. Indeed yNEW represents the best trade-off between the original target quality profile yDES and the model requirements. Note that the T2 value (49.9) indicates that an extrapolated solution above the T2 confidence limits is eventually obtained (Figure 4). In general, this may not be a problem since, when applying LVRM inversion, this allows moving the solution along the null space in order to limit extrapolations in the input variable space, even if the desired product profile is out of the range of the historical products.10 The procedure described in Method 2 considering a soft constraint in T2 was also applied. Note that, as discussed in section 3.1, an induced null space (depending on the subset of yDES being assigned) may be generated in this case, too. The second row of Table 7 shows the solution obtained considering the soft constraint on T2 in eq 13. As can be seen, none of the variables keeps its value equal to those specified in yDES. Moreover, note that the value of ΔCompactibility hits the specified inequality constraint (3 kPa/MPa), while the T2 of the solution is significantly decreased, compared to the previous case (2.1 versus 49.9). In practice, in order to keep the solution onto the model hyperplane by limiting the corresponding Hotelling’s T2 and by satisfying the provided constraint, the procedure has to remove all the equality constraints on yDES by estimating a I

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

Figure 5. Method 2. Contribution plots obtained during the procedure iterations to calculate yNEW: (a) 1st iteration; (b) 2nd iteration; (c) 3rd iteration.

the loadings from a PCA model on Y can be used as an alternative. Two different procedures have been presented. In the first one, product quality features are assigned one at a time, and an approach based on model inversion of the model is used to estimate the other variables. In the second procedure, an algorithm is used to iteratively select and remove the constraints of the variables that are found to be most responsible for the model mismatch. An optimization problem is solved to calculate their values according to the loadings of the Y space. We demonstrated how in both these methods an induced null space may be generated depending on the assigned variables, in which different solutions, all satisfying the given constraints can be found. The proposed approaches have been applied to a real casestudy concerning a high-shear wet granulation process. The methods have shown their effectiveness in assessing the feasibility of a new product, and in suggesting a reliable and physically sound product quality profile. Note that the methodology relies on empirical data only and, accordingly, the feasibility of a product design is exclusively defined on the basis of the previous knowledge. However, we believe that the methodology may prove to be a useful tool not only to understand if an empirical model is appropriate for inversion, but also to assess the potential quality profiles of completely new products since the very first developmental stages. Future work may involve the investigation of using a “slack variable” approach instead of a weighted term in the objective function to address the stiffness induced by adding hard equality constraints to the problem; such an approach would need to

relationships between product variables obtained from the historical data and represented by the Q loadings of the PLS model. However, it must be noted that Method 1 is more susceptible to uncertainty due to the calculation of the inverse of T the Q(i) Q(i) matrix. Depending on the variable which is assigned DES or on the number of LVs selected to build the PLS for ŷ T model, matrix Q(i) Q(i) may be ill-conditioned due to variable correlation. The ill-conditioning and the presence of noise in the measurements, which masks the effective rank of the Y space, could result in poor estimations of the solution scores using eq 10. For this reason, as shown in section 4.2, Method 1 can be applied by substituting the direct inversion of the model with an optimization framework, which allows moving the solution along the (induced) null space to find the minimum Mahalanobis distance (i.e., with minimum Hotelling’s T2) solution.3,4 Finally consider that it may occur that none of the Methods returns a solution if the design of a product with very different properties from the ones in historical data set is required.

5. CONCLUSIONS AND FUTURE WORK Latent variable regression model (LVRM) inversion has long been demonstrated to be an effective tool to support product and process design. In this paper we proposed a methodology to exploit the covariance structure of the historical data used to build the LVRM so as to guide the selection of a target attribute profile (yDES) with the same covariance structure as the matrix Y of the responses. The method exploits the Q loadings of the PLS model to suggest possible sets of target profiles as trade-offs between the model requirements and the desired product characteristics. Depending on the structure of the latent space of Y, J

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

are also those which have had higher growth ratio and LOD. This is somehow expected since one of the objectives of granulation is that of increasing the flow properties of the processed materials, by enlarging their size. Moreover, moisture (inversely related to LOD) can act as a binding, making the granules more cohesive and thus less flowable.19 The 4th LV seems to be less significant than the other ones in explaining the systematic variability of Y, as reported in Table 3 (1.58%). This can be also noticed from the low value of the loadings in the last plot of Figure 3.

investigate the theoretical considerations of having a relaxed constraint with a threshold and any potential null-spaces acting together. Additional avenues for future work are to incorporate the uncertainty in the model parameters into the objective function (or the constraints).



APPENDIX A. ON THE RECONSTRUCTION OF YDES As seen above, to reconstruct ŷDES a model describing the covariance structure of the historical data in Y is needed. In general, the covariance structure of the historical data can be optimally described by a principal component analysis (PCA18) on the historical product data set Y, apart from the Q loadings of the PLS model between X and Y (eq 2). Thus ŷDES could feasibly be reconstructed either exploiting the PCA or the PLS model loadings. However, the covariance structure described by the PCA model on Y could potentially not be as the one described by the Q loadings of the PLS model. Considering the different objectives of the two techniques, reconstructing ŷDES through the PCA model forces the target product quality yDES to respect the correlation between the quality variables of the historical products in Y. Differently, the Q loadings of the PLS model are calculated so that the LVs of Y are rotated to maximize their covariance with the LVs of X. Thus the systematic variability of Y which is related to that in X is considered for the reconstruction of ŷDES. Given that the objective of this paper is that of proposing a procedure which allows an imposition of the hard constraint for yDES in the optimization formulation of the model inversion problem, the reconstruction of ŷDES has been based on the Q loadings of the PLS model. The proposed methodologies can however be applied with no modifications to the case in which the PCA loadings are used to reconstruct ŷDES.



AUTHOR INFORMATION

Corresponding Author

*E-mail: sal.garcia@pfizer.com. Notes

The authors declare no competing financial interest.



ACKNOWLEDGMENTS E.T. gratefully acknowledges “Fondazione Ing. Aldo Gini” for the financial support. The authors want also to thank Prof. Massimiliano Barolo for his help and precious discussions to improve the value of the work and the quality of the manuscript.



REFERENCES

(1) Jaeckle, C. M.; MacGregor, J. F. Product design through multivariate statistical analysis of process data. AIChE J. 1998, 44, 1105. (2) Jaeckle, C. M.; MacGregor, J. F. Industrial applications of product design through the inversion of latent variable models. Chemom. Intell. Lab. Syst. 2000, 50, 199. (3) García-Muñoz, S.; Kourti, T.; MacGregor, J. F.; Apruzzese, F.; Champagne, M. Optimization of batch operating policies. Part I. Handling multiple solutions. Ind. Eng. Chem. Res. 2006, 45, 7856. (4) García-Muñoz, S.; MacGregor, J. F.; Neogi, D.; Letshaw, B. E.; Mehta, S. Optimization of batch operating policies. Part II. Incorporating process constraints and industrial applications. Ind. Eng. Chem. Res. 2008, 47, 4202. (5) Flores-Cerrillo, J.; MacGregor, J. F. Control of batch product quality by trajectory manipulation using latent variable models. J. Process. Control 2004, 14, 539. (6) Flores-Cerrillo, J.; MacGregor, J. F. Latent variable MPC for trajectory tracking in batch processes. J. Process. Control 2005, 15, 651. (7) García-Muñoz, S.; Dolph, S.; Ward, H. W., III Handling uncertainty in the establishment of a design space for the manufacture of a pharmaceutical product. Comput. Chem. Eng. 2010, 34, 1098. (8) Yacoub, F.; MacGregor, J. F. Product optimization and control in the latent variable space of nonlinear PLS models. Chemom. Intel. Lab. Sys. 2004, 70, 63. (9) Muteki, K.; MacGregor, J. F.; Ueda, T. Rapid development of new polymer blends: The optimal selection of materials and blend ratios. Ind. Eng. Chem. Res. 2006, 45, 4653. (10) Tomba, E.; Barolo, M.; García-Muñoz, S. General framework for latent variable model inversion for the design and manufacturing of new products. Ind. Eng. Chem. Res. 2012, 51, 12886. (11) Höskuldsson, A. PLS regression methods. J. Chemom. 1988, 2, 211. (12) Burnham, A. J.; Viveros, R.; MacGregor, J. F. Frameworks for latent variable multivariate regression. J. Chemom. 1996, 10, 31. (13) Macgregor, J. F.; Kourti, T. Statistical process control of multivariable processes. Control Eng. Practice. 1995, 3, 3. (14) Arteaga, F.; Ferrer, A. Dealing with missing data in MSPC: Several methods, different interpretations, some examples. J. Chemom. 2002, 16, 408. (15) Nelson, P. R. C.; Taylor, P. A.; MacGregor, J. F. Missing data methods in PCA and PLS: Score calculations with incomplete observations. Chemom. Intel. Lab. Sys. 1996, 35, 45. (16) Kerrigan, E. C.; Maciejowski, J. M. Soft constraints and exact penalty functions in model predictive control. In: Proc. UKACC



APPENDIX B. INTERPRETATION OF THE LOADING PLOTS OF THE WET GRANULATION CASE STUDY The 1st LV, which accounts for ∼32% of the total variance in Y (Table 3), is mainly driven by the particle size distribution (PSD) variables, namely D[3,2] and D90/D10 (top plot of Figure 3). In particular, D[3,2] and D90/D10 are opposite, meaning that granulated materials in the database with high PSD (high D[3,2]) have usually narrower PSD (low D90/D10), compared to the mean of the data. This seems to affect the percentage of oversize granules, which is directly correlated with D[3,2] and the difference in compactibility compared to the raw material (ΔCompactibility). Namely, it is expected that products with high PSD mean, have larger percentages of oversize granules compared to materials with lower PSD means, and this seems to slightly affect the compactibility difference of the granulated product with the raw material, which is expected to be lower. The 2nd LV, which explains ∼23% of the total variance of the data, is affected mainly by the moisture content loss upon drying (LOD) of the materials after the granulation (second plot of Figure 3). From the analysis of the variable loadings it can be argued that materials which have a higher loss of water upon drying are in general those which are less grown and result in products with lower compactibility properties compared to the raw materials (ΔCompactibility). The 3rd LV (third plot of Figure 3) explains ∼8% of the total variance of the data and is mainly driven by the flow properties of the products (ΔFlodex). In particular it seems that products which are more flowable than the corresponding raw materials K

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX

Industrial & Engineering Chemistry Research

Article

International Conference (Control 2000): Cambridge (U.K.), 4−7 September 2000. (17) Vemavarapu, C.; Surapanemi, M.; Hussain, M.; Badawy, S. Role of drug substance material properties in the processability and performance of a wet granulated product. Int. J. Pharm. 2009, 374, 96. (18) Jackson, J. E. A User’s Guide to Principal Components; John Wiley & Sons, Inc.: New York, 1991. (19) Emery, E.; Oliver, J.; Pugsley, T.; Sharma, J.; Zhou, J. Flowability of moist pharmaceutical powders. Powder Technol. 2009, 189, 409.

L

dx.doi.org/10.1021/ie3032839 | Ind. Eng. Chem. Res. XXXX, XXX, XXX−XXX