From charlesreid1

Box, George; Draper, Norman (1987). Empirical Model-Building and Response Surfaces. Wiley and Sons. ISBN 0-471-81033-9. 

Chapter 1: Introduction to Response Surface Methodology

Questions when planning initial set of experiments:

1. Which input variables should be studied?

2. Should the input variables be examined in their original form, or should transformed input variables be employed?

3. How should response be measured?

4. At which levels of a given input variable should experiments be run?

5. How complex a model is necessary in a particular situation?

6. How shall we choose qualitative variables?

7. What experimental arrangement (experimental design) should be used?


Chapter 2: Use of Graduating Functions

Polynomial approximations:

  • a polynomial of degree d can be thought of as a Taylor series expansion of the true underlying theoretical function y(x) truncated after terms of dth order
  • the higher the degree d, the more closely the Taylor series can approximate the true function
  • the smaller the region R over which y(x) is being approximated with the polynomial approximation, the better the approximation
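The last two bullets can be checked numerically. The sketch below uses exp(x) as a stand-in for the unknown true function (an assumption for illustration only; the notes do not name a function) and measures the worst-case error of its degree-d Taylor polynomial about 0 over a region of half-width r. The helper names `taylor_exp` and `max_error` are made up for this example.

```python
import math

def taylor_exp(x, d):
    """Degree-d Taylor polynomial of exp(x) about x = 0."""
    return sum(x**k / math.factorial(k) for k in range(d + 1))

def max_error(d, r, n_grid=201):
    """Largest |exp(x) - polynomial| on a grid over the region [-r, r]."""
    xs = [-r + 2 * r * i / (n_grid - 1) for i in range(n_grid)]
    return max(abs(math.exp(x) - taylor_exp(x, d)) for x in xs)

# Higher degree -> smaller error on a fixed region
errors_by_degree = [max_error(d, r=1.0) for d in (1, 2, 3, 4)]

# Smaller region -> smaller error for a fixed degree
errors_by_region = [max_error(2, r) for r in (1.0, 0.5, 0.25)]

print(errors_by_degree)
print(errors_by_region)
```

Both lists should come out strictly decreasing, which is the behavior the bullets describe.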

Issues with application of polynomial approximations:

  • least squares - how does it work? what are its assumptions?
  • standard errors of coefficients - how to estimate the standard deviations of the linear coefficients?
  • adequacy of fit - approximating an unknown theoretical function empirically; need to be able to check whether a given degree of approximation is adequate; how can analysis of variance (ANOVA) and examination of residuals (observed - fitted values) help to check adequacy of fit?
  • designs - what designs are suitable for fitting polynomials of first and second degrees? (Ch. 4, 5, 15, 13)
  • transformations - how can one find transformations (generally)?


Chapter 3: Least Squares for Response Surface Work

Method of Least Squares

Least squares helps you to understand a model of the form:

y = f(x,t) + e

where:

E(y) = eta = f(x,t)

is the mean level of the response y which is affected by k variables (x1, x2, ..., xk) = x

It also involves p parameters (t1, t2, ..., tp) = t

e is experimental error

To examine this model, experiments would be run at n different sets of conditions, x1, x2, ..., xn

One would then observe the corresponding values of the response, y1, y2, ..., yn

Two important questions:

1. does postulated model accurately represent the data?

2. if model does accurately represent data, what are best estimates of parameters t?

start with second question first


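Given: function f(x,t) for each experimental run

n discrepancies:

$$y_u - f(x_u, t), \quad u = 1, 2, \ldots, n$$

Method of least squares selects the value of t that makes the sum of squares of these discrepancies smallest:

$$S(t) = \sum_{u=1}^{n} \left[ y_u - f(x_u, t) \right]^2$$

S(t) = sum of squares function

the minimizing choice of t is denoted $\hat{t}$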

are least-squares estimates of t good?

their goodness depends on the nature of the distribution of their errors

least-squares estimates are appropriate if you can assume that the experimental errors:

are statistically independent, have constant variance, and are normally distributed

these are "standard assumptions"

Linear models

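this is an important special case, where

$$\eta = t_1 z_1 + t_2 z_2 + \ldots + t_p z_p$$

adding experimental error $\epsilon = y - \eta$:

$$y = t_1 z_1 + t_2 z_2 + \ldots + t_p z_p + \epsilon$$

a model of this form is linear in the parameters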

Algorithm

Formulate a problem with n observed responses, p parameters...

this yields n equations of the form

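$$y_1 = t_1 z_{11} + t_2 z_{21} + \ldots + t_p z_{p1} + \epsilon_1$$

$$y_2 = t_1 z_{12} + t_2 z_{22} + \ldots + t_p z_{p2} + \epsilon_2$$

etc... (here $z_{iu}$ is the value of regressor i in run u)

This can be written in matrix form:

$$\mathbf{y} = \mathbf{Z t} + \boldsymbol{\epsilon}$$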

and the dimensions of each matrix are:

  • y = n x 1
  • Z = n x p
  • t = p x 1
  • epsilon = n x 1

the sum of squares function is given by:

$$S(t) = \sum_{u=1}^{n} \left( y_u - t_1 z_{1u} - t_2 z_{2u} - \ldots - t_p z_{pu} \right)^2$$

or,

$$S(t) = (\mathbf{y - Zt})^{\prime} (\mathbf{y - Zt})$$

setting the derivatives of S(t) with respect to each parameter to zero gives the normal equations:

$$\mathbf{Z^{\prime} Z t = Z^{\prime} y}$$
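A minimal sketch of this algorithm, assuming NumPy is available. The data are made up for illustration (a constant column plus one hypothetical regressor), and `fit_normal_equations` is an illustrative name, not something from the book.

```python
import numpy as np

def fit_normal_equations(Z, y):
    """Solve the normal equations Z'Z t = Z'y for t (Z must have full rank)."""
    return np.linalg.solve(Z.T @ Z, Z.T @ y)

# Hypothetical data: n = 5 runs, p = 2 parameters
z1 = np.ones(5)                              # constant column
z2 = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])   # made-up regressor levels
Z = np.column_stack([z1, z2])                # n x p
y = np.array([1.1, 2.9, 5.2, 7.1, 8.8])      # made-up responses

t_hat = fit_normal_equations(Z, y)
residuals = y - Z @ t_hat

# The residual vector is orthogonal to every column of Z
print(t_hat, Z.T @ residuals)
```

In practice `np.linalg.lstsq` is numerically safer than forming Z'Z explicitly; the explicit form is used here only to mirror the equations.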

Rank of Z

If there are linear relationships between the different regressors (z's), then Z loses full rank and Z'Z becomes singular, so the normal equations have no unique solution

e.g. if there is a relationship z2 = c z1, then you can only estimate the linear combination t1 + c t2

reason: when z2 = c z1, changes in z1 can't be distinguished from changes in z2

Z (an n x p matrix) is said to be full rank p if there are no linear relationships of the form:

$$a_1 z_1 + a_2 z_2 + \ldots + a_p z_p = 0$$

(with the a's not all zero)


if there are q > 0 independent linear relationships, then Z has rank p - q
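The rank statement can be checked with NumPy. This is a sketch with made-up regressor values; it builds a Z in which z2 = c z1 exactly, and shows that two different parameter vectors sharing the same value of t1 + c t2 give identical predictions, so the data cannot tell them apart.

```python
import numpy as np

z1 = np.array([1.0, 2.0, 3.0, 4.0])    # hypothetical regressor
c = 2.5
z2 = c * z1                             # exact linear relationship z2 = c z1
z3 = np.array([1.0, -1.0, 1.0, -1.0])   # unrelated regressor

Z = np.column_stack([z1, z2, z3])       # n = 4, p = 3

p = Z.shape[1]
rank = np.linalg.matrix_rank(Z)
q = p - rank                            # number of independent linear relationships

# Two parameter vectors with the same t1 + c*t2 give identical predictions
t_a = np.array([1.0, 0.0, 0.5])
t_b = np.array([1.0 - c * 0.4, 0.4, 0.5])   # t1 + c*t2 is still 1.0
same_predictions = np.allclose(Z @ t_a, Z @ t_b)

print(rank, q, same_predictions)
```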


Analysis of Variance: 1 regressor

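Assume simple model

$$y = t + \epsilon$$

This states that y is varying about an unknown mean t

Suppose we have 3 observations of y, $\mathbf{y} = (4, 1, 1)^{\prime}$

Then the model can be written as

$$\mathbf{y} = \mathbf{z_1} t + \boldsymbol{\epsilon}$$

with $\mathbf{z_1} = (1, 1, 1)^{\prime}$,

so that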

[ 4 ]   [ 1 ]     [ \epsilon_1 ]
[ 1 ] = [ 1 ] t + [ \epsilon_2 ]
[ 1 ]   [ 1 ]     [ \epsilon_3 ]

Suppose a specific value of the parameter t is posited, e.g. $t_0 = 0.5$

Then you could check the null hypothesis, e.g. $H_0 : t = t_0 = 0.5$

If true, the mean observation vector is given by $\boldsymbol{\eta}_0 = \mathbf{z_1} t_0$

or,

[ 0.5 ]   [ 1 ]
[ 0.5 ] = [ 1 ] 0.5
[ 0.5 ]   [ 1 ]

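and the appropriate "observation breakdown" (the split of $y - \eta_0$ into a model part and a residual part) is:

$$y - \eta_0 = (\hat{y} - \eta_0) + (y - \hat{y})$$

where $\hat{y} = \hat{t} z_1$ and $\hat{t} = \overline{y} = 2$. Numerically:

[ 3.5 ]   [ 1.5 ]   [  2 ]
[ 0.5 ] = [ 1.5 ] + [ -1 ]
[ 0.5 ]   [ 1.5 ]   [ -1 ]

Associated with this observation breakdown is an analysis of variance table:

Source | Degrees of freedom (df) | Sum of squares (square of length), SS | Mean square, MS | Expected value of mean square, E(MS)
Model | 1 | $\vert \hat{y} - \eta_0 \vert^2 = 6.75$ | 6.75 | $\sigma^2 + (t - t_0)^2 \sum z_1^2$
Residual | 2 | $\vert y - \hat{y} \vert^2 = 6.00$ | 3.00 | $\sigma^2$
Total | 3 | $\vert y - \eta_0 \vert^2 = 12.75$ | |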

sum of squares: squared lengths of vectors

degrees of freedom: number of dimensions in which vector can move (geometric interpretation)

the model says whatever the data is, the systematic part $\hat{y} - \eta_0$ of $y - \eta_0$ must lie in the direction of $z_1$, which gives only one degree of freedom.

Whatever the data, the residual vector $y - \hat{y}$ must be perpendicular to $z_1$ (least squares picks the point on the $z_1$ line closest to y, which is the orthogonal projection of y onto that line), and so it can move in 2 directions and has 2 degrees of freedom

Now, looking at the null hypothesis:

the component $\hat{y} - \eta_0$ is a measure of discrepancy between POSTULATED model and ESTIMATED model

Making "standard assumptions" (earlier), the expected value of its sum of squares, assuming the model is true, is $(t - t_0)^2 \sum z_1^2 + \sigma^2$

For the residual component it is $2 \sigma^2$ (or, in general, $\nu_2 \sigma^2$, where $\nu_2$ is the number of degrees of freedom of the residuals)

Thus a measure of discrepancy from the null hypothesis is

$$F_0 = \frac{ \vert \hat{y} - \eta_0 \vert^2 / 1 }{ \vert y - \hat{y} \vert^2 / 2 } = \frac{6.75}{3.00} = 2.25$$

if the null hypothesis were true, then the top and bottom would both estimate the same $\sigma^2$

So if F is well above 1, that indicates departure from the null hypothesis

The MORE F exceeds 1, the more doubtful the null hypothesis becomes
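The one-regressor analysis can be reproduced in a few lines of plain Python. The sketch uses the three observations (4, 1, 1), the regressor of ones, and the null value 0.5 from this section; the helper `dot` and the variable names are just for illustration.

```python
y = [4.0, 1.0, 1.0]      # the three observations
z1 = [1.0, 1.0, 1.0]     # regressor for the mean
t0 = 0.5                 # value of t under the null hypothesis

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

# One-regressor least squares: t_hat = z1'y / z1'z1 (here, the sample mean)
t_hat = dot(z1, y) / dot(z1, z1)

eta0 = [t0 * z for z in z1]                        # mean vector under H0
y_hat = [t_hat * z for z in z1]                    # fitted vector
model_part = [f - e for f, e in zip(y_hat, eta0)]  # y_hat - eta0
resid_part = [o - f for o, f in zip(y, y_hat)]     # y - y_hat

ss_model = dot(model_part, model_part)   # 1 degree of freedom
ss_resid = dot(resid_part, resid_part)   # 2 degrees of freedom
ss_total = sum((o - e) ** 2 for o, e in zip(y, eta0))

F0 = (ss_model / 1) / (ss_resid / 2)
print(t_hat, ss_model, ss_resid, ss_total, F0)   # 2.0 6.75 6.0 12.75 2.25
```

The model and residual parts are perpendicular (their dot product is zero), which is why the two sums of squares add up to the total.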

Least squares: 2 regressors

Previous model, $y = t + \epsilon$, said y was represented with a mean plus an error.

Instead, suppose that there are systematic deviations from the mean, associated with an external variable (e.g. humidity in the lab).

Now equation is for straight line:

$$y = t_1 + t_2 z_2 + \epsilon$$

or, with $z_1$ a vector of ones, $y = z_1 t_1 + z_2 t_2 + \epsilon$

So now the revised least-squares model is: $\eta = z_1 t_1 + z_2 t_2$

$\eta = E(y)$ - i.e. $\eta$ is in the plane defined by linear combinations of the vectors $z_1$, $z_2$

because $z_1^{\prime} z_2 \neq 0$, these two vectors are NOT at right angles

the least-squares values $\hat{t}_1, \hat{t}_2$ produce a vector $\hat{\hat{y}} = \hat{t}_1 z_1 + \hat{t}_2 z_2$

these least-squares values make the squared length $\sum (y - \hat{\hat{y}})^2 = \vert y - \hat{\hat{y}} \vert^2$ of the residual vector as small as possible

The normal equations express the fact that the residual vector must be perpendicular to both $z_1$ and $z_2$:

$$\begin{aligned} z_1^{\prime} (y - \hat{\hat{y}}) &= 0 \\ z_2^{\prime} (y - \hat{\hat{y}}) &= 0 \end{aligned}$$

also written as:

$$\begin{aligned} \sum z_1 (y - \hat{t}_1 z_1 - \hat{t}_2 z_2) &= 0 \\ \sum z_2 (y - \hat{t}_1 z_1 - \hat{t}_2 z_2) &= 0 \end{aligned}$$

also written (in matrix form) as:

$$\mathbf{Z^{\prime}} (\mathbf{y - Z \hat{t}}) = 0$$


Now suppose the null hypothesis was investigated for $t_1 = t_{10} = 0.5$ and $t_2 = t_{20} = 1.0$

Then the mean observation vector $\eta_0$ is represented as $\eta_0 = t_{10} z_1 + t_{20} z_2$

Source | Degrees of freedom | SS | MS | F
Model ($z_1$ and $z_2$) | 2 | $\vert \hat{\hat{y}} - \eta_0 \vert^2 = \sum \left[ (\hat{t}_1 - t_{10}) z_1 + (\hat{t}_2 - t_{20}) z_2 \right]^2 = 6.69$ | 3.345 | 2.23
Residual | 1 | $\vert y - \hat{\hat{y}} \vert^2 = \sum \left( y - \hat{t}_1 z_1 - \hat{t}_2 z_2 \right)^2 = 1.50$ | 1.50 |
Total | 3 | $\vert y - \eta_0 \vert^2 = \sum \left( y - \eta_0 \right)^2 = 8.19$ | |

and so

$$F_0 = \frac{ \vert \hat{\hat{y}} - \eta_0 \vert^2 / 2 }{ \vert y - \hat{\hat{y}} \vert^2 / 1 } = 2.23$$
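The two-regressor table can be reproduced numerically, assuming NumPy. The notes do not list the values of z_2, so the vector used below is an assumption: it was chosen because it is consistent with the numbers quoted in this section (sums of squares 6.69, 1.50 and 8.19, and F = 2.23). Treat it as illustrative rather than as the book's data.

```python
import numpy as np

y = np.array([4.0, 1.0, 1.0])
z1 = np.array([1.0, 1.0, 1.0])
z2 = np.array([0.8, 0.2, -0.4])   # ASSUMED values, not given in the notes
Z = np.column_stack([z1, z2])

# Normal equations Z'Z t = Z'y
t_hat = np.linalg.solve(Z.T @ Z, Z.T @ y)
y_fit = Z @ t_hat                      # two-regressor fitted vector

t_null = np.array([0.5, 1.0])          # (t10, t20) under the null hypothesis
eta0 = Z @ t_null

ss_model = np.sum((y_fit - eta0) ** 2)   # 2 degrees of freedom
ss_resid = np.sum((y - y_fit) ** 2)      # 1 degree of freedom
ss_total = np.sum((y - eta0) ** 2)       # 3 degrees of freedom

F0 = (ss_model / 2) / (ss_resid / 1)
print(t_hat, ss_model, ss_resid, ss_total, F0)
```

With the assumed z_2 the residual vector is perpendicular to both regressors, so the model and residual sums of squares add up to the total.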

Orthogonalizing second regressor

In the above example, $z_1$ and $z_2$ are not orthogonal

One can find vectors $z_1$ and $z_{2 \cdot 1}$ that are orthogonal

To do this, use least squares property that residual vector is orthogonal to space in which the predictor variables lie

Regard $z_2$ as "response" vector and $z_1$ as predictor variable

You then obtain $\hat{z}_2 = 0.2 z_1$ (the coefficient is the one-regressor least-squares estimate $\sum z_1 z_2 / \sum z_1^2$, which is 0.2 here)

so the residual vector is $z_{2 \cdot 1} = z_2 - \hat{z}_2 = z_2 - 0.2 z_1$

now the model can be rewritten as $\eta = \left( t_1 + 0.2 t_2 \right) z_1 + t_2 \left( z_2 - 0.2 z_1 \right) = t z_1 + t_2 z_{2 \cdot 1}$, where $t = t_1 + 0.2 t_2$

This gives three least-squares equations:

1. $\hat{y} = 2 z_1$

2. $\hat{\hat{y}} = 1.5 z_1 + 2.5 z_2$

3. $\hat{\hat{y}} = 2.0 z_1 + 2.5 z_{2 \cdot 1}$

The analysis of variance becomes (the quoted values correspond to $\eta_0 = 0$, i.e. $t_0 = 0$):

Source | df | SS
Response function with $z_1$ only | 1 | $\vert \hat{y} - \eta_0 \vert^2 = \left( \hat{t} - t_0 \right)^2 \sum z_1^2 = 12.0$
Extra due to $z_2$ (given $z_1$) | 1 | $\vert \hat{\hat{y}} - \hat{y} \vert^2 = \hat{t}_2^2 \sum z_{2 \cdot 1}^2 = 4.5$
Residual | 1 | $\vert y - \hat{\hat{y}} \vert^2 = \sum \left( y - \hat{\hat{y}} \right)^2 = 1.5$
Total | 3 | $\vert y - \eta_0 \vert^2 = \sum \left( y - \eta_0 \right)^2 = 18.0$
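The orthogonalization step and the resulting sums of squares can be sketched in plain Python. As before, the values of z_2 are an assumption (the notes do not give them); they are chosen to be consistent with the coefficient 0.2 and the sums of squares 12.0, 4.5, 1.5 and 18.0 quoted in this section. The null mean vector is taken to be zero.

```python
def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

y = [4.0, 1.0, 1.0]
z1 = [1.0, 1.0, 1.0]
z2 = [0.8, 0.2, -0.4]   # ASSUMED values, not given in the notes

# Regress z2 on z1; the residual z2.1 is orthogonal to z1
coef = dot(z1, z2) / dot(z1, z1)
z21 = [b - coef * a for a, b in zip(z1, z2)]

# With orthogonal regressors each coefficient is fitted separately
t_hat = dot(z1, y) / dot(z1, z1)       # coefficient of z1
t2_hat = dot(z21, y) / dot(z21, z21)   # coefficient of z2.1

ss_z1 = t_hat ** 2 * dot(z1, z1)          # response function with z1 only
ss_extra = t2_hat ** 2 * dot(z21, z21)    # extra due to z2, given z1
ss_total = dot(y, y)                      # total about eta0 = 0
ss_resid = ss_total - ss_z1 - ss_extra    # valid because z1 and z2.1 are orthogonal

print(coef, t_hat, t2_hat, ss_z1, ss_extra, ss_resid, ss_total)
```

Because z_1 and z_{2.1} are orthogonal, the sum of squares splits cleanly into a piece for z_1 alone and an extra piece for z_2, which is the point of the orthogonalization.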


Generalization to p regressors

With n observations and p parameters:

n relations implicit in response function can be written

$$\boldsymbol{\eta} = \mathbf{Zt}$$

Assume Z is full rank, and let $\hat{\mathbf{t}}$ be the vector of estimates given by the normal equations, with $\hat{y} = \mathbf{Z} \hat{\mathbf{t}}$

Sum of squares function is $S(t) = (y - \eta)^{\prime} (y - \eta) = (y - \hat{y})^{\prime} (y - \hat{y}) + (\hat{y} - \eta)^{\prime} (\hat{y} - \eta)$

because cross-product is zero from the normal equations

Furthermore, because $\mathbf{Z^{\prime} Z}$ is positive definite, $S(t)$ is minimized when $t = \hat{t}$

So the solution to the normal equations producing the least squares estimate is:

$$\hat{t} = (\mathbf{Z^{\prime} Z})^{-1} \mathbf{Z^{\prime} y}$$

Source | df | SS
Response function | p | $\vert \hat{y} - \eta \vert^2 = (\hat{t} - t)^{\prime} \mathbf{Z^{\prime} Z} (\hat{t} - t)$
Residual | n-p | $\vert y - \hat{y} \vert^2 = \sum ( y - \hat{y} )^2$
Total | n | $\vert y - \eta \vert^2 = \sum ( y - \eta )^2$
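The decomposition in this table can be verified numerically for any trial value of t. The sketch below assumes NumPy and uses simulated data (random regressors and made-up parameters, nothing from the book); it checks that S(t) equals the residual sum of squares plus the quadratic form in (t_hat - t).

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 8, 3
Z = rng.normal(size=(n, p))          # simulated full-rank regressor matrix
t_true = np.array([1.0, -2.0, 0.5])  # made-up "true" parameters
y = Z @ t_true + rng.normal(scale=0.1, size=n)

t_hat = np.linalg.solve(Z.T @ Z, Z.T @ y)
y_hat = Z @ t_hat

def S(t):
    """Sum of squares function S(t) = (y - Zt)'(y - Zt)."""
    r = y - Z @ t
    return r @ r

t = np.array([0.0, 0.0, 0.0])        # any trial value of t
ss_resid = np.sum((y - y_hat) ** 2)               # n - p degrees of freedom
ss_model = (t_hat - t) @ (Z.T @ Z) @ (t_hat - t)  # p degrees of freedom

print(S(t), ss_resid + ss_model)     # the two numbers agree
```

Since the quadratic form is never negative when Z'Z is positive definite, S(t) can never fall below the residual sum of squares, which is reached at t = t_hat.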


Bias in Least-Squares Estimators if Inadequate Model

Say data was being fit with a model $y = Z_1 t_1 + \epsilon$,

but the true model that should have been used is $y = Z_1 t_1 + Z_2 t_2 + \epsilon$

$t_1$ would be estimated by $\hat{t}_1 = (\mathbf{Z_1^{\prime} Z_1})^{-1} \mathbf{Z_1^{\prime} y}$

but using true model,

$$\begin{array}{rcl} E( \hat{t}_1 ) &=& ( \mathbf{Z_1^{\prime} Z_1} )^{-1} \mathbf{Z_1^{\prime}} E(\mathbf{y}) \\ &=& ( \mathbf{Z_1^{\prime} Z_1} )^{-1} \mathbf{Z_1^{\prime}} (\mathbf{Z_1 t_1} + \mathbf{Z_2 t_2}) \\ &=& \mathbf{t_1 + A t_2} \end{array}$$

The matrix A is the bias or alias matrix

$$A = \left( \mathbf{Z_1^{\prime} Z_1} \right)^{-1} \mathbf{Z_1^{\prime} Z_2}$$

Unless A = 0, $\hat{t}_1$ will represent t1 AND t2, not just t1

A = 0 when $\mathbf{Z_1^{\prime} Z_2} = 0$, which happens if regressors in Z1 are orthogonal to regressors in Z2
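A small worked case may help here. The sketch below, assuming NumPy, fits a straight line at three made-up design points when the true model also contains a quadratic term. The design points and parameter values are hypothetical; noise is omitted so the fitted coefficients equal their expected values.

```python
import numpy as np

x = np.array([-1.0, 0.0, 1.0])           # hypothetical design points
Z1 = np.column_stack([np.ones(3), x])     # fitted model: intercept + slope
Z2 = (x ** 2).reshape(-1, 1)              # omitted term: x^2

# Alias (bias) matrix A = (Z1'Z1)^{-1} Z1'Z2
A = np.linalg.solve(Z1.T @ Z1, Z1.T @ Z2)

t1 = np.array([2.0, 1.0])    # made-up true intercept and slope
t2 = np.array([3.0])         # made-up true quadratic coefficient

# Noise-free responses from the true model, so the fit equals E(t1_hat)
y = Z1 @ t1 + Z2 @ t2
t1_hat = np.linalg.solve(Z1.T @ Z1, Z1.T @ y)

print(A.ravel())             # intercept is aliased with t2, slope is not
print(t1_hat, t1 + A @ t2)   # identical: E(t1_hat) = t1 + A t2
```

In this design the x column is orthogonal to the x^2 column, so the slope estimate is unbiased, while the constant column is not, so the intercept absorbs part of the quadratic effect.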

Confidence Intervals

(examples given for orthogonal and non-orthogonal design... looks interesting but didn't understand it fully)

Chapter 4: Factorial Designs at 2 Levels

I think this approach has a problem... Can only lead to LINEAR models. Chapter 7 begins to deal with 2nd order models.

However, I'm not completely screwed. Composite designs: Chapter 9 details central composite designs, which consist of factorial designs for first-order effects, plus more points to determine higher-order terms.


Brief explanation of 2-level factorial designs

Designation of lower/upper level with -1/+1

Analysis of Factorial Design

Main effect of a given variable, as defined by Yates (1937), is the average difference in the level of response as one moves from low to high level of that variable

Example: effect of variable 1 is estimated by the average response y at its high level minus the average response at its low level:

$$\begin{aligned} & \frac{1}{4} \left( y_{2=+1,3=+1} + y_{2=-1,3=+1} + y_{2=+1,3=-1} + y_{2=-1,3=-1} \right)_{1=+1} \\ & - \frac{1}{4} \left( y_{2=+1,3=+1} + y_{2=-1,3=+1} + y_{2=+1,3=-1} + y_{2=-1,3=-1} \right)_{1=-1} \\ & = 0.75 \end{aligned}$$

and effect of variable 2 is $-0.59$

and effect of variable 3 is $-0.35$

Factorial designs also make calculation of interactions possible... i.e. is effect of 1 different at the two different levels of 3?

Example given of calculating multiple interactions...
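The effect calculations can be sketched for a 2^3 design in plain Python. The responses below are illustrative values that are not taken from these notes (treat them as hypothetical); they were picked so that the three main effects come out close to the 0.75, -0.59 and -0.35 quoted above. The function name `effect` is made up for this example.

```python
from itertools import product

# 2^3 design in standard order: variable 1 changes fastest
design = [(x1, x2, x3) for x3, x2, x1 in product((-1, +1), repeat=3)]

# HYPOTHETICAL responses, one per run (not taken from the notes)
y = [2.83, 3.56, 2.23, 3.06, 2.47, 3.31, 1.95, 2.56]

def effect(columns):
    """Average response where the product of the given columns is +1,
    minus the average where it is -1."""
    signs = []
    for run in design:
        s = 1
        for c in columns:
            s *= run[c]
        signs.append(s)
    hi = [yi for s, yi in zip(signs, y) if s == +1]
    lo = [yi for s, yi in zip(signs, y) if s == -1]
    return sum(hi) / len(hi) - sum(lo) / len(lo)

main_effects = [effect([i]) for i in range(3)]    # variables 1, 2, 3
interaction_12 = effect([0, 1])                   # 1 x 2 interaction
interaction_123 = effect([0, 1, 2])               # three-factor interaction
grand_mean = sum(y) / len(y)

print(grand_mean, main_effects, interaction_12)
```

An interaction is computed the same way as a main effect, using the product of the relevant sign columns as the contrast. For two variables this is half the difference between the effect of one variable at the high and low levels of the other.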

Variance, Standard Errors

For complete $2^k$ design, if $V(y) = \sigma^2$:

$$V(\mbox{grand mean}) = \frac{ \sigma^2 }{ 2^k }$$

$$V(\mbox{effect}) = \frac{4 \sigma^2}{ 2^k }$$

or, if there are r repeats, then the denominators become $r 2^k$

In practice, still need an estimate $s^2$ of the experimental error variance $\sigma^2$

Suppose we're given estimate of $s^2 = 0.0050$; then (with $2^k = 8$ runs)

$$\hat{V}(\overline{y}) = 0.000625$$

$$\hat{V}(\mbox{effect}) = 0.0025$$

and corresponding standard errors are the square roots:

$$s(\overline{y}) = 0.025$$

$$s(\mbox{effect}) = 0.05$$

so the estimated effects, with their standard errors, are:

I (grand mean): $\overline{y} = 2.745 \pm 0.025$

Variable 1: $0.75 \pm 0.05$

Variable 2: $-0.59 \pm 0.05$

Variable 3: $-0.35 \pm 0.05$

Variable 12: $0.03 \pm 0.05$

etc...
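The variance and standard error figures above follow directly from the two formulas. A short check in plain Python, assuming k = 3 (three variables, so 8 runs, which is consistent with the quoted numbers) and no repeats; `factorial_variances` is an illustrative name.

```python
import math

def factorial_variances(s2, k, r=1):
    """Estimated variances of the grand mean and of an effect
    for a complete 2^k design with r repeats and error variance s2."""
    n_runs = r * 2 ** k
    return s2 / n_runs, 4 * s2 / n_runs

s2 = 0.0050          # estimate of sigma^2 quoted above
k = 3                # assumed: three variables, so 2^3 = 8 runs

var_mean, var_effect = factorial_variances(s2, k)
se_mean = math.sqrt(var_mean)
se_effect = math.sqrt(var_effect)

print(var_mean, var_effect, se_mean, se_effect)
```

The factor of 4 arises because an effect is a difference between two averages, each taken over half of the runs.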

Regression Coefficients

If you fit a first degree polynomial to textile data, you can obtain:

$$\hat{y} = (2.745 \pm 0.025) + (0.375 \pm 0.025) x_1 - (0.295 \pm 0.025) x_2 - (0.175 \pm 0.025) x_3$$

The estimated regression coefficients $b_1 = 0.375, b_2 = -0.295, b_3 = -0.175$ and their errors are half of the main effects and their standard errors

Factor of one half comes from definition of effect: difference in response on moving from the -1 level to the +1 level of given variable $x_i$, so it corresponds to change in y after changing $x_i$ by 2 units

Regression coefficient $b_i$ is the change in y when $x_i$ is changed by 1 unit
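The factor of one half can be confirmed by fitting the first-degree model by least squares on a coded 2^3 design and comparing the slopes with the main effects. The sketch assumes NumPy; the responses are hypothetical values, not data from these notes.

```python
import numpy as np
from itertools import product

# 2^3 design in coded units (-1/+1), variable 1 changing fastest
X = np.array([(x1, x2, x3) for x3, x2, x1 in product((-1.0, 1.0), repeat=3)])

# HYPOTHETICAL responses, one per run (not taken from the notes)
y = np.array([2.83, 3.56, 2.23, 3.06, 2.47, 3.31, 1.95, 2.56])

# First-degree model: y = b0 + b1 x1 + b2 x2 + b3 x3
Z = np.column_stack([np.ones(len(y)), X])
b, *_ = np.linalg.lstsq(Z, y, rcond=None)

# Main effects: mean response at +1 minus mean response at -1
effects = np.array([y[X[:, i] > 0].mean() - y[X[:, i] < 0].mean()
                    for i in range(3)])

print(b[0], y.mean())        # intercept equals the grand mean
print(b[1:], effects / 2)    # slopes equal half the main effects
```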

Dye example

Example of $2^6$ factorial design

Analysis of results shows that the data are adequately explained in terms of linear effects in only three variables, $x_1 , x_4 , x_6$

Linear equations in $x_1 , x_4 , x_6$ fitted by least squares to all 64 data points:

$$\begin{array}{rcl} \hat{y}_1 &=& (11.12 \pm 0.24)+(0.87 \pm 0.24) x_1+(1.49 \pm 0.24) x_4+(1.35 \pm 0.24) x_6 \\ \hat{y}_2 &=& (16.95 \pm 0.74)-(5.64 \pm 0.74) x_1-(0.17 \pm 0.74) x_4+(5.42 \pm 0.74) x_6 \\ \hat{y}_3 &=& (28.28 \pm 0.67)+(0.19 \pm 0.67) x_1-(1.50 \pm 0.67) x_4-(4.44 \pm 0.67) x_6 \end{array}$$

These are not a function of $x_2, x_3, x_5$ but this does NOT mean substituting value of 0 for nonsignificant coefficients

Value of 0 would not be best estimate

Regard fitted equations as best estimates in the three dimensional subspace of the full six-dimensional space in which $x_2, x_3, x_5$ are at average values

Next, obtain estimate of standard error deviations from residual sum of squares:

Variable 1

| Source of variation | SS | df | MS | F ratio |
|---|---|---|---|---|
| Total SS, $\sum y^2$ | 8,443.41 | 64 | | |
| Correction factor, SS due to $b_0$: $(\sum y)^2 / 64$ | 7,918.77 | 1 | | |
| Corrected total SS | 524.63 | 63 | | |
| Due to $b_1$: $b_1 \sum x_1 y = ( \sum x_1 y )^2 / \sum x_1^2$ | 48.825 | 1 | 48.825 | 13.47 |
| Due to $b_4$: $b_4 \sum x_4 y = ( \sum x_4 y )^2 / \sum x_4^2$ | 142.50 | 1 | 142.50 | 39.32 |
| Due to $b_6$: $b_6 \sum x_6 y = ( \sum x_6 y )^2 / \sum x_6^2$ | 115.83 | 1 | 115.83 | 31.96 |
| Residual | 217.47 | 60 | 3.624 | |

$s_1 = 1.9038$

$\sum y = 711.9; \sum x_1 y = 55.9; \sum x_4 y = 95.5; \sum x_6 y = 86.1$

etc... this table also exists for variable 4 and variable 6
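The entries in the table above can be rebuilt from the quoted sums. The sketch below only checks the arithmetic: it assumes the 64 runs use -1/+1 coding (so each $\sum x_i^2 = 64$) and takes the total SS of 8,443.41 as given. Variable names are illustrative, not from the book.

```python
# Rebuild the ANOVA table for response 1 from the sums quoted in the notes.
# Assumption: 64 runs with -1/+1 coded inputs, so sum(x_i^2) = 64 for each input.
n = 64
total_ss = 8443.41                      # sum of y^2, copied from the table
sum_y = 711.9
sum_xy = {"b1": 55.9, "b4": 95.5, "b6": 86.1}

correction = sum_y ** 2 / n             # SS due to b0
corrected_total = total_ss - correction

# SS due to each coefficient: (sum x_i y)^2 / sum x_i^2, with 1 df each.
ss_due = {name: s ** 2 / n for name, s in sum_xy.items()}

residual_ss = corrected_total - sum(ss_due.values())
residual_df = n - 1 - len(ss_due)       # 63 - 3 = 60
residual_ms = residual_ss / residual_df
s1 = residual_ms ** 0.5                 # estimate of sigma for this response

f_ratios = {name: ss / residual_ms for name, ss in ss_due.items()}

print(f"correction factor  {correction:10.2f}")
print(f"corrected total SS {corrected_total:10.2f}")
for name in ss_due:
    print(f"due to {name}          {ss_due[name]:10.3f}   F = {f_ratios[name]:.2f}")
print(f"residual           {residual_ss:10.2f}   MS = {residual_ms:.3f}   s = {s1:.4f}")
```

The same few lines would produce the tables for the other responses once their sums are substituted.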

Potential bias in standard deviation estimates:

  • biased upward because of several small main effects and interactions that are being ignored
  • biased downward because of effect of selection (only large estimates taken to be real effects)

these s values were used to estimate standard errors of coefficients shown in parentheses beneath coefficients

Diagnostic Checking of Fitted Models

Plots of residuals vs. $\hat{y}_1, \hat{y}_2, \hat{y}_3$

Plots of residuals vs. Time Order

Don't understand exactly what they're getting from these plots... or what "NSCORES" are... or what "Time Order" is

Response Surface Analysis

Application of model to manufacturing: want to restrict one variable to 20, another variable to 26... and maximize strength

Response surface: looking at three-dimensional cube... these two constraints create two planes

The two planes intersect and create a line PQ, and along this line the strength varies from 11.08 (Q) to 12.46 (P)

Estimated difference in strengths at P and Q given by:

$$\begin{array}{rcl} \hat{y}_P - \hat{y}_Q &=& b_1 ( x_{1P} - x_{1Q} ) + b_4 ( x_{4P} - x_{4Q} ) + b_6 ( x_{6P} - x_{6Q} ) \\ &=& 12.46 - 11.08 \\ &=& 1.38 \end{array}$$

and the variance is given by:

$$\begin{array}{rcl} V(\hat{y}_P - \hat{y}_Q) &=& \left[ (x_{1P} - x_{1Q})^2 + (x_{4P} - x_{4Q})^2 + (x_{6P} - x_{6Q})^2 \right] V(b_i) \\ &=& \overline{PQ}^2 V(b_i) \end{array}$$

where $V(b_i) = \frac{ \sigma^2 }{n}$, and $\overline{PQ}^2$ is the squared distance between the points P and Q in the scale of the x's

So the standard deviation of $\hat{y}_P - \hat{y}_Q = 1.38$ is $\overline{PQ} \frac{ \sigma }{ n^{\frac{1}{2}} }$

when this is evaluated, it is:

$$\left[ (x_{1P} - x_{1Q})^2 + (x_{4P} - x_{4Q})^2 + (x_{6P} - x_{6Q})^2 \right]^{\frac{1}{2}} \frac{\sigma}{ 64^{\frac{1}{2}} } = 0.2809 \sigma$$

Substituting the value of $s_1, s_2, s_3$ for $\sigma$ gives a standard error

For variable 1, substituting $s_1 = 1.9038$ for $\sigma$ gives a standard error of 0.53

This means that the difference $\hat{y}_P - \hat{y}_Q = 1.38$ is 2.6 times larger than its standard error, meaning we can be confident the strength is in fact higher at P than at Q
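As a quick check of the arithmetic in this comparison, the sketch below plugs in the numbers quoted in these notes (the factor 0.2809 and the residual standard deviation 1.9038 for response 1). Variable names are illustrative.

```python
# Standard error of the predicted difference between points P and Q.
diff = 12.46 - 11.08        # y_hat_P - y_hat_Q
factor = 0.2809             # PQ / sqrt(64), as evaluated in the notes
s1 = 1.9038                 # residual standard deviation for response 1

pq = factor * 64 ** 0.5     # implied distance between P and Q in coded units
std_err = factor * s1       # s1 substituted for sigma
ratio = diff / std_err

print(f"PQ = {pq:.3f}")
print(f"difference = {diff:.2f}, standard error = {std_err:.2f}, ratio = {ratio:.1f}")
```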

Appendix 4A: Yates' Method for Obtaining Factorial Effects

This has got to be the worst description of a mathematical technique, ever.


Chapter 5: Blocking and Fractionating Factorial Designs (skipping...)

Chapter 6: Use of Steepest Ascent for Process Improvement

Expensive and impractical to explore entire operability region (i.e. entire region in which the system could be operated)

But this should not be the objective

Instead, explore subregion of interest

For new/poorly understood systems, need to apply a preliminary procedure to find these subregions of interest where a particular model form (e.g. 2nd order polynomial) will apply

One method: one factor at a time method

Alternative method: steepest ascent method (Box: this is more effective, economical)

Steepest Ascent Method

Example: chemical system whose yield depends on time, temperature, concentration

Early stage of investigation: planar contours of first-degree equation can be expected to provide fair approximation in immediate region of point P far from optimum

Direction at right angles to contour planes is in direction of steepest ascent, if pointing toward higher yield values

Exploratory runs performed along path of steepest ascent

Best point found, or interpolated estimated maximum point on path, could be made base for new first-order design, from which further advance might be possible

After one or two applications of steepest ascent, first-order effects will no longer dominate, first order approximation will be inadequate

Second order methods (Chapter 7, Chapter 9) will then have to be applied
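A minimal sketch of how the exploratory runs along the path could be laid out, assuming a first-degree equation $\hat{y} = b_0 + \sum b_i x_i$ has already been fitted in coded units. The coefficients, step size, and function name below are hypothetical, not taken from the book.

```python
import math

def steepest_ascent_path(coeffs, step, n_steps):
    """Points along the steepest-ascent direction, starting from the design center.

    coeffs: first-order coefficients b_1..b_k of the fitted plane (coded units).
    step:   distance between successive exploratory runs (coded units).
    """
    norm = math.sqrt(sum(b * b for b in coeffs))
    direction = [b / norm for b in coeffs]   # unit vector at right angles to the contours
    return [[m * step * d for d in direction] for m in range(1, n_steps + 1)]

# Hypothetical coefficients for time, temperature, concentration.
b0, b = 60.0, [2.0, 4.0, -1.0]
for point in steepest_ascent_path(b, step=0.5, n_steps=4):
    y_hat = b0 + sum(bi * xi for bi, xi in zip(b, point))
    print([round(x, 3) for x in point], round(y_hat, 2))
```

The predicted values rise steadily only because the plane is assumed to hold everywhere; in practice the observed yields along the path show where the first-order approximation stops being adequate.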


Chapter 7: Fitting Second-Order Models

Chapter 8: Adequacy of Estimation and the Use of Transformation

Chapter 9: Exploration of Maxima and Ridge Systems with Second-Order Response Surfaces

At first glance this chapter just appears to be a re-hash of the earlier chapter on ridge systems and optimization.

However, on second glance, section 2 discusses a composite design used to construct second-order response surface for a polymer elasticity.

9.2 Example: Polymer Elasticity

Illustrating example to elucidate nature of maximal region for polymer elasticity experiment

Central Composite Design

The design employed was second order central composite design

Such design consists of two-level factorial (or fractional factorial), chosen to allow estimation of all first-order and two-factor interaction terms

This is augmented with additional points to estimate pure quadratic effects

These designs are discussed in more detail in Chapter 15

Using standard factorial coding, 3 variable values converted to -1/+1

First, determine the low/high levels of variable $\phi$:

$\phi_{low}$

$\phi_{high}$

Next, determine the midlevel:

$$\phi_{mid} = \frac{ \left( \phi_{high} + \phi_{low} \right) }{ 2 }$$

And last, semirange:

$$\phi_{semi} = \phi_{high} - \phi_{mid} = \phi_{mid} - \phi_{low}$$

So that the "standard factorial coding" is:

$$\psi = \frac{ \phi - \phi_{mid} }{ \phi_{semi} }$$
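The coding steps above can be sketched as a small function. The temperature levels are made up for illustration; only the midlevel/semirange recipe comes from the notes.

```python
def code_level(phi, phi_low, phi_high):
    """Convert a natural level phi to the coded scale psi."""
    phi_mid = (phi_high + phi_low) / 2      # midlevel
    phi_semi = phi_high - phi_mid           # semirange
    return (phi - phi_mid) / phi_semi

# Hypothetical temperature levels in degrees C (not from the book).
low, high = 150.0, 170.0
for temp in (150.0, 160.0, 170.0, 140.0, 180.0):
    print(temp, code_level(temp, low, high))
```

Levels outside the low/high range code to values beyond -1/+1, which is how points at -2 and +2 arise on the same scale.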

First set of runs: factorial design, coded factorial variable values were -1 and +1

Second set of runs: three-dimensional "star", coded factorial variable values were -2, 0, and +2

Block difference: 1+ week between first and second set of runs, allowing much time for systematic differences

Experiment was run in two blocks of eight runs (Chapter 5 terminology)

Estimation/Elimination of Block Differences

If all 16 runs performed under conditions of first block:

Constant term in second degree polynomial could be written $\beta_0 - \delta$

If all 16 runs performed under conditions of second block:

Constant term in second degree polynomial could be written $\beta_0 + \delta$

True mean difference between blocks is $2\delta$

This makes the model:

$$y = g ( \mathbf{x}, \boldsymbol{\beta} ) + x_B \delta + \epsilon$$

where the variable $x_B$ is a blocking variable

This variable is -1 for the first block, +1 for the second block

Next (this is where he loses me) the blocking (indicator) variable $x_B$ is orthogonal to each column in $\mathbf{Z}_1$

Then it follows from Section 3.9 (uh, what?) that:

$$\hat{\delta} = \frac{ \sum_{j=1}^{16} x_{Bj} y_j }{ \sum_{j=1}^{16} x_{Bj}^{2} }$$

He then says, that you can separate out the "blocks" contribution from the residual sum of squares

He then loses me again, with this "blocks" contribution with 1 degree of freedom, that is given by:

$$16 \hat{ \delta }^2 = 26.63$$

And then references a page that doesn't seem to talk about anything related (p. 513)........

He then presents this table:

| Source | SS | degrees of freedom | MS |
|---|---|---|---|
| Blocks | 26.6 | 1 | 26.6 (F = 8.9) |
| Residual after removal of blocks | 15.0 | 5 | 3.0 |
| Residual before removal of blocks | 41.6 | 6 | |

Because blocking is orthogonal, it does not change the estimated coefficients in the model

But the portion of the original residual sum of squares accounted for by the systematic block difference is removed, and that increases the experimental accuracy

It is possible to analyze this residual variance further and isolate measures of adequacy of fit
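A numerical sketch may help with the orthogonality step. It builds the composite design described above in coded units (factorial at -1/+1, star at -2/+2) and checks that the blocking column is orthogonal to every second-order column, so that the block estimate is a simple ratio of sums and the other coefficients do not move. Placing both center points in the second block is an assumption, and the responses are synthetic, not the book's data.

```python
import numpy as np
from itertools import product

# Composite design in coded units: 2^3 factorial (first block), then six star
# points plus two center points (second block).
# Assumption: both center points are run in the second block.
factorial = np.array(list(product([-1.0, 1.0], repeat=3)))
star = np.vstack([2.0 * np.eye(3), -2.0 * np.eye(3)])
center = np.zeros((2, 3))
X = np.vstack([factorial, star, center])
x_B = np.array([-1.0] * 8 + [1.0] * 8)      # blocking variable

x1, x2, x3 = X.T
Z1 = np.column_stack([np.ones(16), x1, x2, x3,
                      x1**2, x2**2, x3**2, x1*x2, x1*x3, x2*x3])

# Orthogonality: every column of Z1 has zero inner product with x_B.
print(Z1.T @ x_B)

# Synthetic responses, made up for illustration.
rng = np.random.default_rng(0)
y = (50 + X @ np.array([1.0, 2.0, -1.0]) - (X**2).sum(axis=1)
     + 1.5 * x_B + rng.normal(0.0, 1.0, 16))

# Because of orthogonality, delta_hat is a simple ratio of sums.
delta_hat = (x_B @ y) / (x_B @ x_B)
blocks_ss = 16 * delta_hat**2

def fit(Z):
    """Least-squares coefficients and residual sum of squares."""
    coef, *_ = np.linalg.lstsq(Z, y, rcond=None)
    return coef, float(np.sum((y - Z @ coef) ** 2))

b_without, rss_before = fit(Z1)
b_with, rss_after = fit(np.column_stack([Z1, x_B]))

print(np.allclose(b_with[:10], b_without))            # coefficients unchanged
print(np.isclose(rss_before - rss_after, blocks_ss))  # block SS comes out of the residual
```

With the numbers in the table, the same bookkeeping gives 41.6 - 26.6 = 15.0 on 5 degrees of freedom, a mean square of 3.0, and F = 26.6 / 3.0, about 8.9.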

Importance of Blocking

Blocking is important!!!

Once the $2^3$ factorial design is completed, there are a couple of different ways to proceed

Had first-order effects been large compared with their standard errors, and large compared with estimated interaction terms, then application of steepest ascent would be appropriate (no second-order model)

Applying steepest ascent would (likely) lead to a maximum in a different location

Sequential Assembly of Designs

Second part of design added with knowledge that the second degree polynomial equation could now be estimated

A change in level could have occurred between two blocks of runs

Possibility of sequential assembly of different kinds of designs: discussed in more detail in Section 15.3

Examination of Fitted Surface

Location of maximum of fitted surface

Investigation of Adequacy of Fit

Isolating residual degrees of freedom for composite design

Before accepting fitted second degree equation, one must consider lack of fit

N observations fitted to linear model with p parameters:

Fitting process itself will introduce p linear relationships among the N residuals $y - \hat{y}$

If N is large with respect to p:

  • the effect of induced dependence among residuals will be slight
  • Plotting techniques employed to examine residuals useful in revealing inadequacies in model

As p becomes larger, and as it approaches N:

  • patterns caused by induced dependencies become dominant, and can mask those due to model inadequacies
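The induced dependence can be seen numerically. In the sketch below the model, data, and sizes are all made up; the point is only that the normal equations force $Z'(y - \hat{y}) = 0$, which is p linear relationships among the N residuals, leaving N - p residual degrees of freedom.

```python
import numpy as np

rng = np.random.default_rng(1)
N, p = 12, 3
x = np.linspace(-1.0, 1.0, N)
Z = np.column_stack([np.ones(N), x, x**2])          # hypothetical quadratic model, p = 3
y = 2 + x - 0.5 * x**2 + rng.normal(0.0, 0.2, N)    # synthetic observations

b, *_ = np.linalg.lstsq(Z, y, rcond=None)
residuals = y - Z @ b

# The p relationships: each column of Z is orthogonal to the residual vector.
print(Z.T @ residuals)

# Equivalently, residuals = (I - H) y with H the hat matrix;
# trace(I - H) = N - p counts the residual degrees of freedom.
H = Z @ np.linalg.solve(Z.T @ Z, Z.T)
print(np.trace(np.eye(N) - H))
```

Raising p toward N in this sketch shrinks the trace toward zero, which is the situation where induced patterns dominate the residual plots.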

Section 7.4: Using factorial design, possible to obtain information on adequacy of fit by isolating and identifying individual residual degrees of freedom associated with feared model inadequacies

For fitting a polynomial of degree n, it is important to consider possibility that polynomial of higher degree is needed

This focuses attention on characteristics of estimates when the feared alternative model applies, but the simpler assumed model has been fitted

Contemplation of fitted model embedded in more complex one makes it possible to answer two questions:

1. to what extent are original estimates of coefficients biased if the more complex model is true?

2. what are appropriate checking functions to warn of the possible need for a more complex model?

both questions critically affected by choice of design

Thoughts: need for adequate surrogate models is desperate... If we don't have good surrogate models, all of the hard computational work goes to waste.

The investigation of model fitness, experimental design, and statistical analysis of the results is just as important as development of the model itself.

ESPECIALLY in the case of moving toward predictive science, questions (1) and (2) above are CRITICAL!

Bias Characteristics of the Design

Can write extended third-order polynomial model in form:

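$$\mathbf{y} = \mathbf{Z}_1 \boldsymbol{\beta}_1 + \mathbf{Z}_2 \boldsymbol{\beta}_2 + \boldsymbol{\epsilon}$$

Or in orthogonalized form,

$$\mathbf{y} = \mathbf{Z}_1 ( \boldsymbol{\beta}_1 + \mathbf{A} \boldsymbol{\beta}_2 ) + ( \mathbf{Z}_2 - \mathbf{Z}_1 \mathbf{A} ) \boldsymbol{\beta}_2 + \boldsymbol{\epsilon}$$

where $\mathbf{Z}_1 \boldsymbol{\beta}_1$ includes all terms up to and including second order, and $\mathbf{Z}_2 \boldsymbol{\beta}_2$ has all terms of third order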

Alias or bias matrix A:

$$\mathbf{A} = \left( \mathbf{Z}_1^{\prime} \mathbf{Z}_1 \right)^{-1} \mathbf{Z}_1^{\prime} \mathbf{Z}_2$$

It shows that only the estimates of first order terms are biased by third-order terms

If a third-order model is appropriate, and if $b_1$, $b_2$, and $b_3$ are previous least-squares estimates, then

$$E(b_1) = \beta_1 + 2.5 \beta_{111} + 0.5 \beta_{122} + 0.5 \beta_{133}$$

$$E(b_2) = \beta_2 + 2.5 \beta_{222} + 0.5 \beta_{112} + 0.5 \beta_{233}$$

$$E(b_3) = \beta_3 + 2.5 \beta_{333} + 0.5 \beta_{113} + 0.5 \beta_{223}$$
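The 2.5 and 0.5 multipliers can be reproduced by computing the alias matrix directly. The sketch below assumes the 16-run composite design described earlier in these notes (factorial at -1/+1, star at -2/+2, two center points); the column ordering and names are my own.

```python
import numpy as np
from itertools import product

# Assumed design: 2^3 factorial, six star points at +/-2, two center points.
X = np.vstack([np.array(list(product([-1.0, 1.0], repeat=3))),
               2.0 * np.eye(3), -2.0 * np.eye(3), np.zeros((2, 3))])
x1, x2, x3 = X.T

# Z1: terms up to second order. Z2: all ten third-order terms.
z1_names = ["0", "1", "2", "3", "11", "22", "33", "12", "13", "23"]
Z1 = np.column_stack([np.ones(16), x1, x2, x3,
                      x1**2, x2**2, x3**2, x1*x2, x1*x3, x2*x3])
z2_names = ["111", "222", "333", "122", "133", "112", "233", "113", "223", "123"]
Z2 = np.column_stack([x1**3, x2**3, x3**3, x1*x2**2, x1*x3**2,
                      x1**2*x2, x2*x3**2, x1**2*x3, x2**2*x3, x1*x2*x3])

# Alias (bias) matrix A = (Z1'Z1)^{-1} Z1'Z2.
A = np.linalg.solve(Z1.T @ Z1, Z1.T @ Z2)

# Print only the second-order-model coefficients that pick up any bias.
for name, row in zip(z1_names, A):
    terms = [f"{a:.1f} beta{z}" for a, z in zip(row, z2_names) if abs(a) > 1e-9]
    if terms:
        print(f"E(b{name}) = beta{name} + " + " + ".join(terms))
```

Only the rows for the three first-order coefficients come out non-zero, in line with the statement that only first-order estimates are biased by third-order terms.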

Checking Functions for the Design

Examination of matrix Z2 - Z1 A reveals rather remarkable circumstance

of the 10 columns, only 4 are independent; these 4 are simple multiples of 4 columns of Z2*

(some analysis I don't quite follow...)

Thus, although we cannot obtain estimates of each of the third-order effects individually, using this design we can isolate certain linear combinations of them (certain alias groups)

size of these combinations can indicate particular directions in which there may be lack of fit
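For the step I could not follow, a numerical check at least confirms the claim about the columns. The sketch assumes the same 16-run composite design (factorial at -1/+1, star at -2/+2, two center points) and my own column ordering; it computes $Z_2 - Z_1 A$ and its rank.

```python
import numpy as np
from itertools import product

# Assumed design: 2^3 factorial, six star points at +/-2, two center points.
X = np.vstack([np.array(list(product([-1.0, 1.0], repeat=3))),
               2.0 * np.eye(3), -2.0 * np.eye(3), np.zeros((2, 3))])
x1, x2, x3 = X.T

Z1 = np.column_stack([np.ones(16), x1, x2, x3,
                      x1**2, x2**2, x3**2, x1*x2, x1*x3, x2*x3])
# Third-order columns, ordered 111, 222, 333, 122, 133, 112, 233, 113, 223, 123.
Z2 = np.column_stack([x1**3, x2**3, x3**3, x1*x2**2, x1*x3**2,
                      x1**2*x2, x2*x3**2, x1**2*x3, x2**2*x3, x1*x2*x3])

A = np.linalg.solve(Z1.T @ Z1, Z1.T @ Z2)
Z2_adj = Z2 - Z1 @ A        # third-order columns with the part explained by Z1 removed

# Number of linearly independent columns among the ten.
print(np.linalg.matrix_rank(Z2_adj, tol=1e-8))

# Within one alias group the columns are simple multiples of each other,
# e.g. the adjusted 122 column against the adjusted 111 column.
print(np.allclose(Z2_adj[:, 3], -Z2_adj[:, 0] / 3))
```

So each variable contributes one estimable combination of its third-order terms (the 111, 122, 133 group for $x_1$, and so on), plus the 123 column, for four in total.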

example: if $l_{jjj}$ (linear combinations of $j^{th}$ observation, I think?) were excessively large, it could indicate that a transformation of $x_j$ might be needed to obtain adequate representation using second degree equation

Transformation aspect: discussed in more detail in Section 13.8

Complete Breakup of Residual Sum of Squares

16 runs in composite design used

10 degrees of freedom = estimation of second degree polynomial

1 degree of freedom = blocking

1 degree of freedom = pure error comparison in which two center points are compared

4 degrees of freedom remain

  • can be associated with possible lack of fit from neglected third order terms
  • alternatively, with need for transformation of variables

| Source | SS | degrees of freedom | MS |
|---|---|---|---|
| Blocks | 26.6 | 1 | 26.6 |
| 111 | 7.2 | 1 | 7.2 |
| 222 | 2.6 | 1 | 2.6 |
| 333 | 1.6 | 1 | 1.6 |
| 123 | 1.2 | 1 | 1.2 |
| Pure error | 2.3 | 1 | 2.3 |
| Residual sum of squares | 41.6 | 6 | |

Since none of the lack-of-fit mean squares is excessively large compared with the others, and they do not contradict the earlier supposition that $\sigma = 2$ (or $\sigma^2 = 4$), there is no reason to suspect lack of fit
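A short check of the bookkeeping, using only numbers copied from the table above. Pooling the four lack-of-fit rows into one mean square is my own summary, not something the notes attribute to the book.

```python
# Entries copied from the table (SS equals MS, since each row has 1 df).
rows = {"Blocks": 26.6, "111": 7.2, "222": 2.6, "333": 1.6, "123": 1.2, "Pure error": 2.3}
lack_of_fit = ["111", "222", "333", "123"]

# Degrees-of-freedom bookkeeping for the 16 runs.
df_model, df_blocks, df_pure_error = 10, 1, 1
df_lack_of_fit = 16 - df_model - df_blocks - df_pure_error

# Pooled lack-of-fit mean square, compared with the supposed sigma^2 = 4.
lof_ms = sum(rows[k] for k in lack_of_fit) / df_lack_of_fit
sigma2 = 4.0
print(df_lack_of_fit, round(lof_ms, 2), round(lof_ms / sigma2, 2))

# The rows add to 41.5; the table's 41.6 presumably differs by rounding.
print(round(sum(rows.values()), 1))
```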


Chapter 10: Occurrence and Elucidation of Ridge Systems I

Unusual ridge shapes arise because factors like temperature, time, pressure, and concentration are treated as "natural" variables simply because they can be conveniently manipulated and measured

Individual fundamental variables (e.g. collision of two types of molecules) are often a function of several of these natural variables

This is why you may see multiple min/max or optimal levels of a fundamental variable

Example: measuring an observable that is a function of voltage, but all you can measure is current and resistance (presuming Ohm's law existence unknown)

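This leads to a ridge system: every combination of current and resistance that gives the same voltage gives the same response, so the ridge follows the curve along which the voltage stays at its best value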

Elucidation of Stationary Regions/Maxima/Minima by Canonical Analysis

Canonical analysis: writing second degree equation in form in which it can be more readily understood

involves elimination of all cross-product terms
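A minimal sketch of the mechanics, assuming the fitted equation is written as $\hat{y} = b_0 + \mathbf{x}'\mathbf{b} + \mathbf{x}'\mathbf{B}\mathbf{x}$ with the cross-product coefficients split in half across the off-diagonal of $\mathbf{B}$. The coefficients are hypothetical, not one of the book's examples.

```python
import numpy as np

# Hypothetical fitted second-degree equation in two coded variables:
#   y_hat = b0 + b1*x1 + b2*x2 + b11*x1^2 + b22*x2^2 + b12*x1*x2
b0 = 80.0
b = np.array([1.0, 2.0])                 # first-order coefficients
B = np.array([[-2.0, 0.5],               # [[b11,   b12/2],
              [0.5, -1.0]])              #  [b12/2, b22  ]]

x_s = -0.5 * np.linalg.solve(B, b)       # stationary point: gradient b + 2 B x = 0
y_s = b0 + 0.5 * b @ x_s                 # fitted response at the stationary point
lam, M = np.linalg.eigh(B)               # eigenvalues, orthonormal eigenvectors (columns)

def y_hat(x):
    return b0 + b @ x + x @ B @ x

def y_canonical(x):
    X = M.T @ (x - x_s)                  # rotated, recentered coordinates
    return y_s + lam @ X**2              # no cross-product terms remain

x = np.array([0.3, -0.7])
print(x_s, y_s, lam)
print(y_hat(x), y_canonical(x))          # the two forms agree
```

The signs and relative sizes of the eigenvalues then describe the surface: all negative indicates a maximum, mixed signs a saddle, and an eigenvalue near zero a ridge along the corresponding axis.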

Examples

(several examples and forms of canonical analysis given)

Appendix 10A: Simple explanation of canonical analysis

(Geometrical explanation of canonical analysis)


Chapter 11: Occurrence and Elucidation of Ridge Systems II

One of the most important uses of response surface techniques: detection, description, exploitation of ridge systems

Examples

Stationary ridge

Rising ridge

Canonical Analysis to Characterize Ridge Phenomena

Example: Consecutive Chemical System with Near Stationary Planar Ridge

(example given: Box and Youle, 1955)

Chemical system

Transformation of variables

Canonical analysis

Direct fitting of canonical form

Exploiting canonical form

Example: Small reactor study yielding rising ridge surface

Example: Stationary ridge in five variables

Economic importance of dimensionality of maxima/minima

Method for obtaining desirable combination of several responses

Appendix 11A: Calculations for ANOVA

Appendix 11B: Ridge analysis (alternative to canonical analysis)

Chapter 12: Links Between Empirical and Theoretical Models

Chapter 13: Design Aspects of Variance, Bias, and Lack of Fit

response y is measured, with a mean value $E(y) = \eta$, believed to depend on set of variables $\boldsymbol{\xi} = \xi_1, \xi_2, \dots, \xi_k$

Exact functional relationship is

$$E(y) = \eta = \eta(\boldsymbol{\xi})$$

and is usually unknown/unknowable

Flight of bird, fall of leaf, flow of water through valve: even using such equations, we are likely to be able to approximate only the main features of the relationship

This book: employ crude polynomial approximations, exploiting local smoothness properties

Adequate LOCALLY (flight of bird can be approximated by straight line function of time for short times, maybe quadratic function at long times)

Low order terms of Taylor series approximation can be used over region of interest $R(\boldsymbol{\xi})$

This lies within larger region of operability $O(\boldsymbol{\xi})$

if $f(\boldsymbol{\xi})$ is the polynomial approximation,

$$E(y) = \eta(\boldsymbol{\xi}) \approx f(\boldsymbol{\xi})$$

"The fact that the polynomial is an approximation does not necessarily detract from its usefulness because all models are approximations. Essentially, all models are wrong, but some are useful. However, the approximate nature of the model must always be borne in mind."

Suppose the following:

$\boldsymbol{\epsilon} = \epsilon_1, \epsilon_2, \dots, \epsilon_n$ is vector of random errors with zero vector mean,

$\mathbf{y} = y_1, y_2, \dots, y_n$

$\mathbf{f}(\boldsymbol{\xi}) = f(\boldsymbol{\xi}_1), f(\boldsymbol{\xi}_2), \dots, f(\boldsymbol{\xi}_n)$ (where the n different $\boldsymbol{\xi}_u$ are n observations of $\boldsymbol{\xi}$), the true model is not:

$$\mathbf{y} = \mathbf{f}(\boldsymbol{\xi}) + \boldsymbol{\epsilon}$$

but is actually

$$\begin{array}{rcl} \mathbf{y} &=& \boldsymbol{\eta}(\boldsymbol{\xi}) + \boldsymbol{\epsilon} \\ &=& \mathbf{f}(\boldsymbol{\xi}) + \boldsymbol{\delta}(\boldsymbol{\xi}) + \boldsymbol{\epsilon} \end{array}$$

where

$$\boldsymbol{\delta}(\boldsymbol{\xi}) = \boldsymbol{\eta}(\boldsymbol{\xi}) - \mathbf{f}(\boldsymbol{\xi})$$

is a vector discrepancy that should be small over the region of interest

There are TWO types of errors that must be taken into account:

1. Systematic, or bias, errors $\delta(\xi) = \eta(\xi) - f(\xi)$, which is the difference between the expected value of the response $E(y) = \eta(\xi)$ and its approximating function $f(\xi)$

2. Random errors $\epsilon$

Systematic errors are always to be expected

Since the time of Gauss, they have largely been ignored, and most attention has been focused on random error

(Nice mathematical results are possible when this is done)

In choosing a design, ignoring systematic error is not an innocuous approximation, and may lead to misleading results
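A small sketch of the two error types, assuming a made-up "true" response with curvature and a straight-line approximating function. Here $f$ is taken to be the least-squares line through the noise-free means at three design points, which is one possible choice and not necessarily the book's.

```python
import numpy as np

# Hypothetical true response (unknown in practice).
def eta(x):
    return 1.0 + x + 0.5 * x**2

x = np.array([-1.0, 0.0, 1.0])               # design points in coded units
Z = np.column_stack([np.ones_like(x), x])    # straight-line model

# Fitting the line to the noise-free means isolates the systematic part.
coef, *_ = np.linalg.lstsq(Z, eta(x), rcond=None)
f = Z @ coef
delta = eta(x) - f                           # bias error at each design point

rng = np.random.default_rng(2)
epsilon = rng.normal(0.0, 0.1, size=x.size)  # random error, sigma = 0.1 (made up)
y = eta(x) + epsilon                         # observation = f + delta + epsilon

print("bias errors:  ", delta)
print("random errors:", epsilon)
```

The bias errors do not shrink with replication at the same points, whereas the average random error does; this is why the design choice has to account for both.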

Competing effects of bias and variance

Example of an interval with an unknown function $E(y) = \eta(\xi)$, and looking at a plot of $\eta$ vs. $\xi$

Region of interest: $\xi_{-} \leq \xi \leq \xi_{+}$

Approximating using straight line:

$$f(\xi) = \alpha + \beta \xi$$

And errors $\epsilon$ in observations have variance $\sigma^2$

Next step is to apply the coding transformation

$$x = \frac{ \xi - \frac{1}{2} \left( \xi_{+} + \xi_{-} \right) }{ \frac{1}{2} \left( \xi_{+} - \xi_{-} \right) }$$

to convert the interval of interest into the interval $(-1, 1)$
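A minimal sketch of the coding step, dividing by the half-range so that the endpoints of the region of interest land at -1 and +1. The function names and the 150 to 200 range are hypothetical, chosen only for illustration.

```python
def code_variable(xi, xi_minus, xi_plus):
    """Map a natural-units value xi in [xi_minus, xi_plus] to coded units in [-1, 1]."""
    center = 0.5 * (xi_plus + xi_minus)
    half_range = 0.5 * (xi_plus - xi_minus)
    return (xi - center) / half_range


def decode_variable(x, xi_minus, xi_plus):
    """Inverse map: coded units back to natural units."""
    center = 0.5 * (xi_plus + xi_minus)
    half_range = 0.5 * (xi_plus - xi_minus)
    return center + half_range * x


# Hypothetical region of interest: 150 <= xi <= 200
coded = [code_variable(xi, 150.0, 200.0) for xi in (150.0, 175.0, 200.0)]
print(coded)
```

The lower limit, midpoint and upper limit of the natural interval map to -1, 0 and +1 respectively.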

Now suppose a least-squares fit is used to find

$$\hat{y}_{x} = a + bx$$

Mean squared error calculation:

The mean squared error of estimating $\eta_x$ with $\hat{y}_x$, standardized by the number of design points N and the error variance $\sigma^2$, is

$$\begin{array}{rcl} M_x &=& \frac{ N E(\hat{y}_x - \eta_x)^2 }{ \sigma^2 } \\ &=& \frac{ N E \left[ \hat{y}_x - E( \hat{y}_x ) + E( \hat{y}_x ) - \eta_x \right]^2 }{ \sigma^2 } \\ &=& \frac{ N V(\hat{y}_x) }{ \sigma^2 } + \frac{ N \left[ E(\hat{y}_x) - \eta_x \right]^2 }{ \sigma^2 } \end{array}$$

(the cross term vanishes because $E\left[ \hat{y}_x - E(\hat{y}_x) \right] = 0$)

Symbolically:

$$M_x = V_x + B_x$$

The standardized mean squared error $M_x$ is equal to the variance $V_x$ plus the squared bias $B_x$ at x
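As a rough numerical illustration of the decomposition, the sketch below evaluates the variance and squared-bias parts for a straight-line fit. The three-point design, the quadratic "true" response and the error variance of 1 are made-up values, not taken from the book, and the function names are hypothetical. It relies on the standard result that the variance of a fitted line at x is sigma^2 (1/N + (x - xbar)^2 / Sxx), and on the fact that least squares is linear in y, so the expected fit equals the fit to the noise-free responses.

```python
def line_fit(xs, ys):
    """Ordinary least-squares intercept a and slope b for y = a + b*x."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    b = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
    return ybar - b * xbar, b


def standardized_mse_parts(x, design, eta, sigma2):
    """Return (V_x, B_x) at coded location x for a straight-line fit."""
    n = len(design)
    xbar = sum(design) / n
    sxx = sum((u - xbar) ** 2 for u in design)
    # Var(yhat_x) = sigma2 * (1/N + (x - xbar)^2 / Sxx), so N * Var / sigma2 is:
    v_x = 1.0 + n * (x - xbar) ** 2 / sxx
    # Least squares is linear in y, so E(yhat_x) is the fit to noise-free responses.
    a, b = line_fit(design, [eta(u) for u in design])
    b_x = n * (a + b * x - eta(x)) ** 2 / sigma2
    return v_x, b_x


def eta(x):
    """Hypothetical true response (quadratic), used only for illustration."""
    return 1.0 + 2.0 * x + 0.5 * x ** 2


design = [-1.0, 0.0, 1.0]          # made-up coded design points
v_x, b_x = standardized_mse_parts(0.5, design, eta, sigma2=1.0)
m_x = v_x + b_x                    # standardized mean squared error at x = 0.5
print(v_x, b_x, m_x)
```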

Integrated Mean Squared Error

Can integrate variance and squared bias over region of interest R:

$$V = \frac{ \int_R V_x dx }{ \int_R dx }$$

$$B = \frac{ \int_R B_x dx }{ \int_R dx }$$

and if integrated mean square error is denoted $M$, then

$$M = V + B$$
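The averaging over R can be sketched numerically. The example below assumes R is the coded interval [-1, 1], a made-up symmetric three-point design centered at zero, and a quadratic true model with hypothetical curvature `beta11`. Under those assumptions the pointwise variance of a straight-line fit is 1 + x^2 / m2 and the pointwise squared bias is N beta11^2 (m2 - x^2)^2 / sigma^2, where m2 is the second design moment. These closed forms are stated here as assumptions of the sketch rather than quoted from the book.

```python
def region_average(f, lo=-1.0, hi=1.0, n=2000):
    """Approximate (integral of f over [lo, hi]) / (hi - lo) by the midpoint rule."""
    h = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * h) for i in range(n)) / n


design = [-1.0, 0.0, 1.0]                    # made-up symmetric design, centered at 0
N = len(design)
m2 = sum(x ** 2 for x in design) / N         # second design moment
beta11, sigma2 = 0.5, 1.0                    # hypothetical curvature and error variance


def v_x(x):
    # Standardized variance of a straight-line fit for a design centered at 0
    return 1.0 + x ** 2 / m2


def b_x(x):
    # Standardized squared bias when the true model is quadratic and m3 = 0
    return N * (beta11 * (m2 - x ** 2)) ** 2 / sigma2


V = region_average(v_x)
B = region_average(b_x)
M = V + B
print(V, B, M)
```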

Regions of Interest, Regions of Operability for k Dimensions

When using standard designs:

Choice of approximation function $f(\xi)$ and selected neighborhood $R(\xi)$ are implicit as soon as the experimenter decides on the type of design, variables to investigate, levels of variables, and transformations to use

Example: a chemist exploring the effect of 2 types of catalyst picks the experimental design factors to be total catalyst weight (sum of the 2 variables) and catalyst weight ratio (ratio of the 2 variables)

Other scientists might have selected different ranges for the variables, or chosen different transformations for the design factors


Such differences would not necessarily have any adverse effect on the end result of the investigation. The iterative strategy we have proposed for the exploration of response surfaces is designed to be adaptive and self-correcting. For example, an inappropriate choice of scales or of a transformation can be corrected as the iteration proceeds.

However, for a given experimental design, a change of scale can have a major influence on:

1. the variance of estimated effects

2. the sizes of systematic errors


Regions of Interest: Weight Functions

The "region of interest" discussion implies that the desired accuracy of prediction is uniform over the region R

Sometimes this is not the case

Might need high accuracy at some point P in predictor variable space, and can tolerate reduced accuracy away from P

e.g. think of a Gaussian weight (peaked at P) vs. a top hat (uniform over R)

Introduce a weight function $w(\boldsymbol{x})$

Minimize a weighted mean squared error integrated over the whole operability region O

$$M = V + B$$

with

$$\begin{array}{rcl} M &=& \frac{N}{\sigma^2} \int_O w(x) E\left[ \hat{y}(x) - \eta(x) \right]^2 dx \\ V &=& \frac{N}{\sigma^2} \int_O w(x) E\left[ \hat{y}(x) - E\hat{y}(x) \right]^2 dx \\ B &=& \frac{N}{\sigma^2} \int_O w(x) \left[ E\hat{y}(x) - \eta(x) \right]^2 dx \end{array}$$

Weight functions should also be normalized, so that

$$\int_O w(x) dx = 1$$
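A small sketch of the normalization, comparing a top-hat and a Gaussian weight in one dimension. The operability region O = [-2, 2], the Gaussian scale of 0.5 and all function names are assumptions made for illustration; the integrals use a simple midpoint rule.

```python
import math


def integrate(f, lo, hi, n=4000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    h = (hi - lo) / n
    return sum(f(lo + (i + 0.5) * h) for i in range(n)) * h


def normalize(w, lo, hi):
    """Return a rescaled copy of w whose integral over [lo, hi] is 1."""
    total = integrate(w, lo, hi)
    return lambda x: w(x) / total


def moment(w, p, lo, hi):
    """p-th moment of the weight function w over [lo, hi]."""
    return integrate(lambda x: w(x) * x ** p, lo, hi)


def top_hat(x):
    """Uniform interest over the coded region [-1, 1], none outside it."""
    return 1.0 if -1.0 <= x <= 1.0 else 0.0


def gaussian(x, scale=0.5):
    """Interest concentrated near the center point and decaying away from it."""
    return math.exp(-0.5 * (x / scale) ** 2)


O_LO, O_HI = -2.0, 2.0   # hypothetical operability region
w_hat = normalize(top_hat, O_LO, O_HI)
w_gauss = normalize(gaussian, O_LO, O_HI)
mu2_hat = moment(w_hat, 2, O_LO, O_HI)
mu2_gauss = moment(w_gauss, 2, O_LO, O_HI)
print(mu2_hat, mu2_gauss)
```

The second moment of the Gaussian weight comes out smaller than that of the top hat, reflecting interest that is concentrated closer to the center.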

1-D Weight Function Example

Fitted equation:

$$\hat{y}_x = b_0 + b_1 x$$

True model:

$$\eta_x = \beta_0 + \beta_1 x + \beta_{11} x^2$$

Suppose N runs are made at levels $x_1, x_2, \dots, x_N$, and that $\sum x_u = N \overline{x}$

Define

$$m_p = \frac{ \sum x_u^p }{ N }$$

$$\mu_p = \int_O w(x) x^p dx$$

it can be shown that the integrated mean squared error is

(big expression for M)

Want to minimize M

Can choose $m_3 = 0$ to eliminate one term, and this can be done by making the design symmetric about its center

The only design characteristic that remains is $m_2$, which measures the spread of the sample points (small spread means near-neighbors of the center point are selected, etc.)

"All-Variance" Case: bias term is totally ignored

"All-Bias" Case: variance term is totally ignored

Results from 1-D case: the optimal design spread when V = B is close to that of the all-bias design, and dramatically different from that of the all-variance design

This suggests that, if a simplification is to be made in the design problem, it might be better to ignore the effects of sampling variation rather than those of bias
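The trade-off can be sketched numerically under some simplifying assumptions that are mine rather than the book's: a uniform weight on the coded interval [-1, 1] (so the weight moments are 1/3 and 1/5), a design centered at zero and symmetric (so the first and third design moments vanish), and a curvature parameter `c2` standing for N beta11^2 / sigma^2. Under those assumptions V = 1 + mu2 / m2 and B = c2 [(m2 - mu2)^2 + mu4 - mu2^2]. This is a derivation for the simplified case, not necessarily the form of the expression given in the book, and the function names are hypothetical.

```python
MU2 = 1.0 / 3.0   # second moment of the uniform weight on [-1, 1]
MU4 = 1.0 / 5.0   # fourth moment of the same weight


def variance_part(m2):
    """Integrated standardized variance V of the straight-line fit."""
    return 1.0 + MU2 / m2


def bias_part(m2, c2):
    """Integrated standardized squared bias B, with c2 = N * beta11**2 / sigma**2."""
    return c2 * ((m2 - MU2) ** 2 + MU4 - MU2 ** 2)


def best_m2(c2, iterations=200):
    """Value of m2 minimizing V + B (requires c2 > 0), by bisection on the slope."""
    def slope(m2):
        return -MU2 / m2 ** 2 + 2.0 * c2 * (m2 - MU2)

    lo, hi = MU2, 1.0           # slope(MU2) < 0, so the minimum lies above MU2
    while slope(hi) < 0.0:      # expand until the minimum is bracketed
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if slope(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


for c2 in (1.0, 15.625, 1000.0):
    m2 = best_m2(c2)
    print(c2, round(m2, 4), round(variance_part(m2), 4), round(bias_part(m2, c2), 4))
```

In this sketch B alone is minimized at m2 = mu2, while V alone keeps decreasing as m2 grows, so an all-variance design pushes the points as far out as the region allows. When the two contributions are of similar size, the minimizing m2 stays fairly close to mu2.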

Designs Minimizing Bias

Designs minimizing squared bias are of practical importance

The properties of such designs therefore deserve consideration

Example: fitted polynomial model of degree $d_1$, true model of degree $d_2$

$$\hat{y}(x) = \mathbf{X}_1 \mathbf{b}_1$$

$$\eta(x) = \mathbf{X}_1 \boldsymbol{\beta}_1 + \mathbf{X}_2 \boldsymbol{\beta}_2$$

also,

$$\begin{array}{rcl} \mathbf{M}_{11} &=& N^{-1} \mathbf{X}_1^{\prime} \mathbf{X}_1 \\ \mathbf{M}_{12} &=& N^{-1} \mathbf{X}_1^{\prime} \mathbf{X}_2 \end{array}$$

and

$$\begin{array}{rcl} \mu_{11} &=& \int_O w(x) \mathbf{x}_1 \mathbf{x}_1^{\prime} dx \\ \mu_{12} &=& \int_O w(x) \mathbf{x}_1 \mathbf{x}_2^{\prime} dx \end{array}$$

where $\mathbf{x}_1$ and $\mathbf{x}_2$ are the vectors of model terms, evaluated at x, for the fitted terms and the omitted higher-order terms respectively

Necessary and sufficient condition for squared bias B to be minimized:

$$\mathbf{M}_{11}^{-1} \mathbf{M}_{12} = \mu_{11}^{-1} \mu_{12}$$

Not necessary, but sufficient, condition is

$$\begin{array}{rcl} \mathbf{M}_{11} &=& \mu_{11} \\ \mathbf{M}_{12} &=& \mu_{12} \end{array}$$

Elements of $\mu_{11},\mu_{12}$ are of form:

$$\int_O w(x) x_1^{\alpha_1} x_2^{\alpha_2} \dots x_k^{\alpha_k} dx$$

and these are moments of weight function.

Elements of $\mathbf{M}_{11},\mathbf{M}_{12}$ are of form:

$$N^{-1} \sum_{u=1}^{N} x_{1u}^{\alpha_1} x_{2u}^{\alpha_2} \dots x_{ku}^{\alpha_k}$$

and these are moments of the design points.

All are of order $\alpha = \sum_i \alpha_i$

Thus the sufficient condition above states that up to and including order $d_1 + d_2$ the design moments must equal the weight function moments

Previous section: all-bias design obtained by setting $m_2 = \mu_2$, $m_3 = \mu_3$
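The moment-matching condition can be checked directly in one dimension. The sketch below assumes a uniform weight on the coded interval [-1, 1] and compares design moments with weight moments up to order 3, which corresponds to fitting a straight line when the true model is quadratic. The function names and the candidate designs are illustrative only.

```python
def design_moment(points, p):
    """p-th design moment: the average of x_u ** p over the design points."""
    return sum(x ** p for x in points) / len(points)


def uniform_weight_moment(p):
    """p-th moment of the uniform weight w(x) = 1/2 on [-1, 1]."""
    return 0.0 if p % 2 else 1.0 / (p + 1)


def matches_moments(points, max_order, tol=1e-12):
    """True if design and weight moments agree for orders 1..max_order."""
    return all(
        abs(design_moment(points, p) - uniform_weight_moment(p)) <= tol
        for p in range(1, max_order + 1)
    )


a = 3.0 ** -0.5                      # points at +/- 1/sqrt(3) give m2 = 1/3
all_bias_design = [-a, a]
end_point_design = [-1.0, 1.0]       # points at the ends of the region, m2 = 1

print(matches_moments(all_bias_design, 3))
print(matches_moments(end_point_design, 3))
```

The two-point design at the ends of the interval has the right odd moments but too large a second moment, so it does not satisfy the sufficient condition.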

(Conclusions are...? I don't know exactly. He gives an example for fitting a response function plane to a real function of degree 2 within a spherical region of interest R...)

Detecting Lack of Fit

Two features from a model are desired:

1. Good estimation of $\eta$

2. Detection of model inadequacy

Box and Draper 1959, 1963: consider experimental design strategies that satisfy the first criterion, then narrow down to ones that also satisfy the second criterion

Consider mechanics of making test of goodness of fit using ANOVA

Estimating $p$ parameters

Using observations at $p+f$ distinct points

Results in $f$ degrees of freedom for lack of fit

Repeated observations made at certain points to provide $e$ pure error degrees of freedom

Total number of observations: $N = p + f + e$

ANOVA table:

Source               df    E(MS)
Parameter estimates  p
Lack of fit          f     $\sigma^2 + \Delta^2/f$
Pure error           e     $\sigma^2$
Total                N

$\Delta^2$ = noncentrality parameter

$\sigma^2$ = experimental error variance, or expectation of unbiased pure error mean square

$\sigma^2 + \Delta^2/f$ = expected value of lack of fit mean square

Test of lack of fit: comparison of mean square for lack of fit to mean square for pure error, via $F(f,e)$ test
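The mechanics of the test can be sketched for a straight-line fit (p = 2) with replicated runs. The data below are invented purely to illustrate the bookkeeping, and the function name is hypothetical. The sketch computes only the F ratio; comparing it with a tabulated F(f, e) critical value is left out.

```python
from collections import defaultdict


def lack_of_fit_test(xs, ys):
    """Fit y = a + b*x and return (f, e, F) for the lack-of-fit F(f, e) test."""
    p = 2                                   # intercept and slope
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    b = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
    a = ybar - b * xbar
    residual_ss = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

    # Pure error: variation of replicates about their own group means
    groups = defaultdict(list)
    for x, y in zip(xs, ys):
        groups[x].append(y)
    pure_ss = sum(
        sum((y - sum(g) / len(g)) ** 2 for y in g) for g in groups.values()
    )

    e = n - len(groups)                     # pure error degrees of freedom
    f = len(groups) - p                     # lack-of-fit degrees of freedom
    lack_ss = residual_ss - pure_ss         # lack-of-fit sum of squares
    return f, e, (lack_ss / f) / (pure_ss / e)


# Invented data: duplicate runs at three coded levels, with strong curvature
xs = [-1.0, -1.0, 0.0, 0.0, 1.0, 1.0]
ys = [1.0, 1.2, 2.9, 3.1, 1.1, 0.9]
f, e, F = lack_of_fit_test(xs, ys)
print(f, e, F)
```

Here N = 6 = p + f + e with p = 2, f = 1 and e = 3, and the large F ratio reflects the curvature that a straight line cannot capture.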

Noncentrality parameter takes the general form:

$$\Delta^2 = \sum_{u=1}^{N} \left[ E(\hat{y}_u) - \eta_u \right]^2 = E(S_L) - f \sigma^2$$

where $S_L$ = lack of fit sum of squares

Good detectability of general lack of fit can be obtained by choosing a design that makes $\Delta^2$ large

This can be achieved by putting certain conditions on the experimental design moments

A design of order $d$ provides high detectability for terms of order $d+1$ if:

1. all odd design moments of order $(2d+1)$ or less are zero

2. the following ratio is large:

$$\frac{ \sum_{u=1}^{N} r_u^{2(d+1)} }{ \left[ \sum_{u=1}^{N} r_u^2 \right]^{d+1} }$$

where

$$r_u^2 = x_{1u}^2 + x_{2u}^2 + \dots + x_{ku}^2$$

Example: for a first order design ($d=1$), the ratio $\sum r_u^4 / (\sum r_u^2)^2$ should be large to provide high detectability of quadratic lack of fit; for a second order design ($d=2$), the ratio $\sum r_u^6 / (\sum r_u^2)^3$ should be large to provide high detectability of cubic lack of fit.
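The ratio is easy to evaluate for a candidate design. A minimal sketch, with a hypothetical function name and two illustrative designs in k = 2 coded variables:

```python
def detectability_ratio(points, d):
    """Sum of r_u^(2(d+1)) divided by (sum of r_u^2)^(d+1) for a design."""
    r2 = [sum(c ** 2 for c in pt) for pt in points]     # squared radius of each run
    return sum(r ** (d + 1) for r in r2) / sum(r2) ** (d + 1)


factorial = [(-1, -1), (1, -1), (-1, 1), (1, 1)]        # 2^2 factorial
star = [(-2, 0), (2, 0), (0, -2), (0, 2)]               # axial points at distance 2

ratio_factorial = detectability_ratio(factorial, d=1)
ratio_composite = detectability_ratio(factorial + star, d=1)
print(ratio_factorial, ratio_composite)
```

Center points have zero radius and so leave the ratio unchanged, and rescaling all points by a common factor also leaves it unchanged.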

Example: $k = 2, d_1 = 1, d_2 = 2$

Two example designs are given:

  • one design (A) is sensitive to lack of fit produced by interaction term, completely insensitive to lack of fit produced by quadratic terms
  • one design (B) is sensitive to lack of fit produced by quadratic terms alone, not sensitive to lack of fit due to interaction terms

Detecting Variable Transformability

Construction of designs to detect whether a variable should be transformed to yield a simpler model

Discussion of both first and second order models...

Second Order Models

For example, a function may possess an asymmetrical maximum which, after suitable variable transformation, can be represented by a quadratic function

Parsimonious class of designs of this type: central composite arrangements in which a "cube", consisting of a two-level factorial with coded points $\pm 1$ (or a fraction of resolution $R \geq 5$), is augmented by an added "star" with axial points at coded distance $\alpha$ and by $n_0$ added center points

(note that $n_c$ is number of cube points, $n_s$ is number of star points, $n_{0c}$ number of center points in the cube portion, $n_{0s}$ number of center points in the star portion, and thus $n_0 = n_{0c} + n_{0s}$)

Before accepting utility of fitted equation, need to be reassured on two questions:

1. Is there evidence from data of serious lack of fit?

2. If not, is the change in $\hat{y}$ over the experimental region explored by the design large enough compared with the standard error of $\hat{y}$ to indicate that the response surface is adequately estimated?

ANOVA table: throws light on both questions

  • elements (row) for:
    • mean
    • blocks
    • first order extra
    • second order extra
    • lack of fit $b_{111}$
    • lack of fit $b_{222}$
    • lack of fit $CC$
    • pure error

Main concern: marked lack of fit of second order model

Design of experiment table: factors/levels table

Need for transformation would be associated with appearance of third order terms

Associated with the design table are four possible third-order columns, namely those created by:

$$(x_1^3, x_1 x_2^2); \quad (x_2^3, x_2 x_1^2)$$

These form two sets of two items

Suppose these third-order columns are orthogonalized with respect to the low-order $\mathbf{X}$ vectors (regress them against the 6 columns for $x_1, x_2, x_1^2, \dots$)

Then take residuals to yield columns $x_{111}$ from $x_1^3$, $x_{122}$ from $x_1 x_2^2$, etc.

In vector notation:

$\mathbf{x}_{iii} = -3 \mathbf{x}_{ijj}$

The curvature contrast associated with $\mathbf{x}_{111}$ (which has expectation zero if the assumption of a quadratic model is true) is:

$$ \begin{array}{rcl} c_{31} &=& \frac{1}{36} \mathbf{x}_{111}' \mathbf{y} \\ &=& \frac{1}{3} \left[ \frac{ \overline{y}_{\alpha} - \overline{y}_{-\alpha} }{ 2 \alpha } - \frac{ \overline{y}_{1} - \overline{y}_{-1} }{ 2 } \right] \end{array} $$

In this case, $\alpha$ is the coded level of the axial (star) points, i.e. their distance from the design center (so, for the composite design here, $\alpha = 2$, the value for which $\mathbf{x}_{iii} = -3 \mathbf{x}_{ijj}$ holds).

$\overline{y}_{\alpha}$ is the average of the response $y$ at level $x_1 = \alpha$, etc.

$c_{31}$ is a measure of overall non-quadraticity in the $x_1$ direction

The corresponding measure in the $x_2$ direction is $c_{32} = \mathbf{x}_{222}^{\prime} \mathbf{y} / 36$
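The orthogonalization above can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses a two-factor composite design with the cube replicated twice, one star with $\alpha = 2$, and two center points (a design chosen here because it reproduces the divisor 36; the notes do not state which design was used), it takes the 6 columns to be $1, x_1, x_2, x_1^2, x_2^2, x_1 x_2$, and the responses `y` are synthetic random numbers used only to compare the two forms of $c_{31}$.

```python
import numpy as np

ALPHA = 2.0  # axial distance, as in the notes

# Assumed illustrative design: 2^2 cube replicated twice, one star, two center points.
cube = np.array([[-1, -1], [1, -1], [-1, 1], [1, 1]], dtype=float)
star = np.array([[-ALPHA, 0], [ALPHA, 0], [0, -ALPHA], [0, ALPHA]])
design = np.vstack([cube, cube, star, np.zeros((2, 2))])
x1, x2 = design[:, 0], design[:, 1]

# Six columns of the second-order model: 1, x1, x2, x1^2, x2^2, x1*x2.
X = np.column_stack([np.ones(len(design)), x1, x2, x1**2, x2**2, x1 * x2])


def residual(X, z):
    """Residual of column z after least-squares regression on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, z, rcond=None)
    return z - X @ coef


x111 = residual(X, x1**3)        # orthogonalized x1^3 column
x122 = residual(X, x1 * x2**2)   # orthogonalized x1*x2^2 column

# Synthetic responses (not real data), used only to compare the two formulas.
rng = np.random.default_rng(0)
y = rng.normal(size=len(design))


def mean_at(level):
    """Average response over the runs where x1 equals the given level."""
    return y[np.isclose(x1, level)].mean()


c31_vector = x111 @ y / (x111 @ x111)
c31_means = ((mean_at(ALPHA) - mean_at(-ALPHA)) / (2 * ALPHA)
             - (mean_at(1) - mean_at(-1)) / 2) / 3

print(np.allclose(x111, -3 * x122))       # relation between the two residual columns
print(round(float(x111 @ x111), 6))       # divisor of the contrast
print(np.isclose(c31_vector, c31_means))  # vector form vs. averages form
```

For this assumed design the regression of $x_1^3$ on the lower-order columns only involves $x_1$, so the residual column is a multiple of the $x_1 x_2^2$ residual, and the two expressions for $c_{31}$ agree for any response vector.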

General Formulas

General composite designs contain:

1. a "cube" consisting of a $2^k$ (full factorial) or $2^{k-p}$ (fractional factorial) design, made up of points of type $\pm 1$, of resolution $R \geq 5$, replicated $r_c$ times, leading to the number of points $n_c = r_c 2^{k-p}$

2. a star, that is, $2k$ points $(\pm \alpha, 0, 0, \dots), (0, \pm \alpha, 0, 0, \dots), \dots$ on the predictor variable axes, replicated $r_s$ times, leading to $n_s = 2kr_s$ points (assuming $\alpha \neq 1$)

3. center points, $n_0$ in number, of which $n_{0c}$ are in the cube and $n_{0s}$ in the star
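The three components listed above can be assembled as follows. This is a minimal sketch, assuming a full $2^k$ cube only (no fractional factorial, so $p = 0$); the function name `composite_design` and its argument names are hypothetical and simply mirror the symbols $r_c$, $r_s$, $n_{0c}$, $n_{0s}$ used in the list.

```python
import itertools
import numpy as np


def composite_design(k, alpha, r_c=1, r_s=1, n_0c=0, n_0s=0):
    """Return (cube, star, center) point arrays for a composite design.

    Only the full 2^k factorial cube is handled; fractional cubes are not.
    """
    cube = np.array(list(itertools.product([-1.0, 1.0], repeat=k)))
    cube = np.tile(cube, (r_c, 1))                       # n_c = r_c * 2^k points
    star = np.vstack([alpha * np.eye(k), -alpha * np.eye(k)])
    star = np.tile(star, (r_s, 1))                       # n_s = 2 * k * r_s points
    center = np.zeros((n_0c + n_0s, k))                  # n_0 = n_0c + n_0s points
    return cube, star, center


cube, star, center = composite_design(k=3, alpha=2.0, r_c=1, r_s=1, n_0c=2, n_0s=2)
print(len(cube), len(star), len(center))
```

With $k = 3$ and single replication this gives $n_c = 8$ cube points and $n_s = 6$ star points, matching the formulas in the list.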

Chapter 14: Variance-Optimal Designs

Ignoring bias: the theory that follows rests on the assumption that the graduating polynomial is the response function

This polynomial must be regarded as a local approximation of an unknown response function

Two sources of error: variance error and bias error

Designs which take account of bias tend not to place points at the extremes of region of interest, which is where credibility of approximating function is most strained


Aims in selecting experimental design must be multifaceted

Desirable design properties:

  1. generate satisfactory distribution of information throughout region of interest R
  2. ensure that the fitted value at $x$, $\hat{y}(x)$, be as close as possible to the true value at $x$, $\eta(x)$
  3. give good detectability of lack of fit
  4. allow transformations to be estimated
  5. allow experiments to be performed in blocks
  6. allow designs of increasing order to be built up sequentially
  7. provide internal estimate of error
  8. be insensitive to wild observations and to violation of usual normal theory assumptions
  9. require minimum number of experimental runs
  10. provide simple data patterns that allow ready visual appreciation
  11. ensure simplicity of calculation
  12. behave well when errors occur in the settings of predictor variables, the x's
  13. not require impractically huge number of levels of predictor variables
  14. provide check on constancy of variance assumption

Orthogonal designs

Orthogonality: important design principle (Fisher and Yates)

Rotatability: logical extension of orthogonality
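As a small illustration of orthogonality, the sketch below checks that the first-order model matrix of a $2^3$ factorial has mutually orthogonal columns, i.e. that $\mathbf{X}'\mathbf{X}$ is diagonal. The choice of a $2^3$ factorial is an assumption made here for the example, not something specified in this chapter's notes.

```python
import itertools
import numpy as np

# 2^3 full factorial in coded units, with a leading column of ones for the intercept.
points = np.array(list(itertools.product([-1.0, 1.0], repeat=3)))
X = np.column_stack([np.ones(len(points)), points])

XtX = X.T @ X
# The design is orthogonal for this model when all off-diagonal entries vanish.
is_orthogonal = np.allclose(XtX, np.diag(np.diag(XtX)))

print(XtX)
print(is_orthogonal)
```

When $\mathbf{X}'\mathbf{X}$ is diagonal, each coefficient estimate is obtained independently of the others, which is one reason orthogonal designs are convenient.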

Chapter 15: Practical Choice of a Response Surface Design

The relative importance of each design characteristic below depends on the relevant experimental circumstances listed beneath it

Characteristics of design, each followed by its relevant experimental circumstances:
Allows check of fit
  • size of experimental region
  • smoothness of response function
  • complexity of model
Allows estimations of transformations
  • lack of fit that could be corrected by transformation
Permits sequential assembly
  • ability to perform runs sequentially
  • ability to move in space of the variables
Can be run in blocks
  • homogeneity of experimental materials
  • state of control of the process
Provides independent estimate of error
  • number of runs permissible
  • possibility of large experimental error
  • existence of reliable prior estimate of error
Robustness of distribution of design points
  • possibility of occasional aberrant runs and/or observations
  • nature of error function
Number of runs required
  • cost of making runs
Simplicity of data pattern
  • need to visualize data to motivate model

Sequential assembly

Many examples of designs used sequentially, e.g. using steepest ascent with first-order designs, then finding sufficiently promising region, then creating second-order model inside that region

Illustration: three-phase sequential construction of design

  • I: regular simplex (4 of 8 cube corners) and 2 center points
  • II: complementary simplex (remaining 4 cube corners) and 2 (additional) center points
  • III: six axial points (star)

Phase I: orthogonal first-order design, checks for overall curvature (via contrast of average response of center points with average response on cube)

If first-order effects were large compared with their standard errors, and no serious curvature exhibited, then would explore indicated direction of steepest ascent

If there are doubts about the adequacy of the first-degree polynomial model, then Phase II would be performed

Phases I and II together are still a first-order design, but they give a much better chance of detecting lack of fit, because the additional runs permit estimation of two-factor interactions

If first-order terms dominate, move to more promising region

If first-order effects are small, and two-factor interactions dominate or there is strong evidence of curvature, move to Phase III (second order)
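The three phases can be sketched for $k = 3$ as below. Several details are assumptions made for illustration: the Phase I simplex is taken to be the cube corners with $x_1 x_2 x_3 = +1$ (the other half would serve equally well), the axial distance `ALPHA` is set to 2, and the helper names `model_matrix` and `curvature_contrast` are hypothetical.

```python
import itertools
import numpy as np

ALPHA = 2.0  # axial distance for the star; an assumed value

cube = np.array(list(itertools.product([-1.0, 1.0], repeat=3)))
parity = cube.prod(axis=1)          # sign of x1*x2*x3 at each cube corner
centers = np.zeros((2, 3))

phase_1 = np.vstack([cube[parity > 0], centers])   # simplex + 2 center points
phase_2 = np.vstack([cube[parity < 0], centers])   # complementary simplex + 2 more
phase_3 = np.vstack([ALPHA * np.eye(3), -ALPHA * np.eye(3)])  # six axial points


def model_matrix(design, interactions=False):
    """Columns 1, x1, x2, x3 and, optionally, x1x2, x1x3, x2x3."""
    cols = [np.ones(len(design)), *design.T]
    if interactions:
        cols += [design[:, i] * design[:, j]
                 for i, j in itertools.combinations(range(3), 2)]
    return np.column_stack(cols)


def curvature_contrast(design, y):
    """Average response at the non-center points minus average at the center."""
    at_center = np.all(design == 0, axis=1)
    return y[~at_center].mean() - y[at_center].mean()


# Phase I: the first-order model matrix has orthogonal columns.
X1 = model_matrix(phase_1)
phase_1_orthogonal = np.allclose(X1.T @ X1, np.diag(np.diag(X1.T @ X1)))

# Rank of the model with two-factor interactions, before and after Phase II.
rank_1 = np.linalg.matrix_rank(model_matrix(phase_1, interactions=True))
rank_12 = np.linalg.matrix_rank(
    model_matrix(np.vstack([phase_1, phase_2]), interactions=True))

print(phase_1_orthogonal, rank_1, rank_12)
```

In Phase I alone each two-factor interaction column coincides with a main-effect column (for the half chosen here, $x_1 x_2 = x_3$ on every run), so the interactions cannot be separated; once the complementary simplex is added, all seven columns become estimable.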

Other Factors

Robustness

Minimization of effects of wild observations (i.e. occasional aberrant runs or outlying observations)

Number of Runs

Experimental design should focus experimental effort on whatever is judged important in the context

Suppose that, in addition to estimating the $p$ parameters of the assumed model form, $f \geq 0$ contrasts are needed to check adequacy of fit, $b \geq 0$ further contrasts are needed for blocking, and an estimate of experimental error having $e \geq 0$ degrees of freedom is needed

To obtain independent estimates of all items of interest, require a design with at least $p + f + b + e$ runs

Relative importance of checking fit, blocking, and obtaining error estimate will differ in different situations

Minimum value of N runs will correspondingly differ

Corresponding minimum design will only be adequate if $\sigma^2$ is below a critical value

When $\sigma^2$ is larger, designs larger than the minimum design are needed to obtain estimates of sufficient precision

Even with $\sigma^2$ small, designs where $N > p$ are not necessarily wasteful

This depends on whether the additional degrees of freedom are genuinely used to achieve the experimenter's objectives
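The run-count bound above is simple arithmetic, sketched here with hypothetical helper names. The parameter count for a full second-order polynomial in $k$ variables, $(k+1)(k+2)/2$, is a standard result; the particular values of $f$, $b$ and $e$ in the usage line are made up for illustration.

```python
from math import comb


def second_order_parameters(k):
    """Number of coefficients p in a full second-order polynomial in k variables."""
    return comb(k + 2, 2)  # equals (k + 1)(k + 2) / 2


def minimum_runs(p, f=0, b=0, e=0):
    """Smallest N that allows independent estimates: N >= p + f + b + e."""
    if min(p, f, b, e) < 0:
        raise ValueError("p, f, b and e must all be non-negative")
    return p + f + b + e


p = second_order_parameters(3)
print(p, minimum_runs(p, f=2, b=1, e=3))
```

As the notes say, this is only a lower bound: whether a design of that size is adequate still depends on $\sigma^2$.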

Simple Design Patterns

Question: why use design patterns, instead of randomized designs?

Statistician's task:

1. Inductive criticism

2. Deductive estimation

(2) involves deducing consequences of given assumptions, in the light of the data obtained; this is easily done with randomized designs

(1) involves two questions:

a) what function should be fitted in the first place?

b) how to examine residuals from fitted function to understand deviations from initial model? (relation to the predictor variables, how to find appropriate model modification)

Factorial/composite designs use patterns of experimental points allowing such comparisons to be made

Inductive criticism is enhanced by being able to plot original data and residuals against variable parameter $\xi_1$ for each individual level of $\xi_2$, or vice versa

First Order Designs

two-level factorial (Chapter 4)

fractional factorial (Chapter 5)

use of these in estimating steepest ascent (Chapter 6)

convenient curvature check: obtained by adding center points to factorial points (Section 6.3)

first-order designs can play role of initial building blocks in construction of second-order designs

Plackett-Burman designs: useful first-order designs (Section 5.4); can also be used as initial building blocks for smaller second-order designs (Section 15.5)

Koshal first-order design can be used for "extreme economy of experimentation"

Regular Simplex Designs

Adapted from Box (1952), "Multifactor designs of first order", Biometrika 39, p. 49-57

Simplex in k dimensions: figure formed by joining any k+1 = N points that do not lie in a (k-1)-dimensional space

Example: (k + 1) = 3 points not all on the same straight line (because if they were on a line, that would be in a 1-dimensional space)

Example: (k + 1) = 4 points not all on the same plane (because if they were, that would be in a 2-dimensional space)

Triangle in 2 dimensions

Tetrahedron in 3 dimensions

Regular simplex = all edges are equal
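One way to construct a regular simplex numerically is sketched below. The construction (centering the $k+1$ unit vectors of $(k+1)$-dimensional space and expressing them in an orthonormal basis of the hyperplane they span) is a generic one chosen for this example and is not necessarily the construction given in Box (1952); the function names are hypothetical.

```python
import itertools
import numpy as np


def regular_simplex(k):
    """Return k+1 points in k dimensions whose pairwise distances are all equal."""
    n = k + 1
    # Centered unit vectors of R^(k+1); they lie in the hyperplane of zero coordinate sum.
    centered = np.eye(n) - np.full((n, n), 1.0 / n)
    # The first k right singular vectors form an orthonormal basis of that hyperplane.
    _, _, vt = np.linalg.svd(centered)
    basis = vt[:k].T
    return centered @ basis          # coordinates in k dimensions


def edge_lengths(points):
    """Distances between every pair of points."""
    return np.array([np.linalg.norm(p - q)
                     for p, q in itertools.combinations(points, 2)])


triangle = regular_simplex(2)        # 3 points in the plane
tetrahedron = regular_simplex(3)     # 4 points in space

print(edge_lengths(triangle))
print(edge_lengths(tetrahedron))
```

All edges come out with the same length, and the $k+1$ points do not lie in a $(k-1)$-dimensional space, which is the defining property stated above.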