Appendix 4

Analysis of an incomplete block design

The model for an incomplete block design with C progenies and NB incomplete blocks is such that the expected yield E(ygh) of a tenera offspring Tg (g=1,...,C) of a dura mother Di (i=1,...,A) and a pisifera father Pj (j=1,...,B), which is allotted to a plot in an incomplete block Blh (h=1,...,NB), can be described as the sum of a general constant f , an effect t g of the tenera Tg and an effect d h of the block Blh, hence

E(ygh) = f + t g + d h

for g=1,...,C and h=1,...,NB .

The yield ygh of the progeny Tg in the block Blh can be described as ygh = E(ygh) + egh, where egh is the environmental effect or plot error with expectation E(egh) = 0 and variance Var(egh) = s 2, these errors are uncorrelated. Because we have allotted the plots of a block at random to the progenies, which must be tested in this block according to the design, this assumption of uncorrelated errors is reasonable.

The model described for ygh is an additive model of the tenera effects and the block effects. In section 2.2.1 we have already described how the parameters of an additive model can be estimated with the Least Squares Method. To estimate these parameters we must solve the so-called Normal Equations. Let the Least Squares estimates be denoted by f for f , tg for t g and dh for d h. Let further ngh be 1 if progeny Tg is present in block Blh and ngh be 0 if progeny Tg is not present in block Blh.

The Normal Equations are then:

n.. * f + S g ng. * tg + S h n.h * dh = y.. (1)

ng. * f + ng. * tg + S h ngh * dh = yg.

for g=1,..., (2)

n.h * f + S g ngh * tg + n.h * dh = y.h

for h=1,...,NB (3)

Note that these equations are not independent. Equation (1) is equal to the sum of the equations of (2); also, equation (1) is equal to the sum of the equations of (3). Hence there are two linear dependencies between the Normal Equations. We will choose as a solution of the Normal Equations that solution where we take tC=0 and dNB=0.

The Least Squares Mean of an offspring Tg is defined as

f + tg + S h dh /NB

and this is the same for every solution of the Normal Equations.

The estimate for the variance s 2 is s2 = SS(res)/df(res) , where the residual sum of squares SS(res) is calculated as

SS(res) = S gS h ygh2 -[ f*y.. + S g tg * yg. + S h dh * y.h ]

with degrees of freedom

df(res) = n.. - [ C + NB -1 ].