This LaTeX document is available as postscript or asAdobe PDF.
These notes are based on material prepared by Brian Kennedy (1993), but modified slightly. The intent of the notes were to present methods that were commonly used in the past (1950's to 1970's), and lead up to current models and methods. These notes will also be used to introduce basic matrix algebra concepts.
1. Offspring on Parent Regression
An infinitesimal model is assumed throughout these notes. The methodology is based on linear regression methods.
The analysis begins with a model. The model begins with an equation,
Ordinary least squares (LS) equations are constructed to estimate
and
,
i.e.,
1.1 Small Numerical Example
Below are records on daughters and their dams.
| Record | Daughter | Dam |
| (i) | yi | xi |
| 1 | 51 | 60 |
| 2 | 57 | 58 |
| 3 | 45 | 48 |
| 4 | 60 | 37 |
| 5 | 40 | 25 |
In matrix format, the model looks as follows:
Appropriate SAS IML statements would be
y = { 51, 57, 45, 60, 40 };
x = {1 60, 1 58, 1 48, 1 37, 1 25};
print x y;
The ordinary least squares equations would be
xx = x`*x;
xy = x`*y;
yy = y`*y;
c = inv(xx);
bhat = c*xy;
print xx, xy, yy, c, bhat;
The numerical results are
Thus, the estimate of heritability for these data would be
rx = 2;
N = 5;
red = bhat`*xy;
ev = (yy - red)/(N-rx);
print ev;
Numerically,
2. Offspring on Mid-Parent Average
In this case, there can be several offspring from one set of
parents (as in swine or rabbits), and each offspring would have
just one record. However, a sire (dam) is mated to just one
dam (sire). A model that has been employed is
To illustrate the method, suppose we have the following data on 8 progeny from 4 sire-dam pairs.
| i | j | yij | Sire | Dam | Average |
| Record | Record | xi | |||
| 1 | 1 | 14 | 21 | 25 | 23 |
| 1 | 2 | 30 | 21 | 25 | 23 |
| 1 | 3 | 20 | 21 | 25 | 23 |
| 2 | 1 | 10 | 20 | 12 | 16 |
| 3 | 1 | 25 | 4 | 20 | 12 |
| 3 | 2 | 7 | 4 | 20 | 12 |
| 4 | 1 | 20 | 8 | 4 | 6 |
| 4 | 2 | 13 | 8 | 4 | 6 |
Let
and assume apriori that
,
then
x = {1 1 0 0 0 23,
1 1 0 0 0 23,
1 1 0 0 0 23,
1 0 1 0 0 16,
1 0 0 1 0 12,
1 0 0 1 0 12,
1 0 0 0 1 6,
1 0 0 0 1 6};
y = {14, 30, 20, 10, 25, 7, 20, 13};
g = { 0 1 1 1 1 0 };
gi = diag(g);
xx = x`*x;
xy = x`*y;
yy = y`*y;
mme = xx + gi*1.0666667;
c = inv(mme);
bhat = c*xy;
red = bhat`*xy;
ev = (yy - red)/6;
print xx, xy, yy, c, bhat, red, ev;
end;
The denominator in the equation for ev is 6 because
N=8 and the rank of the fixed effects columns (i.e.
and xi) is 2 giving a difference of 6.
The mixed model equations, mme, are
mme, is
One must also estimate the family variance,
from this model. A procedure that will be described later,
called restricted (or residual) maximum likelihood (REML),
can be used for this purpose.
Let the inverse of mme be represented as
Families would be treated as fixed effects
if they were selected as
the better families out of the population. Then
simple least squares equations are used. That is,
3. Family Means Analysis
Only the averages of the offspring and the number of offspring in each family are known. Individual records of offspring are not known. The model for individual records is the same as in the last section, but now we only have family means. The data are as follows:
| Family | Mid-parent | Number in | Family |
| Number | Average | Family | Mean |
| 1 | 23 | 3 |
|
| 2 | 16 | 1 | 13 |
| 3 | 15 | 2 |
|
| 4 | 12 | 2 |
|
The equation of the model, in this case, is
The key to this analysis is to account for different number of offspring per family mean by using a weighted LS analysis.
4. Nested Design
In this situation, sires are mated to several dams each, but there
is still only one offspring per dam and one record per offspring.
Sires are assumed to be unrelated and not selected, and sires and
dams are mated at random. The equation of the model is
| Sire | Dam | Offspring |
| Number | Record | Record |
| 1 | 50 | 41 |
| 1 | 42 | 39 |
| 1 | 45 | 52 |
| 2 | 38 | 44 |
| 2 | 48 | 60 |
| 3 | 35 | 43 |
| 3 | 27 | 37 |
| 3 | 36 | 30 |
| 3 | 43 | 47 |
Assuming that ks=12, then the MME and subsequent results are
as follows:
If sires have been selected in some way, then treating them as
fixed effects is necessary, and the sire variance is not estimated.
Then
is still an unbiased estimate of one half heritability
provided that the other assumptions of the model are still true.
This LaTeX document is available as postscript or asAdobe PDF.
Larry Schaeffer