Notes

Previous Next

Notes

1. The domestic rat (Rattus norvegicus) was not far behind in earlier centuries, but its current census and range are much more limited than that of the mouse.

2. Phenotypes of this kind are called Quantitative traits and the loci that control these phenotypes are called quantitative trait loci or QTLs.

3. Several detailed histories of the early years of mouse genetics have been published ( Keeler, 1931; Dunn, 1965; Morse, 1978, 1981, 1985; Russell, 1978, 1985; Klein 1986).

4. A tree showing the scientific descendants of Castle down through several generations is presented by Morse (1978). The author is pround to trace his own scientific heritage in mouse genetics back through Karen Artzt to Dorothea Bennett to Salome Waelsch to L. C. Dunn to William Castle.

5. Many common forms of cancer susceptibility and resistance are multifactorial, and until very recently, it was not possible to actually separate out the various loci involved. In the absences of inbred lines, cancer appears sporadically within a population, and there is no basis on which to demonstrate the involvement of genetic factors. However, with inbreeding and selection for resistance or susceptability to a particular form of cancer, the genetic nature of the disease can be proven beyond a doubt.
See also the Mouse Tumor Biology Database at MGI.

6. Francois Bonhomme has pointed out to the author that the lumping together of house mice and other "mouse-like" creatures is a peculiarity of English and other Germanic languages. In Latin-derived languages, a clear distinction is made; every French child, for example, knows the difference between a field mouse (mulot) and a house mouse (souris).

7. The complete definition of a commensal animal is one the "lives and feeds in close association with another species without benefiting it or harming it directly" (Bronson, 1984).

8. Population geneticists use the relative level of heterogeneity within a population as a predictor of the time that has passed since the population was "founded".

9. Karyotypic variation does exist within the M. m. domesticus group but it is solely of the Robertsonian type with fusions between acrocentric centromeres to form hybrid metacentric chromosomes. This variation is discussed more fully in Chapter 5.

10. The observation that hybrid sterility is expressed in males but not females is as predicted by Haldane's rule, which states that the heterogametic sex (in this case XY males) will always be the first to become sterile in inter-specific crosses (Haldane, 1922). The rationale for this empirical observation remains unknown.

11. However, there are two as yet unpublished reports of successful laboratory matings (F. Bonhomme, personal communication).

12. The Jackson Laboratory restricts its sale of mice to investigators who will use them (and their descendants) for scientific research and not commercial purposes.

13. It is possible to deviate somewhat from a strict brother-sister mating protocol during the production of an inbred strain, so long as the animals derived can still trace their lineage back to a single breeding pair at the beginning of the process. For example, survival of a line might require a mating between an offspring and parent at one or more generations. If this variation in breeding protocol occurs, one must calculate its effect on the theoretical "inbreeding coefficient" (Green, 1981) which will allow a re-determination of the stage at which the equivalent of 20 sequential brother X sister matings will occur. At this stage, the strain would be considered inbred.

14. The probability calculations for these later generations are somewhat more complicated, and the interested reader should refer to the excellent exposition on the use of probability in animal breeding studies by Green (1981).

15. The exception to this general rule can be seen in children born from unions between closely related family members. These children are often sickly, and it is this consequence of consanguinity that is likely to be responsible for the taboos against marriage between siblings and cousins that are enforced by most human cultures.

16. In this case, one must remember that the genetic background of the mutant strain will begin to diverge slowly from that of the inbred strain that gave rise to it.

17. The alternative cross-intercross or M system was used in the original work performed by Snell (1978).

18. The commonly used 129/SvJ strain obviously does not follow these rules, however, like most other cases of historical naming anomalies, it is accepted for use by the mouse community.
See also Revised Nomenclature for Strain 129 Mice at MGI.

19. Italics indicate my emphasis. The symbols used for some of the first mouse genes to be characterized were A for agouti, b for brown, c for albino (absence of color), d for dilute, Tfor Brachyury (tail), etc. As the number of known genes increased, two-letter symbols were used. Nearly all genes chracterized now are given primary symbols of at least three letters in length.

20. However, one must remember that M. spretus is a species, not a strain, and as such, different M. spretus lines may be polymorphic for the locus in question.

21. This would not be necessary for records maintained in a computer-based spreadsheet file.

22. Failure to produce litters could be a consequence either of failure to mate and/or a failure to achieve pregnancy subsequent to mating.

23. For a much more detailed exposition of this topic, the reader is directed to books by Hogan and her colleagues (1994) and by Rugh (1968).

24. To be perfectly accurate, the maximal information content possible in a diploid organism is equivalent to the haploid genome size multiplied by (1+x), where x is the fraction of nucleotides in the genome that are heterozygous. For mice and all other mammals, x will always be less than 0.01, and thus, it can be safely ignored in complexity calculations.

25. One centimorgan (cM) is defined as the distance between two markers that are observed to recombine with a frequency of 1%. A centimorgan is synonymous with a "map unit." The relationships between map distances and recombination frequencies are discussed more fully in Chapter 7.
See also centimorgan in the MGI Glossary.

26. The one-to-one correspondence between chiasmata and crossing over is accepted as fact today, but originally this idea was known as the "chiasmatype theory".

27. This value is 10% less than the generally quoted, although unsupported, consensus value of 1,600 cM (Davisson and Roderick, 1989).
See also Mouse Genome Length Estimates at MGI.

28. The lungfish Lepidosiren has a genome that is more than 20 times larger than the human/mouse genome. The genome of the plant Tradescantia is almost ten times larger.

29. I am taking broad liberties here with the definition of "typical". This calculation is just a back of the envelope exercise to obtain a very approximate number.

30. Although a centromere appears to be at one end of all normal mouse chromosomes, this cannot be the case at the molecular level because all DNA molecules must be capped at both ends with telomeres that maintain their integrity. Since there must be at least a short stretch of DNA that extends beyond the centromere, mouse chromosomes are formally acrocentric rather than telocentric.

31. Of course, once a karyotype has passed through this intermediate stage, and homozygosity for each Robertsonian translocation is attained, normal fertility is completely restored.

32. There have been numerous modifications of the Holliday model — including those proposed by Meselson and Radding — that allow a better fit to the actual data, and there is still lack of consensus on some of the details involved. However, the central feature of the Holliday model — single-strand invasion, branch migration, and duplex resolution — is still considered to provide the molecular basis for gene conversion.

33. The equation is rho = 1.660 = 0.098(GC), where rho is the buoyant density, and GC is the molar fraction of nucleotides that are G or C.

34. The 234 bp unit actually evolved through the initial duplication of an even smaller unit (116/118 bp), which in turn was derived from duplications upon duplications and divergence of three ancestral nonanucleotides (9 bp sequences) with a combined consensus of GAAAAA(T/A/C)(C/G)(AT).

35. The terms LINE and SINE were originally coined by Maxine Singer (1982) as abbreviations for long interspersed elements and short interspersed elements, respectively.

36. The best age for maximal production of embryos by superovulation is between 3 and 4 weeks before the female has actually reached sexual maturity (Hogan et al., 1994). Once sexual maturity has been reached naturally, the response to superovulation falls off, although it is not eliminated. Another factor affecting productivity is the general health and nutritional status of the females. If one intends to use females from a litter for superovulation, the litter should be culled of males early on to increase the milk supply to the female pups that remain.

37. Out of simple curiosity, it is interesting to consider why human beings have evolved away for an inductive link between the female orgasm and hormonal changes required to initiate a successful pregnancy. It seems likely that this occurred in response to the use of sex by people for purposes other than procreation — such as pair bonding — which can occur at times outside the estrus period of the female cycle.

38. In humans, implantation is the signal that alters the course of the estrus cycle. Thus, humans can not become pseudopregnant.

39. Whenever an interesting phenotype does arise, the transgene provides a tag for the cloning of flanking sequences from the disrupted gene and its wild-type counterpart to allow further characterization of the new mutant.

40. This calculation was performed by Chi-squared analysis with the Yates correction factor of 0.5. The Chi-squared statistical test is discussed in detail in Section 9.1.3. With a null hypothesis of a Tg/+ genotype and 13 offspring, one would expect each of the two classes (transgenic and non-transgenic) to be represented at equal levels of 6.5. If all of the observed offspring carry the transgene, the Chi-squared equation is solved as follows: (13 - 6.5 - 0.5)²/6.5 + (|(0 - 6.5)| - 0.5)²/6.5 = 11.1. With a single degree of freedom, this Chi-squared value leads to a P value of less than 0.001 (from Table 9.2).

41. The original congenic strain carried the Sl allele in an obligatory heterozygous statesince homozygotes are non-viable. The 129 agouti lines maintained by many investigators have since lost the Sl mutation and differ knowingly from the original 129 line only in the 15 cM region on chromosome 7 that encompasses the c and p loci.

42. The designation " anonymous locus" does not imply a lack of function but rather an ignorance as to whether a function exists. Presumably, most anonymous loci will actually be devoid of function. However, in a small number of cases, a function may be determined at a later date, at which time the anonymous locus will no longer be considered anonymous.

43. A fascinating exception to Mendel's first law is displayed by a variant portion of mouse chromosome 17 that acts as a selfish chromosomal region and is known as a t haplotype (Silver, 1993a). Ten to 20% of the animals in wild populations of M. musculus (of all known subspecies) carry a t haplotype. Heterozygous males, with a genotype of +/t, can transmit their "t-allele" to 99% or more of their offspring. This unusual phenomenon is called segregation distortion or transmission ratio distortion.

44. A statistical analysis suggests that more than luck is required to explain two aspects of Mendel's data. First, the seven genes that he chose to study each happened to lie on a different one of the seven pea chromosomes. Second, the data that Mendel collected showed an incredible "goodness of fit" with the outcome that he expected (Fisher, 1936). Suggestions have been made by Dunn (1965) and Wright (1966) to explain this early example of what today might be grounds for an investigation into experimental fraud. In the case of the ratios that are too good to be true, Mendel may have padded his numbers to fit the theory, not aware that small deviations from the expected were to be expected. Wright believes that Mendel could have made subconscious errors in favor of expectation as he was counting his samples, whereas Fisher suggests that Mendel could have concocted his numbers without actually doing the experiments. An alternative possibility is that one of Mendel's workers, rather than the great man himself, may have actually carried out the subterfuge in a misguided attempt to please his boss. In the case of the seven genes that happen to map to seven different chromosomes, it appears most reasonable to assume that Mendel eliminated all genes that failed to show independent assortment with all other genes in the study. Although this would clearly be considered fraudulent behavior in today's world, Mendel may not have seen it as such in the simpler world of the 19th century.

45. There are some important classes of DNA markers where this is not the case exemplified by the random amplification of polymorphic DNA (RAPD) loci, which are typed based on the presence or absence of a particular PCR product. In these cases, the allele that can be amplified into a PCR product should be considered as having a dominant relationship to the allele that cannot be amplified.

46. Alternative homologs are separated by a "/"; coupled alleles are placed adjacent to each other in order of linkage when this is known. If the order of linked loci is known relative to the centromere, then the most centromere-proximal locus is displayed at the left of a grouping. Different linkage groups are separated by commas. For example, if a third locus, C, not linked to A or B is under observation in the same breeding experiment, the complete genotype would be written as A B/a b, C/c.

47. The value calculated for quadruple recombination events (0.2)⁴ = 0.0016 has only a negligible effect on the final percentage and can be safely ignored.

48. One morgan is equivalent to 100 cM.

49. Evaluation of interference through analysis of linkage data is confounded by genotyping errors, which will suggest falsely that double crossovers have occurred. This is true for error rates even as low as 1%. It is only when such errors are controlled for that the true extent of interference becomes apparent in mammalian genomes (Weber et al., 1993).

50. If you use a hand-held calculator to find a specific solution to the Carter-Falconer equation, be sure the calculator is in "radians" mode and not in "degrees" mode to obtain a proper solution to the inverse trigonometric function tan^-1.

51. It is important to remember that interference only operates on genetic events that occur in the same meiotic cell and it is most readily apperent in genetic analyses based on the backcross. It does not apply to the analysis of recombinant inbred lines, where crossover events accumulate over multiple generations, or even to a traditional outcross-intercross analysis where crossover events can occur in both parents.

52. Since the original publication of the Maniatis manual, a second edition has appeared, other competing manuals have been published, and most suppliers of molecular biology reagents now also provide detailed accounts of molecular techniques.

53. The term "polymorphic" was originally defined by population geneticists as a locus with at least two alleles that are both present at a frequency of greater than 1% in a particular population. The term has been co-opted by geneticists who work in the laboratory to refer to any locus with two alleles that can be utilized in an experimental cross. The opposite of polymorphic is monomorphic.

54. The term RFLP include the word polymorphism, which, as indicated in the previous footnote, is not always accurate for the locus at hand from a strict interpretation according to population geneticists. Purists have substituted "variant" for "polymorphism" to produce the acronym RFLV; however, the term RFLP is still more commonly used.

55. In the absence of breeding data, alternative interpretations of these results are possible. For example, animals with a single restriction fragment could be heterozygous with a deletion of this locus on one chromosome and animals with two restriction fragments could be homozygous with the presence of a restriction site in the middle of the region that hybridizes to the probe. Other more complex interpretations are also possible.

56. This is not true for special hypervariable classes of RFLPs such as minisatellites discussed in Section 8.2.3.

57. It is more difficult to obtain "clean" DNA from animal tissue than from cells grown in tissue culture. Enzymes that are more sensitive to minor contaminants may work fine on tissue culture samples but not on spleen, liver, or tail DNA.

58. The percentage of CpGs in the mammalian genome that are methylated has been estimated at ~90%. Interestingly, the class of non-methylated CpGs are not randomly distributed, but rather, are clustered into islands that distinguish the 5'-ends of many genes. This fact has been exploited by molecular genetic sleuths searching for cryptic genes within large cloned regions for the purpose of positional cloning as described in Section 10.3.4.

59. This number is calculated from the known base composition of mammalian DNA. Nucleotides A and T are each present at a frequency of 0.29, and C and G are each present at a frequency of 0.21. The expected frequency of occurence of the tetranucleotide TCGA can be calculated as (0.29)(0.21)(0.21)(0.29) = 1/270 bp. Similarly, the expected frequency of MspI sites can be calculated as 1/514 bp.

60. It is also possible, although less likely considering the short length of the genomic fragments that are amplified, that some polymorphisms may arise through insertions or deletions confined to the region between two primer sites. In this case, alternative RAPD alleles might both be recognized as PCR fragments of different sizes. In one large data set from the mouse, polymorphisms of this type were not observed (Woodward et al., 1992).

61. The calculations are greatly simplified by assuming that: (1) the unique copy component of the genome is equivalent to random sequence with no preference for, or avoidance of, any particular nucleotide combinations; and (2) PCR amplification will be limited to regions of 2 kb or less in length. The frequency with which any sequence of N nucleotides will occur is once evey 4^N bases on each strand. The total number of occurences of any one oligomer in the single copy component of a genome of complexity C will be 2C/4^N, where the multiplication by two accounts for the two strands of the double helix. The probability that a second oligomer of the same composition will be present on the opposite strand within 2 kb (2 X 10³ bases) in one direction from the first oligomer can be calculated as 2 X 10³/4^N. The average total number of PCR amplifiable products canbe calculated as the total number of sites at which the primer will hybridize multiplied by the frequency with which a second site will be present within a short enough distance (here set at 2 kb) to allow amplification to occur: (2C/4^N)(2 X 10³/4^N) = 4,000 C/16^N.

62. At least 90% of the fragments observed in experiments performed to date are less than 2 kb in length (Welsh and McClelland, 1990; Williams et al., 1990; Welsh et al., 1991; Nadeau et al., 1992; Serikawa et al., 1992; Woodward et al., 1992).

63. In actuality, the repeat families B1, B2 and L1 are not distributed in a random manner, but instead, are each located preferentially in different chromosomal domains as discussed in Section 5.4.4. As a consequence, it is not possible to derive a mathematical formula that can be used to predict the numbers of PCR products to be expected with different sets of primers.

64. The CA-repeat subclass of microsatellites appears to form an altered DNA structure known as Z-DNA (Hamada and Kakunaga, 1982) that was thought to have special properties at one time, but not now.

65. Microsatellite polymorphisms have also been referred to as simple sequence length polymorphisms or SSLPs. However, the acronym SSLP is so similar to the unrelated acronym SSCP (for single-strand conformational polymorphism) that I have not used the term here to avoid confusion.

66. Stable elongation across TATA... stretches would require lower reaction temperatures, which, in turn, would result in higher levels of non-homologous pairing and non-specific PCR products.

67. Pentamer and hexamer repeat loci are also present in the genome at much lower frequencies.

68. The stated goal of the Whitehead/Mit Genome Center is to define a total of 6,000 microsatellite loci that can be placed into 1 cM "bins". When this goal is reached, each 1 cM bin will hold approximately four microsatellites, and the average interlocus distance will be 500 kb.

69. This simplifying assumption is only valid when there is no distortion of equal segregation of alleles from either of the two loci under analaysis.

70. The author is indebted to Earl Green for pointing out this simplification of the Chi-squared equation in the special case of the backcross as shown in equation 1.40 in his book (Green, 1981).

71. Many statistics textbooks have graphs that allow a more precise determination of the P values associated with a continuum of Chi-squared values.

72. As indicated in Figure 9.13, this "swept radius" is attained in all backcross experiments with 45 or more samples.

73. The probability of linkage is equal to one minus the probability of no linkage. In actual usage, this formulation is easier to work with since the probability of no linkage will be a linear function of the P value obtained from Chi-squared analysis in all cases where P is small (< 0.001). According to Bayes' theorem, the probability of no linkage is equal to the "joint probability" of no linkage divided by the sum of the joint probabilities of the two possible outcomes, linkage and no linkage. The joint probability of no linkage is equal to the "prior probability that linkage will not exist" (equivalent to the fraction of the genome outside the swept radius on either side of the marker) multiplied by "the conditional probability that the statistical test will detect linkage when, in fact, linkage does not exist" (equal to the P value derived from the Chi-squared test). Even if the swept radius is as large as 35 cM, the fraction of the genome not covered within this distance on either side of a marker locus is still close to one [(1500-70)/1,500 = 0.95]. Thus, the prior probability of no linkage can be safely estimated as one, and the joint probability can be approximated as the P value. The joint probability of linkage is equal to the "prior probability of linakge" (equivalent to the fraction of the genome swept within two radii of the marker locus) multiplied by the "conditional probability that the statistical test will indicate linkage when, in fact, linkage does exist" (equivalent to the power of the Chi-squared test, 1 - beta, where beta is defined as the probability of type II error). The power of the Chi-squared test is dependent on the actual data set and is not easily estimated. For the sake of simplicity, this value has been set equal to one. For a more detailed account of Bayesian analysis, the reader should consult an advanced statistics text.

74. The fraction of the genome swept is equal to twice the swept radius in centimorgans (see Figures 9.13 and 9.14) divided by the total genome length of ~1500 cM.

75. In actuality, it is prudent to set up at least five cages of brother-sister pairs for mating at each generation in case some are not fertile. However, the offspring from only a single pair should be chosen to continue the construction of the RI strain. Each original pair of F₂ progenitor animals can only be used to produce a single new RI strain.

76. In the case of the very first set of RI strains established from a cross between BALB/cAnNBy and C57BL/6JNBy, upper case letters were used to distinguish different members of the set from each other with individual strain names of CXBD, CXBE, CXBG and so on (Bailey, 1971). Most other RI sets conform to the use of numbers as distinguishing symbols.

77. At the time of writing, 31 strains from this double set had been verified genetically, two original strains were still undergoing tests, and eight original strains had become extinct but DNA samples were still available (Marshall et al., 1992).

78. Of course, like all inbred strains, new mutations can and will arise at a low frequency, especially at hypermutagenic loci like microsatellites (see Section 8.3.6.4). However, new mutant alleles are usually distinguishable as such since they are likely to be different from the alleles present in both progenitor strains.

79. If these direct protocols fail to detect polymorphisms, one could use the probe in hand to recover a cosmid or YAC clone containing the gene of interest, and then search this larger cloned fragment for a nearby microsatellite that can be developed as a linked DNA marker as described in Section 8.3.6. On the other hand, it may be more efficient to forget the RI approach at this point, and use a backcross mapping panel instead as described in Section 9.3.

80. As discussed in Section 9.2.5, this may not be true for SDPs associated with complex phenotypes rather than DNA markers.

81. This measure of the swept radius is based on the assumption that the RI-determined values for linkage distance are accurate representations of actual linkage distances. A more stringent measure of swept radius is discussed in Section 9.4.4 for backcross data and presented in Figure 9.13.

82. The discordant strain number used in these calculations is set at 0.5 units above the maximum allowable strain number shown in Figure 9.5 i n order to determine the map distance above which actual pairs of loci are more likely than not to produce integer discordance values that exceed the maximum number allowable for linkage determination.

83. Particular regions of the genome that are of special interest to mouse geneticists are likely to be more densely populated with marker loci than regions that are not under investigation in and of themselves. A good example of this bias in marker distribution is in the 2 cM long H-2 region of chromosome 17 with 16 loci that have been typed in the BXD set; all 16 loci have an identical SDP.
See the BXD data set for Chromosome 17 at MGI.

84. Statements concerning the observed number of genomic patches present along a chromosome must be qualified because of the possibility that additional patches lie hidden in regions between two adjacent typed loci for which SDPs are available. As the density of typed loci increases, the probability of hidden patches goes down but never reaches zero.

85. An alternative interpretation of these data is that both mean estimates of linkage distance are relatively true to the particular pair of parental strains used in each cross. As discussed in Section 7.2.3.3, different crosses can vary in their propensity to undergo recombination in individual chromosome intervals.

86. The LD₅₀ point for a chemical is the dosage that is lethal to 50% of the treated animals in a designated period of time.

87. The law of the product states that the probability of occurrence of n independent events is equal to the product of the probabilities of each individual event. For RI strains, the probability of occurrence of either progenitor allele at any locus is simply 0.5. Thus, the probability of occurence of n loci from the same parent is (0.5)ⁿ.

88. In theory, it is possible to make use of unbalanced SDPs to search for the multiple loci involved in expressing a trait. In the example described in the text, susceptible BXD strains would have to have B6 alleles at both susceptability loci, but resistance would mean only that a DBA allele was present at either or both loci. One could fill in an SDP using only susceptible strains and search for linkage relationships with this subset of data points, but the numbers may be too small to be significant. However, potential intervals of linkage suggested by this first step analysis could be vetted in pairs (since there will be two loci) with the requirement that all nonsusceptable strains must have at least one DBA allele at the two loci. The main problem with this type of analysis is that the small numbers of strains present in each characterized RI set will usually preclude one from obtaining statistically significant results.

89. In theory, it is also possible to use RI data of this type in a search for the responsible loci. Let us assume that only two loci are involved in the phenotypic expression of interest, and further, the two mixed genotypes cause an intermediate phenotype while the two single-progenitor genotypes cause a corresponding progenitor phenotype. Then an SDP can be written with three possible values for each strain: A, B and 0.5A0.5B. As in the case above, one could fill-in only the A and B values of an SDP (together expected in 50% of the strains) and search for a pair of potential linkage relationships that could be vetted by the requirement for the presence of alternative alleles in the strains that show the intermediate phenotype.

90. In a real-life study, geneticists usually wait to see at least two recombination events before concluding non-equivalence because of the possibility of isolated aberrant events or experimental errors.

91. As discussed in Section 7.2.3.3, a sample size of ~500 per individual cross represents an upper boundary beyond which there are greatly diminishing returns in resolving nearby loci.

92. There are two other explanations for apparent double crossover events that cover a single locus in a single animal. The first is the trivial possibility of mistyping. The second is the infrequent but real possibility of gene conversion without recombination at an isolated genetic site (Hammer et al., 1991). Neither of these explanations is likely to be valid when the region between the two breakpoints contains more than one typed locus.

93. Values for recombination fraction and map distance can be readily transformed into each other through the use of a mapping function as described in Section 7.2.2.3.

94. Of course, most previously determined inter-locus map distances are far from accurate. If the true map distance between markers was previously underestimated, this would obviously lead to a reduction in the frequency at which linkage could be demonstrated in any future cross that used these two markers.

95. Direct calculations of swept radii for the intercross are confounded by the greater complexity inherent in the data that can be generated. As described in Section 9.1.3.4, intercross data show three degrees of freedom in contrast to the one degree of freedom associated with backcross data. The intercross-specific Chi-squared test described in this earlier section can be used to evaluate significance in all comparisons of allelic segregation from two potentially linked loci.

96. The computer program provided in Appendix D can be used to derive recombination limits for any level of confidence.

97. The statistical approach described in Appendix D2 can be used to ascertain whether a significant difference in expression exists between any two sets of animals.

98. The formula for computing variances is given in Appendix D2.

99. To ensure complete genetic homogeneity of the F₁ population, it is important that all of the initial matings be conducted in the same gender direction, with one strain fixed as the maternal progenitor and the other strain fixed as the paternal progenitor. The choice of gender can be informed by the reproductive characteristics of strains described in Chapter 4 and in Table 4.1.

100. It may also be useful in this case to set up independent crosses with each parental strain. It may turn out that the data obtained from one of these backcrosses will be easier to analyze.

101. Although the idealized distribution for three polygenic loci shows a small dip in the middle ( Figure 9.19 bottom right-hand panel), with a typical-sized N₂ test population (<=200 animals), there is likely to be a finite standard error on top of the naturla variance that will cause the two peaks to merge into one.

102. Non-parametric tests of this type are used to determine whether the actual genotype at a particular marker locus shows a significant level of corrleation with the rank order of trait exporession in individual animals.

103. The vast majority of cells that have failed to undergo a mitotic crossover in the chromosomal region centromeric to the marker gene can be eliminated with alloantigen-specific antibody and complement. Only cells that have lost the alloantigen will survive. Of these, only a subset will have undergone allele loss as a result of a mitotic recombination event.

104. The X chromosome is essentially conserved in its entirety across all eutherian mammals, and is excluded from estimates of conserved synteny.

105. To keep this hypothetical problem simple, I have assumed that all of the derived YAC clones are non-chimeric and that end fragments from all of these clones areeasily isolated. Unfortunately, the real-life situation is likely to be less straightforward.

106. The efficacy of this approach and all others dependent on cDNA libraries is greatly increased by using a normalized library in which all transcripts are represented equally irrespective of their different relative abundancies within the tissue itself (Patanjali et al., 1991). In theory, the normalization process should even out the representation of all transcribed sequences so that clones of actin mRNA, for example, are no more frequent than clones of rare messengers from the same tissue. In practice, normalization only succeeds part way; very rare messengers will always be under-represented.

107. Whenever genomic fragments are used to probe for expressed sequences, it is essential to prehybridize the probe or target with unlabeled total mouse DNA in order to block highly repetitive sequences, which are present in the non-coding regions of a subset of mammalian transcripts.

108. In addition, two classes of artifactual products were also recovered. One class contained genomic fragments with random sequences having coincidental homology to splice sites. The other class contained true exons but with flanking intronic sequences on one side.

109. The program works by estimating solutions to different integrals over segments of the appropriate probability density function.