Genetics Lecture 9

More about linkage

Human linkage maps

The size of the human genome is big enough that it took a long while to discover any evidence of genetic linkage. It was not until the 1950s that the first autosomal linkage groups were discovered. These all involved the polymorphic blood groups. One of the first was between one form of hereditary elliptocytosis (an anaemia caused by malformed erythrocytes) and the rhesus blood group. This study was important in showing that linkage information could prove the existence of more than one form of the same disease, it showed that there were some families in which the disease elliptocytosis was clearly linked to rhesus but that there were others in which it was not. This implies the existence of at least two genes which when mutant could cause the same disease (genetic heterogeneity).

relationship to the human genome project

It was not until the discovery of extensive inherited variation in DNA sequence which could be traced in families using Southern blotting or by PCR that human linkage studies really took off.

types of genetic 'marker'

Any genetic variation can be studied and it is not necessary that the objects being measured in a linkage study are genes. Any polymorphic piece of DNA can be studied. Examples include

The use of Lod scores

    What makes a cross informative?
  • There must be one (best) or two (not quite so good) parents who is (are) doubly heterozygous for the two loci.

The first thing to say here is that I do not expect you to be able to calculate a Lod score. However, you must have a general understanding of what the lod score is because it is an important and often quoted statistic.

Suppose that we wish to decide whether two genes / markers are linked.

The map of the human genome

Many thousands of genetic markers have now been mapped so that there is a minimum of about one marker per centiMorgan and in some regions of the genome there are as many as ten markers per centiMorgan. On the left are two maps covering the same small region of chromosome 9 (a region in which I have a particular interest). The map on the right is a genetic map made by considering the inheritance of many markers (with names such as D9S66) in about 60 large families. The map on the left is not a genetic map, it is a map based on a large number of overlapping pieces of cloned DNA (currently being sequenced) and so it is an accurate reconstruction of the actual DNA sequence of that region of chromosome 9. As is to be hoped, the genetic map gives the same order of markers and approximately the same relative distances between them. There are some differences, The gene ABO is not resolved from marker locus D9S150 on the genetic map whereas, in physical reality there is a good sized gap. Differences like this occur because the rate of genetic recombination is not absolutely even throughout the genome, some areas are hot spots and others are areas of reduced recombination. This is reflected in the genetic distances between pairs of markers. The total genetic map length of the human genome is about 3,000 cM and by a lucky coincidence, the total genome length is about 3,000 million basepairs. So on average, 1 cM is equivalent to 1 Mb (Mb = million basepairs). In this region of chromosome 9 there is an elevated recombination rate compared to the genome average and so here 1 cM = ~ 300 kb.

Diagnosis of genetic disease through linkage analysis

The identification of linkage to a marker locus is very often the first step on the way to cloning a disease gene (see the next section). However, it also immediately provides diagnostic opportunities even before the disease gene itself has been identified and often when absolutely nothing is known about the nature of the underlying genetic defect. In this pedigree (which you will recognise as being the phase known pedigree above with added children) an autosomal dominant mutation is present. The disease gene has not been identified but it is known to be closely linked to DNA marker polymorphisms Aa and Bb and to map between them. Clearly, in this family the mutant disease gene is present on the chromosome which happens to have alleles A and B on it. [Remember that Aa and Bb have nothing whatsoever to do with the disease, they are simply two bits of the genome which are polymorphic and which are within about a million basepairs of the disease gene.] The unborn baby can be tested to see which alleles it has inherited from its father at the two marker loci. If it has inherited A and B then it almost certainly has also inherited the disease mutation.

The identification of 'positional candidate' genes

As well as the TSC1 success story above there have been many other examples of successful identification of disease genes based solely, or almost solely, on positional information. The classic example is the cystic fibrosis gene CFTR.


Recommended reading

The topics include:

Reading:


SAQs

Answers


Back to the top
Back to the introduction
Next lecture