Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Letter
  • Published:

Evidence of en bloc duplication in vertebrate genomes

Abstract

It has been 30 years since it was first proposed that the vertebrate genome evolved through several rounds of genome-wide duplications (polyploidizations)1. Despite rapid advances in genetics, including sequencing of the complete genomes of several divergent species, this hypothesis has not been tested rigorously and is still a matter of debate2. If polyploidizations occurred during chordate evolution, there should be a network of paralogous regions in the present-day jawed vertebrate (Gnathostomata) genomes3. Here we present an investigation of the major histocompatibility complex (MHC) paralogous regions, which we accomplished by characterizing the corresponding region in amphioxus by identifying nine anchor genes and sequencing both the anchor genes and the regions that flank them (a total of 400 kb). Phylogenetic analysis of 31 genes (including the anchor genes) in these regions shows that duplications occurred after the divergence of cephalochordates and vertebrates but before the Gnathostomata radiation. The distribution of human and amphioxus orthologs in their respective genomes and the relationship between these distributions support the en bloc duplication events. Our analysis represents the first step towards demonstrating that the human ancestral genome has undergone polyploidization. Moreover, reconstruction of the pre-duplicated region indicates that one of the duplicated regions retains the ancestral organization.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Definition of the anchor genes.
The alternative text for this image may have been generated using AI.
Figure 2: Analysis of amphioxus anchor-gene phylogenetic relationships; phylogenetic reconstructions for the nine amphioxus anchor genes.
The alternative text for this image may have been generated using AI.
Figure 3: Summary of the cosmid gene organizations.
The alternative text for this image may have been generated using AI.
Figure 4: Evolution of the chordate proto-MHC region and its genes.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

  1. Ohno, S. Evolution by Gene Duplication (Springer, Berlin, 1970).

    Book  Google Scholar 

  2. Wolfe, K.H. Yesterday′s polyploids and the mystery of diploidization. Nature Rev. Genet. 2, 333–341 (2001).

    Article  CAS  Google Scholar 

  3. Lundin, L.G. Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. Genomics 16, 1–19 (1993).

    Article  CAS  Google Scholar 

  4. Kumar, S. & Hedges, S.B. A molecular timescale for vertebrate evolution. Nature 392, 917–920 (1998).

    Article  CAS  Google Scholar 

  5. Fitch, W.S. Distinguishing homologous from analogous proteins. Syst. Zool. 19, 99–113 (1970).

    Article  CAS  Google Scholar 

  6. Sambrook, J., Fritsch, E.F. & Maniatis, T. Molecular Cloning (Cold Spring Harbor Laboratory Press, 1989).

    Google Scholar 

  7. Zehetner, G. & Lehrach, H., The Reference Library System—sharing biological material and experimental data. Nature 367, 489–491 (1994).

    Article  CAS  Google Scholar 

  8. Shiina, T. et al. Genomic anatomy of a premier major histocompatibility complex paralogous region on chromosome 1q21-q22. Genome Res. 11, 789–802 (2001).

    Article  CAS  Google Scholar 

  9. Boone, M. & Upton, C. BLAST Search Updater: a notification system for new database matches. Bioinformatics 16, 1054–1055 (2000).

    Article  CAS  Google Scholar 

  10. Huang, X. & Madan, A. CAP3: a DNA sequence assembly program. Genome Res. 9, 868–877 (1999).

    Article  CAS  Google Scholar 

  11. Bateman, A. et al. The Pfam protein families database. Nucleic Acids Res. 28, 263–266 (2000).

    Article  CAS  Google Scholar 

  12. Sonnhammer, E.L., von Heijne, G. & Krogh, A. A hidden Markov model for predicting transmembrane helices in protein sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 6, 175–182 (1998).

    CAS  PubMed  Google Scholar 

  13. Kasahara, M., Nakaya, J., Satta, Y. & Takahata, N. Chromosomal duplication and the emergence of the adaptive immune system. Trends Genet. 13, 90–92 (1997).

    Article  CAS  Google Scholar 

  14. Maresco, D.L. et al. Localization of FCGR1 encoding Fcgamma receptor class I in primates: molecular evidence for two pericentric inversions during the evolution of human chromosome 1. Cytogenet. Cell Genet. 82, 71–74 (1998).

    Article  CAS  Google Scholar 

  15. Clark, M.S., Shaw, L., Kelly, A., Snell, P. & Elgar, G. Characterization of the MHC class I region of the Japanese pufferfish (Fugu rubripes). Immunogenetics 52, 174–185 (2001).

    Article  CAS  Google Scholar 

  16. Hansen, J.D., Strassburger, P., Thorgaard, G.H., Young, W.P. & Du Pasquier, L., Exprssion, linkage, and polymorphism of MHC-related genes in rainbow trout, Oncorhynchus mykiss . J. Immunol. 163, 774–786 (1999).

    CAS  PubMed  Google Scholar 

  17. Takami, K., Zaleska-Rutczynska, Z., Figueroa, F. & Klein, J. Linkage of LMP, TAP, and RING3 with Mhc class I rather than class II genes in the zebrafish. J. Immunol. 159, 6052–6060 (1997).

    CAS  PubMed  Google Scholar 

  18. Thompson, J.D., Higgins, D.G. & Gibson, T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (1994).

    Article  CAS  Google Scholar 

  19. Farris, J.S., Kallersjo, M., Kluge, A.G. & Bult, C. Testing significance of incongruence. Cladistics 10, 315–319 (1994).

    Article  Google Scholar 

  20. Gu, X. Statistical methods for testing functional divergence after gene duplication. Mol. Biol. Evol. 16, 1664–1674 (1999).

    Article  CAS  Google Scholar 

  21. Swofford, D.L. PAUP* . Phylogenetic Analysis Using Parsimony (*and other Methods). Version 4. (Sinauer Associates, Sunderland, Massachusetts, 2000).

    Google Scholar 

  22. Saitou, N. & Nei, M. The neighbour-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987).

    CAS  Google Scholar 

  23. Kumar, S., Tamura, K. & Nei, M. MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers. Comput. Appl. Biosci. 10, 189–191 (1993).

    Google Scholar 

  24. Strimmer, K. & von Haeseler, A. Quartet puzzling: a quartet maximum likelihood method for reconstructing tree topologies. Mol. Biol. Evol. 13, 964–969 (1996).

    Article  CAS  Google Scholar 

  25. Felsenstein, J. Confidence limits on phylogenenies: an approach using the bootstrap. Evolution 39, 783–791 (1985).

    Article  Google Scholar 

  26. Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).

    Article  CAS  Google Scholar 

  27. Solovyev, V. & Salamov, A., The Gene-Finder computer tools for analysis of human and model organisms genome sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 5, 294–302 (1997).

    CAS  PubMed  Google Scholar 

  28. Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).

    Article  CAS  Google Scholar 

  29. Gelfand, M.S., Mironov, A.A. & Pevzner, P.A. Gene recognition via spliced sequence alignment. Proc. Natl Acad. Sci. USA 93, 9061–9066 (1996).

    Article  CAS  Google Scholar 

  30. Guigo, R., Agarwal, P., Abril, J.F., Burset, M. & Fickett, J.W. An assessment of gene prediction accuracy in large DNA sequences. Genome Res. 10, 163–1642 (2000).

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank E. Faure for help in cloning the amphioxus RXR gene; M. Flajnik, C. Mawas, F. Birg, J. Berry, D. Birnbaum and L. Du Pasquier for critical reading of the manuscript; E. Danchin for helpful discussions, as well as all the members of the lab for their support; and M. Clark for the Takifugu rubripes BRD2 sequence. L. Abi-Rached is an M.R.T. fellow. This work was supported by INSERM.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Laurent Abi-Rached or Pierre Pontarotti.

Supplementary information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abi-Rached, L., Gilles, A., Shiina, T. et al. Evidence of en bloc duplication in vertebrate genomes. Nat Genet 31, 100–105 (2002). https://doi.org/10.1038/ng855

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue date:

  • DOI: https://doi.org/10.1038/ng855

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing