An open index of research

A status.lu publication

Biology & Genetics

Initial sequencing and analysis of the human genome

Eric S. Lander, Lauren M. Linton, Bruce Birren, Chad Nusbaum, Michael C. Zody · International Human Genome Sequencing Consortium

Published 15 February 2001 · Nature · Journal article

Summary

This paper reported the results of the publicly funded Human Genome Project, presenting and making freely available a draft sequence covering the great majority of the human genome along with an initial analysis. The consortium described the broad genomic landscape—including gene content, repeat elements, GC content, and recombination rates—and estimated a surprisingly low number of protein-coding genes, on the order of roughly 30,000–40,000. The work provided a foundational reference for human biology, medicine, and evolutionary studies.

Key findings

  • Produced and freely released a draft sequence covering about 90% of the euchromatic human genome, generated by hierarchical shotgun sequencing of overlapping clones.
  • Estimated the human genome to contain far fewer protein-coding genes than previously expected (initially roughly 30,000–40,000), with most of the genome non-coding and repetitive.
  • Characterized large-scale features of the genome, including the distribution of repeats, GC content, CpG islands, recombination rate, and sequence variation such as single-nucleotide polymorphisms.

Subjects & keywords

Cite this paper

APA

Eric S. Lander, Lauren M. Linton, Bruce Birren, Chad Nusbaum, & Michael C. Zody [International Human Genome Sequencing Consortium] (2001). Initial sequencing and analysis of the human genome. Nature. https://doi.org/10.1038/35057062

BibTeX
@article{lander2001initial,
  author    = {Eric S. Lander and Lauren M. Linton and Bruce Birren and Chad Nusbaum and Michael C. Zody and {International Human Genome Sequencing Consortium}},
  title     = {Initial sequencing and analysis of the human genome},
  journal   = {Nature},
  year      = {2001},
  doi       = {10.1038/35057062},
  url       = {https://doi.org/10.1038/35057062}
}

Related in Biology & Genetics

Accurate structure prediction of biomolecular interactions with AlphaFold 3

Josh Abramson, Jonas Adler and John M. Jumper

This paper introduced AlphaFold 3, a unified deep learning model that predicts the joint structure of complexes containing proteins, nucleic acids, small-molecule ligands, ions, and modified residues. It replaces much of the prior architecture with a diffusion-based module that directly generates atomic coordinates. The model achieved substantially improved accuracy over specialized tools across many interaction types, including protein-ligand and protein-nucleic acid complexes.

Nature Open access

De novo design of protein structure and function with RFdiffusion

Joseph L. Watson, David Juergens and David Baker

This paper introduced RFdiffusion, a generative diffusion model built on the RoseTTAFold network for de novo protein design. It enables a range of design tasks, including unconditional generation, symmetric oligomer design, functional motif scaffolding, and binder design. Many designs were experimentally validated, with solved structures closely matching the intended models.

Nature Open access

A draft human pangenome reference

Wen-Wei Liao, Mobin Asri and Jana Ebler

The Human Pangenome Reference Consortium presents a first draft human pangenome built from 47 phased, diploid genome assemblies of genetically diverse individuals. The assemblies cover more than 99% of the expected sequence per genome at over 99% base-level and structural accuracy, and are combined into a graph-based reference. Relative to GRCh38, the pangenome adds about 119 million base pairs of euchromatic polymorphic sequence and 1,115 gene duplications, improving representation of variation at structurally complex loci.

Nature Open access