Click on a chromosome for a closer view
This release is based on the NCBI 36 assembly of the human genome [November 2005]. The data consists of a reference assembly of the complete genome plus the Celera WGS and a number of alternative assemblies of individual haplotypic chromosomes or regions.
The International Human Genome Sequencing Consortium have published their scientific analysis of the finished human genome.
Since release 38 (April 2006) the gene annotation presented has been a combined Ensembl-Havana geneset, which incorporates more than 18,000 full-length protein-coding transcripts annotated by the Havana team with the Ensembl automatic gene build. The human genome sequence is now considered sufficiently stable that since 2004 the major genome browsers have come together to produce a common set of identifiers where CDS annotations of transcripts can be agreed and these identifiers are also shown.
The ENCODE (ENCyclopedia Of DNA Elements) project aims to find functional elements in the human genome.
Genes and transcripts are assigned names using a new procedure.
Read more...
| Assembly: | NCBI 36, Oct 2005 |
| Genebuild: | Ensembl, Dec 2006 |
| Database version: | 50.36l |
| Known protein-coding genes: | 20,067 |
| Novel protein-coding genes: | 1,461 |
| Pseudogenes: | 6,282 |
| RNA genes: | 4,810 |
| RNA pseudogenes: | 2,931 |
| Immunoglobulin/T-cell receptor gene segments: | 388 |
| Genscan gene predictions: | 49,796 |
| Gene exons: | 294,420 |
| Gene transcripts: | 61,318 |
| SNPs: | 13,099,397 |
| Base Pairs: | 3,253,037,807 |
| Golden Path Length: | 3,093,120,360 |
| Most common InterPro domains: | Top 40 Top 500 |
© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details. This is a mirror site of Ensembl from BGI-SZ.