Ensembl Release 50 Mirror from BGI-SZ Macaque

 

Explore the Macaca mulatta genome

Search Ensembl Macaca mulatta

Search:

e.g. chromosome 19 or 1:10000..200000 or Q59FM4.1

Karyotype

Click on a chromosome for a closer view

chromosome X chromosome 20 chromosome 19 chromosome 18 chromosome 17 chromosome 16 chromosome 15 chromosome 14 chromosome 13 chromosome 12 chromosome 11 chromosome 10 chromosome 9 chromosome 8 chromosome 7 chromosome 6 chromosome 5 chromosome 4 chromosome 3 chromosome 2 chromosome 1 Macaque karyotype selector

Jump directly to sequence position

Region:
From (bp):
To (bp):

About the Rhesus macaque genome

Assembly

Rhesus macaque Mmul_1 is a preliminary assembly of the rhesus monkey, Macaca mulatta using whole genome shotgun (WGS) reads from small and medium insert clones. Several WGS libraries, with inserts of 2-4 kb and 10 kb, fosmids with ~35kb inserts, and BACs with 180kb inserts were used to produce the data.

The release was produced by the Macaque Genome Sequencing Consortium, led by the Baylor College of Human Medicine, melding three separate complementary assemblies (created using the Atlas, Celera and PCAP systems). This involved iteratively splitting likely chimeric scaffolds and joining together existing scaffolds where possible. Chimeric scaffolds (<100 total) were identified by breaks in synteny with the human genome, which were confirmed to be artefacts by the other assemblies.

This is a draft sequence and may contain errors so users should exercise caution. Typical errors in draft genome sequences include misassemblies of repeated sequences, collapses of repeated regions, and unmerged overlaps (e.g. due to polymorphisms) creating artificial duplications. However base accuracy in contigs (contiguous blocks of sequence) is usually very high with most errors near the ends of contigs. [More about the assembly].

Annotation

The gene set for macaque was built using the ensembl pipeline. The species-specific resources for macaque are relatively limited, so we decided to take a combined approach utilizing macaque's great similarity to human to aid our annotation efforts. The gene structures are mainly based on alignments to human and macaque protein data. Both macaque and human cDNAs were used to add UTR structures, and finally gene predictions based on Uniprot proteins and human cdnas were used to fill gaps in the annotation.

What's New in Ensembl 50

Macaca mulatta News

  • Non-coding genes
    These have been updated for most species, including an miRNA update and HGNC names where possible.
  • Multiple alignments

    The multiple alignments are being extended with new species and 2X genomes.
    Read more...

General News

  • Canonical Transcripts

    Canonical transcripts have been defined for all genes in the core databases.
    Read more...

  • SSAHA
    From release 50 we will no longer be providing SSAHA sequence search. If you wish to run your own SSAHA sequence search you can download the files to generate the search hashes from our FTP site.
  • Projections of gene names and GO terms
    These have been done as usual, between a variety of species.

More news...

Statistics

Assembly: MMUL 1.0, Feb 2006
Genebuild: Ensembl, Jan 2006
Database version: 50.10i
Known protein-coding genes: 874
Projected protein-coding genes: 17,505
Novel protein-coding genes: 3,526
Pseudogenes: 1,762
RNA genes: 4,224
Genscan gene predictions: 125,893
Gene exons: 246,943
Gene transcripts: 38,146
Base Pairs: 3,093,871,206
Golden Path Length: 3,097,179,960
Most common InterPro domains: Top 40 Top 500

How the statistics are calculated


 

© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details. This is a mirror site of Ensembl from BGI-SZ.

                
Ensembl Release 50 Mirror from BGI-SZ release 50 - Jul 2008
HOME · SITEMAP HELP