Home > Platforms > Sequence Annotation
Sequence Annotation

Positioned at the end of the sequencing pipeline at GSC, our group is involved in "adding value" to the genome sequence. Members of our group perform genome analysis and do a genome-wide annotation of genes. Annotation is the process of taking the raw DNA sequence and adding analysis and interpretation to extract the biological significance contained within that sequence. Recently annotated genomes include Human chromosomes 7, 2 and 4, UTI89, a uropathogenic strain of Escherichia coli and Salmonella paratyphi A. Members of the analysis and annotationgroup are also part of WormBase -- the major public biological database for the nematode, Caenorhabditis elegans.

Genome assemblies typically go through an initial analysis which includes finding repeats with RECON, splice junction/intron analysis, determination of GC content, and detection of CpG islands. Next, the genome is put through the analysis pipeline to produce a set of mapped genes. Currently, the analysis pipeline comprises of a modified Ensembl pipeline, Eannot, an evidence-supported, gene model building program and a model merging program. Finally the annotated genome is displayed in Gbrowse, a graphical genome browser.

 
Sequence Annotation Links
454 Life Sciences
Applied Biosystems
Sequence Annotation Contact
Kym Hallsworth-Pepin
Group Leader of Sequencing Annotation
Send email

Washington University School of Medicine
The Genome Center
4444 Forest Park Ave
St. Louis, Missouri 63108
USA
 
Selected Sequence Annotation Publications

Title
Advances in genome biology and technology.
Authors
Mardis ER.
Journal
  Pharmacogenomics. 2004 Jun;5(4):355-6.