Overview |
Start
at bench |
Install
TOUCAN |
Get
Sequences |
Annotate |
MotifScanner |
Statistics |
ModuleSearcher |
MotifSampler |
Return to bench |
References
Creating
an Annotation Table for the Sequence Set
Although not
absolutely required in the context of our "story", we would like to
briefly describe a very useful feature of TOUCAN which creates a table containing
all important references and
accession numbers of the genes in the active sequence set.. From
the TOUCAN menu, choose "Get_Seq", "From Ensembl", "Get Info", and hit
the "Update" button. This starts the process of retrieving the
informations and creating the table, which can be followed in the
progress bar. Each row represents one gene of your dataset and each
column represents one kind of database identifier. This includes basic
gene, mRNA and protein IDs as well as sites of "data integration" like
Ensembl Gene, EMBL, HUGO, UniGene, UniProt, RefSeq, and LocusLink. In
addition, IDs in the fields of protein domains and motifs (Interpro),
as well as protein structure (PDB) are listed. As the number of
microarray platforms increases rapidly, also the number of columns in
TOUCAN displaying chip-specific identifiers is constantly growing.
Thereby, not only the world-leader in this technology, Affymetrix, is
covered but also other platforms. Finally, the table presents also IDs,
which are related to functional information of each gene, like the
involvement in biological processes, or the localization to specific
cellular compartments (GO, MIM).
To look at this information in detail, you can highlight
all rows and copy / paste the data into programs like MS EXCEL.