Overview | Start at bench | Install TOUCAN | Get Sequences | Annotate | MotifScanner | Statistics | ModuleSearcher | MotifSampler | Return to bench | References  
   
Creating an Annotation Table for the Sequence Set

Although not absolutely required in the context of our "story", we would like to briefly describe a very useful feature of TOUCAN which creates a table containing all important references and accession numbers of the genes in the active sequence set.. From the TOUCAN menu, choose "Get_Seq", "From Ensembl", "Get Info", and hit the "Update" button. This starts the process of retrieving the informations and creating the table, which can be followed in the progress bar. Each row represents one gene of your dataset and each column represents one kind of database identifier. This includes basic gene, mRNA and protein IDs as well as sites of "data integration" like Ensembl Gene, EMBL, HUGO, UniGene, UniProt, RefSeq, and LocusLink. In addition, IDs in the fields of protein domains and motifs (Interpro), as well as protein structure (PDB) are listed. As the number of microarray platforms increases rapidly, also the number of columns in TOUCAN displaying chip-specific identifiers is constantly growing. Thereby, not only the world-leader in this technology, Affymetrix, is covered but also other platforms. Finally, the table presents also IDs, which are related to functional information of each gene, like the involvement in biological processes, or the localization to specific cellular compartments (GO, MIM). To look at this information in detail, you can highlight all rows and copy / paste the data into programs like MS EXCEL.
              
         
Annotate
                           

Previous <       > Next