Applied Bioinformatics [Databases]

Exercises

UniProt/Swiss-Prot

The UniProt database is a central repository of protein sequence and function, created by combining the information contained in Swiss-Prot, TrEMBL, and PIR. The Swiss-Prot Protein Sequence Database is a resource for protein sequences produced in collaboration between the University of Geneva and the EMBL Data Library. It is a curated protein sequence database that provides a high level of annotation (such as function, domain structure, post-translational modifications, and variants), and its entries are annotated manually from the literature by researchers. TrEMBL is the computer-annotated supplement to Swiss-Prot, whose entries have not been manually curated. It consists of entries in Swiss-Prot-like format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except those CDS already included in Swiss-Prot. The Protein Information Resource (PIR), located at Georgetown University, joined the UniProt consortium in 2002. It also provides functionally annotated protein sequences.

A subset of UniProt is UniProtKB (UniProt Knowledgebase), which consists of Swiss-Prot and TrEMBL.
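
In practice, the two sections of UniProtKB can be told apart from the header line of a FASTA download: Swiss-Prot entries begin with "sp|" and TrEMBL entries with "tr|", followed by the accession and the entry name. The following is a minimal Python sketch of that distinction, not an official parser. The function name classify_header is hypothetical, and the TrEMBL header in the usage example is a made-up placeholder rather than a real entry.

```python
from typing import Tuple

# Prefixes used in UniProtKB FASTA headers for the two sections.
SECTION_BY_PREFIX = {"sp": "Swiss-Prot", "tr": "TrEMBL"}


def classify_header(header: str) -> Tuple[str, str, str]:
    """Return (section, accession, entry_name) for a UniProtKB FASTA header.

    Hypothetical helper for illustration; it assumes the layout
    '>db|accession|entry_name description ...'.
    """
    fields = header.strip().lstrip(">").split("|", 2)
    if len(fields) != 3 or fields[0] not in SECTION_BY_PREFIX:
        raise ValueError(f"not a UniProtKB FASTA header: {header!r}")
    prefix, accession, rest = fields
    # The entry name is the first whitespace-delimited token after the second '|'.
    name_parts = rest.split(maxsplit=1)
    entry_name = name_parts[0] if name_parts else ""
    return SECTION_BY_PREFIX[prefix], accession, entry_name


if __name__ == "__main__":
    examples = [
        ">sp|P69905|HBA_HUMAN Hemoglobin subunit alpha OS=Homo sapiens OX=9606 GN=HBA1 PE=1 SV=2",
        # Made-up placeholder header, used only to show the 'tr' prefix.
        ">tr|X0X0X0|X0X0X0_EXAMP Placeholder protein",
    ]
    for line in examples:
        print(classify_header(line))
```

Splitting on the first two "|" characters only keeps any later "|" inside the description intact, and rejecting unknown prefixes avoids silently misclassifying headers from other databases.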


Please direct questions and comments to Martin Haubrock.