Fasta format
General information
This format contains a one line header followed by lines of sequence data. Sequences in Fasta formatted files are preceded by a line starting with a »>« symbol. The first word on this line is the name of the sequence. The rest of the line is a description of the sequence. The remaining lines in that format should contain sequence information itself.
The Fasta format is one of the most often used formats in bioinformatics. The Fasta format is easy to handle and contains no additional information (nonsequence characters) within the sequence.
Example
>FOSB_MOUSE Protein fosB. 338 bp MFQAFPGDYDSGSRCSSSPSAESQYLSSVDSFGSPPTAAASQECAGLGEMPGSFVPTVTA ITTSQDLQWLVQPTLISSMAQSQGQPLASQPPAVDPYDMPGTSYSTPGLSAYSTGGASGS GGPSTSTTTSGPVSARPARARPRRPREETLTPEEEEKRRVRRERNKLAAAKCRNRRRELT DRLQAETDQLEEEKAELESEIAELQKEKERLEFVLVAHKPGCKIPYEEGPGPGPLAEVRD LPGSTSAKEDGFGWLLPPPPPPPLPFQSSRDAPPNLTASLFTHSEVQVLGDPFPVVSPSY TSSFVLTCPEVSAFAGAQRTSGSEQPSDPLNSPSLLAL
Please direct questions and comments to Martin Haubrock.