Bioinformatics for Beginners – File formats: Part 1. Reference sequences
The most widely used file format for reference sequences is the fasta format. Both nucleotide and protein sequences can be represented in fasta format. A fasta formatted file begins with a single-line description, followed by the sequence data. The description line starts with a greater-than (“>”) symbol. In the...