We will be offering a mothur workshop March 30-April 1. Learn more.
Given a list of accession numbers (i.e. sequence names) and one or more file formats, generate a new file that contains only those sequences.
accnos, fasta, name, group, alignreport; each takes a file name
accnos and one of fasta/name/group/alignreport
- read accnos file into a set<string> container, close the file
- read through the file to be parsed and for each entry, if the sequence name is in the set<string> container:
- spit the data out to the new file and delete the entry from the set<string> container
- otherwise do nothing