We will be offering an R workshop December 18-20, 2019. Learn more.

Difference between revisions of "List.seqs"

From mothur
Jump to: navigation, search
m (New page: Given an input file, write out the names of the sequences found within it __NOTOC__ ==Options== fasta, name, group, alignreport; each takes a file name ==Required== One of the options m...)
 
Line 1: Line 1:
Given an input file, write out the names of the sequences found within it
+
The [[list.seqs]] command will write out the names of the sequences found within a [[fasta file | fasta]], [[name file | name]], [[group file | group]], or [[align.report file]].  This could be useful for using the [[get.seqs]] and [[remove.seqs]] commands as well as to generate a [[group file]].    To complete this analysis, you need to download the folder compressed in the [[Media:Esophagus.zip | Esophagus.zip]] archive.
  
 
__NOTOC__
 
__NOTOC__
  
 
==Options==
 
==Options==
fasta, name, group, alignreport; each takes a file name
+
At least one of the following options must be used.  Each of these options will generate a file ending in accnos that contains a single column containing the list of the sequences contained in the input file.
  
==Required==
+
===fasta option===
One of the options must be provided
+
The fasta option is used as presented in the following command:
  
==Output==
+
mothur > list.seqs(fasta=esophagus.fasta)
A file with the extension “.accnos” (i.e. accession numbers) in place of the previous extension; the content should be a single column and on each row should be a separate sequence accession
+
  
==Algorithm==
+
The resulting esophagus.accnos file looks something like:
# as you scan through the input file, put every sequence name into a list<string> container
+
 
# sort the list<string> container once it is filled
+
59_10_1
# print the list container out to the accnos file
+
59_10_10
 +
59_10_11
 +
59_10_13
 +
59_10_15
 +
59_10_16
 +
59_10_17
 +
...
 +
 
 +
 
 +
===name option===
 +
The name option is used as presented in the following command (be sure to run [[unique.seqs]] on esophagus.fasta first):
 +
 
 +
mothur > list.seqs(name=esophagus.names)
 +
 
 +
 
 +
 
 +
===group option===
 +
The group option is used as presented in the following command:
 +
 
 +
mothur > list.seqs(group=esophagus.groups)
 +
 
 +
 
 +
 
 +
===alignreport option===
 +
The alignreport option is used as presented in the following command:
 +
 
 +
mothur > list.seqs(alignreport=esophagus.align.report)

Revision as of 13:21, 30 July 2009

The list.seqs command will write out the names of the sequences found within a fasta, name, group, or align.report file. This could be useful for using the get.seqs and remove.seqs commands as well as to generate a group file. To complete this analysis, you need to download the folder compressed in the Esophagus.zip archive.


Options

At least one of the following options must be used. Each of these options will generate a file ending in accnos that contains a single column containing the list of the sequences contained in the input file.

fasta option

The fasta option is used as presented in the following command:

mothur > list.seqs(fasta=esophagus.fasta)

The resulting esophagus.accnos file looks something like:

59_10_1
59_10_10
59_10_11
59_10_13
59_10_15
59_10_16
59_10_17
...


name option

The name option is used as presented in the following command (be sure to run unique.seqs on esophagus.fasta first):

mothur > list.seqs(name=esophagus.names)


group option

The group option is used as presented in the following command:

mothur > list.seqs(group=esophagus.groups)


alignreport option

The alignreport option is used as presented in the following command:

mothur > list.seqs(alignreport=esophagus.align.report)