Hinchliff, Cody E. 2012-11-15T18:36:34Z 2012-11-15T18:36:34Z 2012-11-15
dc.identifier doi:10.5061/dryad.6p76c3pb/10
dc.description This script traverses every file in a given directory, parses each one as FASTA, and records the names of all taxa present in it. It then writes a tab-delimited matrix in which the rows represent the taxa and the columns represent the FASTA files. The intended input is a directory of FASTA files, each corresponding to a single locus and containing homologous sequences of that locus for different taxa. The script records a 1 in the matrix if a taxon is present in a locus file, or a 0 if it is not. Key point: the script does not distinguish FASTA files from other file types and will attempt to parse any file in the directory, so remove all other files before running it. The script creates (or overwrites!) a file called 'sampling_matrix.txt' in the passed directory; this file may be opened in any conventional spreadsheet or text editor and should be in the proper format for use in the Decisivator application. The script requires BioPython to be installed.
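The behaviour described above can be illustrated with a minimal sketch. This is not the deposited script: it reads FASTA header lines with plain Python instead of BioPython so that it runs without extra dependencies, and the function names `read_taxa` and `make_sampling_matrix`, the use of the first header token as the taxon name, and the `taxon` label in the header row are all assumptions made for illustration. The exact layout Decisivator expects is not stated in this record.

```python
import os
import tempfile


def read_taxa(path):
    """Return the set of taxon names found in one FASTA file.

    Assumption: the taxon name is the first whitespace-delimited token
    after '>' on each header line.
    """
    taxa = set()
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                fields = line[1:].split()
                if fields:
                    taxa.add(fields[0])
    return taxa


def make_sampling_matrix(directory, out_name="sampling_matrix.txt"):
    """Write a tab-delimited taxon-by-file presence/absence matrix."""
    loci = {}
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        # Every regular file is treated as FASTA, as the description warns.
        # Skipping a previous output file is a choice made in this sketch.
        if name == out_name or not os.path.isfile(path):
            continue
        loci[name] = read_taxa(path)

    all_taxa = sorted(set().union(*loci.values()))
    out_path = os.path.join(directory, out_name)
    with open(out_path, "w") as out:
        # Header row: one column per file (hypothetical layout).
        out.write("taxon\t" + "\t".join(loci) + "\n")
        for taxon in all_taxa:
            row = ["1" if taxon in loci[name] else "0" for name in loci]
            out.write(taxon + "\t" + "\t".join(row) + "\n")
    return out_path


# Small demonstration on two made-up locus files in a temporary directory.
with tempfile.TemporaryDirectory() as demo_dir:
    with open(os.path.join(demo_dir, "locus1.fasta"), "w") as f:
        f.write(">taxonA\nACGT\n>taxonB\nACGA\n")
    with open(os.path.join(demo_dir, "locus2.fasta"), "w") as f:
        f.write(">taxonB\nTTGA\n>taxonC\nTTGC\n")
    with open(make_sampling_matrix(demo_dir)) as f:
        lines = f.read().splitlines()
```

In the demonstration, `lines` holds one header row followed by one row per taxon, with a 1 or 0 under each locus file. A file that is not FASTA would simply contribute no taxa in this sketch, whereas the deposited script may fail on it, which is why the description advises clearing the directory first.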
dc.relation.ispartof doi:10.5061/dryad.6p76c3pb
dc.subject decisiveness
dc.title makesamplingmatrix
dc.type Dataset *
.dryad.pageviews 417
.dryad.downloads 234
