Show simple item record

dc.contributor.author Hinchliff, Cody E.
dc.date.accessioned 2012-11-15T18:36:34Z
dc.date.available 2012-11-15T18:36:34Z
dc.date.issued 2012-11-15
dc.identifier doi:10.5061/dryad.6p76c3pb/10
dc.identifier doi:10.5061/dryad.6p76c3pb/10
dc.identifier.uri http://hdl.handle.net/10255/dryad.41116
dc.description This script accesses a directory, and traverses all FASTA files in it, recording the names of all taxa present in each file. Then it creates a tab-delimited file containing a matrix where the rows represent the taxa and the columns the FASTA files. The intended use is for a directory containing a set of FASTA files each corresponding to a single locus, and containing homologous sequences of that locus for different taxa. The script will record a 1 in the resulting matrix if a taxon is present in a locus file, or a 0 if not. Key point: the script does not intelligently differentiate FASTA files from other types, and it will attempt to parse any file in the directory. For this reason, you should remove all other files before you run the script. It will create (or overwrite!) a file in the passed directory called 'sampling_matrix.txt' that may be opened in any conventional spreadsheet or text-editor app. This file should be in the proper format for use in the Decisivator application. This script requires BioPython to be installed.
dc.relation.ispartof doi:10.5061/dryad.6p76c3pb
dc.rights.uri http://creativecommons.org/publicdomain/zero/1.0/
dc.subject decisiveness
dc.title makesamplingmatrix
dc.type Dataset *
.dryad.pageviews 401
.dryad.downloads 197

Files in this item