Data from: Using supermatrices for phylogenetic inquiry: an example using the sedges

 

Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data. CC0 (opens a new window) Open Data (opens a new window)

Title phase1_at_nf.phy
Downloaded 17681 times
Description Nucleotide data for Cyperaceae species from various markers, collected from GenBank using the software tool "phlawd". This alignment corresponds to the fully unfiltered alignment--labeled AT/NF in our manuscript.
Download phase1_at_nf.phy.reduced (30.57Mb)
Details View File Details
Title phase2_at_rf.phy
Downloaded 207 times
Description Nucleotide data for Cyperaceae species from various markers, collected from GenBank using the software tool "phlawd". This alignment corresponds to the unscaffolded, rogues-filtered alignment--labeled AT/RF in our manuscript.
Download phase2_at_rf.phy.reduced (26.56Mb)
Details View File Details
Title phase3_sc_nf.phy
Downloaded 204 times
Description Nucleotide data for Cyperaceae species from various markers, collected from GenBank using the software tool "phlawd". This alignment corresponds to the scaffolded alignment with rogues unfiltered--labeled SC/NF in our manuscript.
Download phase3_sc_nf.phy.reduced (7.766Mb)
Details View File Details
Title phase4_sc_rf.phy
Downloaded 210 times
Description Nucleotide data for Cyperaceae species from various markers, collected from GenBank using the software tool "phlawd". This alignment corresponds to the maximally filtered alignment: scaffolded and having had rogues removed--labeled SC/RF in our manuscript.
Download phase4_sc_rf.phy.reduced (6.976Mb)
Details View File Details
Title cyp_states
Downloaded 204 times
Description Latitudinal range data for all currently recognized species of Cyperaceae from Govaerts et al. (2007), World Checklist of Cyperaceae. Range data are encoded as tropical (state 1) or extratropical (state 0), and represent the position of the latitudinal midpoint of each species range, as estimated based on the geographic distribution data encoded within the World Checklist of Cyperaceae referenced above.
Download cyp_states.csv (113.3Kb)
Details View File Details
Title Appendix_I
Downloaded 181 times
Description Formulas for summary statistics used in the creation of Figure 1 from the text.
Download Appendix_I.pdf (37.14Kb)
Details View File Details
Title Appendix_II
Downloaded 175 times
Description ML bootstrap majority rule consensus tree topologies from 300-replicate RAxML bootstrap searches using alignments 1-3 described in the text. Branch labels are bootstrap proportions.
Download Appendix_II.pdf (872.1Kb)
Details View File Details
Title filter_fasta.py
Downloaded 370 times
Description Usage: ./filter_fasta.py [path to input dir] [path to accepted taxon list]. Input files are expected to be in fasta format. The script will traverse all files in the input dir, so the input dir should contain only fasta files. The taxon list should be a line-delimited text file containing the names of tips as they correspond to those in the fasta alignments.
Download filter_fasta.py (1.686Kb)
Details View File Details
Title instability_multicore.py
Downloaded 239 times
Description This script will calculate I^s scores, as described in (Hinchliff, C. E. and E. H. Roalson. 2012. Using supermatrices for phylogenetic inquiry: an example using the sedges. Systematic Biology). It requires a set of trees sharing a common set of tips, to be input as a newick file (though any format readable by dendropy should be trivial to use, just change the format in the appropriate line). It outputs a comma-delimited table containing the raw instability scores (the numerator from the right side of the equation in the referenced paper), as well as the scaled I^s scores. Taxa that move more have higher scores.
Download instability_multicore.py (7.832Kb)
Details View File Details
Title makesamplingmatrix
Downloaded 235 times
Description This script accesses a directory, and traverses all FASTA files in it, recording the names of all taxa present in each file. Then it creates a tab-delimited file containing a matrix where the rows represent the taxa and the columns the FASTA files. The intended use is for a directory containing a set of FASTA files each corresponding to a single locus, and containing homologous sequences of that locus for different taxa. The script will record a 1 in the resulting matrix if a taxon is present in a locus file, or a 0 if not. Key point: the script does not intelligently differentiate FASTA files from other types, and it will attempt to parse any file in the directory. For this reason, you should remove all other files before you run the script. It will create (or overwrite!) a file in the passed directory called 'sampling_matrix.txt' that may be opened in any conventional spreadsheet or text-editor app. This file should be in the proper format for use in the Decisivator application. This script requires BioPython to be installed.
Download makesamplingmatrix.py (2.951Kb)
Details View File Details

When using this data, please cite the original publication:

Hinchliff CE, Roalson EH (2012) Using supermatrices for phylogenetic inquiry: an example using the sedges. Systematic Biology 62(2): 205-219. http://dx.doi.org/10.1093/sysbio/sys088

Additionally, please cite the Dryad data package:

Hinchliff CE, Roalson EH (2012) Data from: Using supermatrices for phylogenetic inquiry: an example using the sedges. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.6p76c3pb
Cite | Share
Download the data package citation in the following formats:
   RIS (compatible with EndNote, Reference Manager, ProCite, RefWorks)
   BibTex (compatible with BibDesk, LaTeX)