Dryad aims to make data archiving as simple and as rewarding as possible through a suite of services not necessarily provided by publishers or institutional websites:
Dryad hosts research data underlying scientific and medical publications. Historically, the repository has been strongest in the life sciences. Most data in the repository is associated with peer-reviewed journal articles, but data associated with non-peer reviewed publications from other reputable sources (such as dissertations) is also accepted. At this time, all Dryad submissions must be in English.
Most types of files can be submitted (e.g., text, spreadsheets, video, photographs, software code) including compressed archives of multiple files. Ordinarily, no more than 10GB of material are submitted for a single publication; larger data sets are accepted but will be subject to additional fees (starting 1 September 2013).
Data that was originally collected for another publication may be submitted as long as it is referenced by the current publication. Non-data files may also be submitted to Dryad provided the files are integral to the publication and can be released in the public domain; note that policies for the use of Dryad versus Supplementary Materials vary by journal.
While Dryad can host software scripts and snapshots of software source code, we recommend the use of a public software repository with version control for the ongoing maintenance of software packages. Software in Dryad must be released under a CC0 license.
Your data may be submitted at different points in the publication process. If the data accompany an article in a journal that has not integrated its submission process (see list of integrated journals) the best time to do so is immediately after notice of acceptance.
If you received an invitation to submit data to Dryad from a journal, then that journal has integrated its submission process with Dryad. Please follow the instructions from the journal to determine whether the data should be submitted before review (at the time of initial manuscript submission) or upon acceptance (but still prior to publication).
If a delayed-release data embargo is allowed by your journal, you may select that option at the time of submission, prior to publication. You may also submit data associated with previously published works (in which case you can simplify the submission process by entering the DOI or PubMedID to get started).
Frequently, journals provide guidelines for where to archive particular kinds of data and what community standards to follow. Dryad defers to journal editors and reviewers in setting and enforcing such standards.
Submitters should aim to provide sufficient data and descriptive information such that another researcher would be able to evaluate and reproduce the findings described in the publication. This will generally include any data that are used in statistical tests, as well the individual data points behind the published figures and tables. Submitters are advised to follow community data standards for the content and format of data files.
We strongly encourage you to include a ReadMe file that provides additional information to make sense of the files (e.g. instructions for use of software scripts, variable abbreviations, measurement units, and data codes).
Providing well-documented data in a community-accepted format is in the interest of the submitter since it encourages others to reuse and thus cite the original work.
The submission process is designed to be self-explanatory, but here is what you can expect. You will need to register the first time you come to Dryad and be signed in so that you can return to a submission at a later date should it be necessary. If you received a customized Dryad submission URL from your journal, please use that to enter the system so that the manuscript information can be automatically retrieved. If you are entering data for a previously published work, you may use a DOI or PubMedID to pull up the publication information. If the prior publication lacks a DOI and PubMedID, or if you are submitting data prior to publication in a journal that is not yet integrated with Dryad, you will first need to describe the publication (e.g. the author names, the title, the journal). You will need to make a legal affirmation on behalf of all the authors. Then you will upload your data files and ReadMe files and modify the metadata to your liking. You have the opportunity to save an incomplete submission and return to it later.
If you encounter any any difficulties, or realize that you need to make modifications after you have submitted your data package, please contact us through our web form or email help@datadryad.org.
A ReadMe file is intended to help ensure that the data can be correctly interpreted. You may submit one ReadMe for the data package as a whole and/or provide a separate ReadMe for each individual data file to supplement the file descriptions. We recommend that a ReadMe be a plain text file containing the following:
Data citation practices are actively evolving and vary among journals. Dryad does not have a recommendation for the placement of data citations at this time. Some publishing organizations, such as CrossRef, recommend reporting the data both in the text (e.g., within the Methods or in a dedicated Data Availability section) and in the Bibliography.
When referencing data in the text, we recommend the following as a template:
Data available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.[NNNN]
In the Bibliography, we recommend a citation similar to:
Heneghan C, Thompson M, Billingsley M, Cohen, D (2011) Data from: Medical-device recalls in the UK and the device-regulation process: retrospective review of safety notices and alerts. Dryad Digital Respository. http://dx.doi.org/10.5061/dryad.585t4
By default, data are released at the time the publication is made available online. Some journals that have integrated their submission process with Dryad allow authors to designate that individual files be released only after one year from the default release date. For these journals only, the embargo option can be selected within the Dryad interface at the time of data submission. If selected, the metadata for the file will be visible on the publication release date, but the file itself will not be downloadable. Archiving the data prior to publication guarantees that the data are safely archived while allowing the original authors to complete ongoing work. However, we encourage researchers to use this option sparingly; the risk of being scooped on a forthcoming publication is generally quite small, while the negative impact on citations of delaying the release of the data is likely to be more common. Note also that only the data required to support the claims made in the publication need to be archived. Of course, you are welcome to archive more if you like.
Since a guiding principle of Dryad is to make its contents freely available for research and educational use, there are no access costs for individual users or institutions. Instead, Dryad supports its nonprofit mission by recovering costs upfront. Starting 1-September-2013, Dryad will charge fees upon submission. The submitter is asked to pay this fee at the time of submission unless (a) the associated journal, or another organization, has already contracted with Dryad to cover the submission fee, or (b) the submitter is based in a country classified by the World Bank as a low-income or lower-middle-income economy. Additional submission fees will apply to data packages in excess of 10 GB and from journals without integrated submission. The price schedule is detailed here.
Dryad performs basic checks on each submission (can the files be opened? are they free of viruses? are they free of copyright restrictions? do they appear to be free of sensitive data?). The completeness and correctness of the metadata (e.g. information about the associated publication, the date on which any embargo is to be lifted, indexing keywords) are checked and the DOI is officially registered. The data packages are backed up by a digital library network to ensure the availability and integrity of the contents for the long-term. The information content of the original file is never intentionally modified or processed, but copies made be made in different file formats to facilitate preservation. The data package is publicly released once the online version of the associated publication is available unless an alternative release option has been selected, and the metadata are shared with indexing services to promote discoverability.
Yes, Dryad counts data package views and data file views and downloads, and displays this information on each data package and data file page. While considerable effort has been made to exclude automated web crawlers from Dryad’s counts, views and downloads may still be overestimates of human usage.
You may also monitor the impact of Dryad data packages and files through citations, sharing links on social media, etc. using emerging services such as ImpactStory.org, altmetric.com and the Data Citation Index. While some of these services do allow you to see who has discussed your data on social media, Dryad does not track, and cannot report, the identity of the individuals who view or download files.
The data hosted by Dryad have been dedicated to the public domain, to the extent possible under the law, under the terms of Creative Commons Zero (CC0), in order to minimize legal barriers and maximize the impact on research and education. However, CC0 does not waive other persons’ rights in the work (e.g., privacy) and, in many jurisdictions, does not waive your moral rights as an author (e.g., to the integrity of the work). Furthermore, while CC0 does not assert a legal requirement for attribution, community norms for scholarly communication do set expectations for how to attribute the original creators. Some partner journals provide authors the option of a limited-duration embargo (typically one year) from the time of publication before the data become available, and journal editors have the discretion to grant longer embargoes under special circumstances.
Note that a very small number of early submissions to Dryad are available under a license other than CC0.
No, all the content in Dryad is free to download and reuse.
When citing data found in Dryad, please cite both the original article as well as the Dryad data package. It is recommended that the data package be cited in the bibliography of the original publication so that the link between the publication and data is indexed by third party services. Dryad provides a generic citation string that includes authors, year, title, repository name and the Digital Object Identifier (DOI) of the data package, e.g.
Westbrook JW, Kitajima K, Burleigh JG, Kress WJ, Erickson DL, Wright SJ (2011) Data from: What makes a leaf tough? Patterns of correlated evolution between leaf toughness traits and demographic rates among 197 shade-tolerant woody species in a neotropical forest. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.8525
Dryad also assigns a DOI to each data file, which should only be used in contexts where the citation to the data package as a whole is already understood or would not be necessary (such as when referring to the specific file used as part of the methods section of an article).
If you are using a large number of data sources, it may be necessary to provide a list of the relevant data packages/files rather than citing each individually in the References. The list can then be submitted to Dryad so others who read your publication can locate all of the original data.
Data citations can be easily downloaded using the links at the bottom of the Citation Box on the data package page. Citation can currently be downloaded in two generic formats: RIS (compatible with software such as EndNote, Reference Manager, ProCite, and RefWorks) and BibTex (compatible with software such as LaTeX and BibDesk)
You may also use links in the Citation Box to bookmark or share a link to the data package on a variety of platforms (e.g. Delicious, Facebook, Twitter and Reddit).
A data package is the entire set of data files associated with one publication plus the metadata describing the combined set.
Journals may work with Dryad in a number of ways. First, a journal may have integrated its submission process with Dryad. The journal may choose a number of options, e.g. directing authors to submit data either before review or after acceptance and whether to allow data embargoes or not. Second, journals or their parent organizations may handle submission fees on behalf of authors (starting September 1, 2013). Third, journals and their parent organizations may participate in Dryad’s governance through Membership. To see what relationship, if any, your journal has to Dryad, please consult this Table. We welcome inquiries from journals not yet listed, and encourage authors to suggest Dryad as a data archiving service to their editors.
Each data package and data file in Dryad is assigned a Digital Object Identifier, or DOI, a permanent, globally unique, and resolvable identifier that is an important component of a data citation.
We recommend using the URL form of the DOI (e.g. “http://dx.doi.org/10.5061/dryad.6h722”) whenever referencing a data package or data file.
The DOI for each data file is composed of the base DOI for the data package “http://dx.doi.org/10.5061/dryad.[NNNN]” with the version number appended as “/1″, “/2″, etc. When a new version of a data package or data file are released, the DOI is updated with the suffix “.2″, “.3”, etc. Thus the third version of the second data file in the second version of a Dryad data package would have a DOI of the form http://dx.doi.org/10.5061/dryad.[NNNN].2/2.3. Note that the DOI without the version suffix will resolve to the most recent version, and the presence of older versions will be indicated.
Dryad DOIs are assigned through the EZID service from the California Digital Library and are registered by DataCite.
All data submitted to Dryad is released to the public domain under Creative Commons Zero (CC0), which reduces legal and technical impediments to the reuse of data by waiving copyright and related rights to the extent permitted by law. In most cases, CC0 does not actually affect the legal status of your data, since facts in and of themselves are not eligible for copyright in most countries (e.g. see this commentary from Bitlaw regarding U.S. copyright law).
CC0 does not exempt those who reuse the data from following community norms for scholarly communication, in particular from citation of the original data authors. On the contrary, by removing unenforceable legal barriers, CC0 facilitates the discovery, reuse, and citation of that data. Any publication that makes substantive reuse of the data is expected to cite both the data package and the original publication from which it was derived.
"Community norms can be a much more effective way of encouraging positive behaviour, such as citation, than applying licenses. A well functioning community supports its members in their application of norms, whereas licenses can only be enforced through court action and thus invite people to ignore them when they are confident that this is unlikely." (Panton Principles FAQ)
"…when you federate a query from 50,000 databases (not now, perhaps, but definitely within the 70-year duration of copyright!) will you be liable to a lawsuit if you don’t formally attribute all 50,000 owners?" (Science Commons Database Protocol FAQ)
Similarly, we do not support the use of licenses that (questionably) assert copyright and add well-meaning, but potentially problematic, restrictions on reuse, such as “non-commercial”, “no derivative” and "share-alike" conditions.
"... given the potential for significantly negative unintended consequences of using copyright, the size of the public domain, and the power of norms inside science, we believe that copyright licenses and contractual restrictions are simply the wrong tool [for data], even if those licenses and contracts are used with the best of intentions." (Science Commons Database Protocol FAQ)
Dryad’s use of CC0 to make the terms of reuse explicit has some important advantages:
It is important to note that if you have data that, due to pre-existing agreements, cannot be released under the terms of CC0, please do not submit that data to Dryad. Journals that require data archiving as a condition of publication can make exceptions for such special cases.
Yes, we encourage researchers to include Dryad as part of their data management plan when applying for grants or developing a data management strategy for their institution. Data submitted to Dryad can help researchers meet expectations (e.g. for funders in the US and UK) that data be archived for long-term preservation, available for reuse at no cost and with open, explicit terms of reuse. We recommend consulting an online data management planning tool such as DMPTool or DMPOnline.
See Pricing Plans and Submission Fees
All text on this page is available under a
CC-BY 3.0 license.
Last revised: 2013-04-12
Learn more about: