Dryad aims to make data archiving as simple and as rewarding as possible through a suite of services not necessarily provided by publishers or institutional websites:
Submission integration is a free service that allows journal publishers to coordinate the submission of manuscripts with submission of data to Dryad, thus streamlining the process and decreasing the burden on authors. Learn more about submission integration.
Dryad hosts research data underlying scientific and medical publications. Historically, the repository has been strongest in the life sciences. Most data in the repository is associated with peer-reviewed journal articles, but data associated with non-peer reviewed publications from other reputable sources (such as dissertations and books) is also accepted. At this time, all descriptive information must be in English.
Most types of files can be submitted (e.g., text, spreadsheets, video, photographs, software code) including compressed archives of multiple files. View additional guidance on preservation-friendly file types.
Data sets larger than 20GB are subject to additional charges.
Data that was originally collected for another publication may be submitted as long as it is referenced by the current publication. Non-data files may also be submitted to Dryad provided the files are integral to the publication and can be released to the public domain. Note that policies for the whether to submit non-data Supplementary Materials to the journal or to Dryad vary among journals.
Dryad does not accept submissions that contain personally identifiable human subject information. Human subjects data must be properly anonymized and prepared under applicable legal and ethical guidelines. Please see additional guidance on human subjects data.
Dryad does not accept any files with licensing terms that are incompatible with the Creative Commons Zero waiver.
While Dryad can host software scripts and snapshots of software source code, we recommend the use of a public software repository with version control for the ongoing maintenance of software packages. Software in Dryad must be released with a Creative Commons Zero waiver.
Data archived in Dryad are publicly available, and any human subjects data must be properly anonymized and prepared under applicable legal and ethical guidelines. When de-identifying your data, both direct and indirect identifiers need to be considered. Dryad does not allow any direct identifiers, but a dataset may contain up to 3 indirect identifiers. Direct identifiers include variables such as participant’s name, initials, email, and postal code; indirect identifiers are data that if combined might lead to identification (see additional guidance on human subjects data).
Data archived in Dryad are publicly available. Potential risks to endangered species, and species vulnerable to poaching or retaliation killing, must be carefully assessed before submitting your data. Specifically, publishing location data and habitat descriptions for threatened species may expose them to hunters, poachers and illegal trade, which can lead to their further decline. We ask that researchers determine the vulnerability of high-risk cases and consider masking or removing the exact geo-coordinates of endangered species from their data set (see additional guidance on threatened species data).
Due to the diversity of datatypes hosted by the repository, Dryad does not require a single file format, or even set of file formats, but rather encourages users to follow accepted community standards where they exist. Users are strongly encouraged to ensure that data can be easily extracted from the files provided (e.g., by using CSV rather than PDF). View additional guidance and a list of preservation-friendly file formats.
Your data may be submitted at various points in the publication process, depending on your publication outlet. Journals that are integrated with Dryad have specific requirements. Look up your journal to determine the proper workflow.
This depends on the current workflow stage of your submission (see below). Please do not create multiple Dryad submissions associated with the same publication.
Some integrated journals enable their authors to make data privately available to editors and reviewers during the article review process. (Look up your journal to determine whether it uses data review). Dryad does not allow files to be deleted while an article is under review. We only allow additions during this time. This restriction ensures that all reviewers see the files that were originally submitted.
To add new or corrected files:
If/when your publication is accepted, your Dryad submission will automatically move out of review and into our queue for curation and archiving, and will no longer be accessible for modification.
You may have received a message from Dryad that your data package was approved, but will not be available in the repository until the associated article is published. Modifying data packages during this stage requires curator assistance. Please send an email to help@datadryad.org including your Dryad DOI and detailing the changes needed.
If any substantive changes to data packages are needed after archiving, a new version of the package must be created by Dryad curators. See Dryad's versioning policy. Please send an email to help@datadryad.org including your Dryad DOI and detailing the changes needed.
Providing well-documented data in a community-accepted format is in the interest of the submitter since it encourages others to reuse and thus cite the original work. It is the shared responsibility of the submitter, the editors, and the reviewers to ensure the data are provided according to community standards. In some cases, journals provide explicit guidelines in their instructions to authors. Please also see our best practices for creating reusable Dryad data packages.
All human subjects data must be properly anonymized and prepared under applicable legal and ethical guidelines. The following article may be helpful in preparing your human subjects data for archiving: Hrynaszkiewicz I, Norton ML, Vickers AJ, Altman DG (2010) Preparing raw clinical data for publication: guidance for journal editors, authors, and peer reviewers. BMJ 340, c181. https://doi.org/10.1136/bmj.c181
See our best practices for creating reusable Dryad data packages
.The submission process is designed to be self-explanatory, but here is what you can expect:
If you encounter any difficulties, or realize that you need to make modifications after you have submitted your data package, please contact us through our web form or email help@datadryad.org.
See how to provide a description of all details relevant to a data package
Data citation practices are actively evolving and vary among journals. Dryad does not have a recommendation for the placement of data citations at this time. Some publishing organizations, such as CrossRef, recommend reporting the data both in the text (e.g., within the Methods or in a dedicated Data Availability section) and in the Bibliography.
When referencing data in the text, we recommend the following as a template (substitute your DOI suffix for the xxxxx):
Data available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.xxxxx
In the Bibliography, we recommend a citation similar to:
Heneghan C, Thompson M, Billingsley M, Cohen, D (2011) Data from: Medical-device recalls in the UK and the device-regulation process: retrospective review of safety notices and alerts. Dryad Digital Repository. https://doi.org/10.5061/dryad.585t4
By default, data are released at the time the publication is made available online. Some journals that have integrated their submission process with Dryad allow authors to designate that individual files be released one year after the date the publication is made available online. The embargo option can be selected within the Dryad interface at the time of data submission. Longer embargoes can be granted only with written permission from the journal editor or publisher. If selected, the metadata for the file will be visible on the publication release date, but the file itself will not be downloadable until the embargo expires.
Embargoing the data prior to publication guarantees that the data are safely archived, but not accessible, while the original authors complete ongoing work. However, we encourage researchers to use this option sparingly; the risk of being scooped on a forthcoming publication is generally quite small, while a negative citation impact from delaying the release of the data is likely to be widespread. Note also that only the data required to support the claims made in the publication need to be archived. Of course, you are welcome to archive more if you like.
Since a guiding principle of Dryad is to make its contents freely available for research and educational use, there are no access costs for individual users or institutions. Instead, Dryad supports its nonprofit mission by recovering costs up front.
Dryad collects a Data Publishing Charge (DPC) of $120US upon data publication, unless there is a sponsor or fee waiver in place. Additional charges apply to data packages in excess of 20GB. See payment details.
Your credit card is verified by our payment processing system when you submit your data, but you will be not be charged until our curatorial team approves your data package. If your publication status is "in preparation or in review," you will be not be charged unless/until your publication is accepted.
You will receive an email confirmation and receipt when your card is charged.
Dryad performs basic checks on each submission (can the files be opened? are they free of viruses? are they free of copyright restrictions? do they appear to be free of sensitive data?). The completeness and correctness of the metadata (e.g. information about the associated publication, the date on which any embargo is to be lifted, indexing keywords) are checked and the DOI is officially registered.
The data package is publicly released once the online version of the associated publication is available (unless an alternative release option has been selected). Since paper titles, abstracts, authors etc. often change during the publication process, Dryad confirms and updates this information against the accepted/published article.
Dryad metadata are shared with indexing services to promote discoverability. Data packages are accessible and backed up for long-term preservation within the DataONE network. The information content of the original file is never intentionally modified or processed, but copies may be made in different file formats to facilitate preservation.
If substantive changes to data packages are needed after archiving, a new version of the package must be created (with curator assistance). Read Dryad's versioning policy. Please contact help@datadryad.org if you need to correct or update a data package that is already publicly available.
Yes, Dryad counts data package views and data file views and downloads, and displays this information on each data package and data file page. While considerable effort has been made to exclude automated web crawlers from Dryad’s counts, views and downloads may still be overestimates of human usage.
You may also monitor the impact of Dryad data packages and files through citations, sharing links on social media, etc. using emerging services such as ImpactStory.org, altmetric.com and the Data Citation Index. While some of these services do allow you to see who has discussed your data on social media, Dryad does not track, and cannot report, the identity of the individuals who view or download files.
The data hosted by Dryad have been dedicated to the public domain, to the extent possible under the law, under the terms of Creative Commons Zero (CC0), in order to minimize legal barriers and maximize the impact on research and education. However, CC0 does not waive other persons’ rights in the work (e.g., privacy) and, in many jurisdictions, does not waive your moral rights as an author (e.g., to the integrity of the work).
Furthermore, while CC0 does not assert a legal requirement for attribution, community norms for scholarly communication do set expectations for how to attribute the original creators. Some partner journals provide authors the option of a limited-duration embargo (typically one year) from the time of publication before the data become available, and journal editors have the discretion to grant longer embargoes under special circumstances.
Note that a very small number of early submissions to Dryad are available under a license other than CC0.
No, all the content in Dryad is free to download and reuse.
When citing data found in Dryad, please cite both the original article as well as the Dryad data package. It is recommended that the data package be cited in the bibliography of the original publication so that the link between the publication and data is indexed by third party services. Dryad provides a generic citation string that includes authors, year, title, repository name and the Digital Object Identifier (DOI) of the data package, e.g.
Westbrook JW, Kitajima K, Burleigh JG, Kress WJ, Erickson DL, Wright SJ (2011) Data from: What makes a leaf tough? Patterns of correlated evolution between leaf toughness traits and demographic rates among 197 shade-tolerant woody species in a neotropical forest. Dryad Digital Repository. https://doi.org/10.5061/dryad.8525
Dryad also assigns a DOI to each data file, which should only be used in contexts where the citation to the data package as a whole is already understood or would not be necessary (such as when referring to the specific file used as part of the methods section of an article).
If you are using a large number of data sources, it may be necessary to provide a list of the relevant data packages/files rather than citing each individually in the References. The list can then be submitted to Dryad so others who read your publication can locate all of the original data.
Data citations can be easily downloaded using the links at the bottom of the Citation Box on the data package page. Citation can currently be downloaded in two generic formats: RIS (compatible with software such as EndNote, Reference Manager, ProCite, and RefWorks) and BibTex (compatible with software such as LaTeX and BibDesk)
You may also use links in the Citation Box to bookmark or share a link to the data package on a variety of platforms (e.g. Delicious, Facebook, Twitter and Reddit).
A data package is the entire set of data files associated with one publication plus the metadata describing the combined set.
Journals may work with Dryad in a number of ways.
To see what relationship, if any, your journal has to Dryad, please look up your journal.
We welcome inquiries from journals not yet listed, and encourage authors to suggest Dryad as a data archiving service to their editors.
Each data package and data file in Dryad is assigned a Digital Object Identifier, or DOI, a permanent, globally unique, and resolvable identifier that is an important component of a data citation.
We recommend using the URL form of the DOI (e.g. “https://doi.org/10.5061/dryad.6h722”) whenever referencing a data package or data file.
The DOI for each data file is composed of the base DOI for the data package “https://doi.org/10.5061/dryad.xxxxx” with the file number appended as “/1″, “/2″, etc.
When a new version of a data package or data file are released, the DOI is updated with the suffix “.2″, “.3”, etc. Thus, the third version of the second data file in the second version of a Dryad data package would have a DOI of the form https://doi.org/10.5061/dryad.xxxxx.2/2.3. Note that the DOI without the version suffix will resolve to the most recent version, and the presence of older versions will be indicated.
Dryad DOIs are assigned through the DataCite system.
All data submitted to Dryad is released to the public domain under Creative Commons Zero (CC0), which reduces legal and technical impediments to the reuse of data by waiving copyright and related rights to the extent permitted by law. In most cases, CC0 does not actually affect the legal status of your data, since facts in and of themselves are not eligible for copyright in most countries (e.g. see this commentary from Bitlaw regarding U.S. copyright law).
CC0 does not exempt those who reuse the data from following community norms for scholarly communication, in particular from citation of the original data authors. On the contrary, by removing unenforceable legal barriers, CC0 facilitates the discovery, reuse, and citation of that data. Any publication that makes substantive reuse of the data is expected to cite both the data package and the original publication from which it was derived.
"Community norms can be a much more effective way of encouraging positive behaviour, such as citation, than applying licenses. A well functioning community supports its members in their application of norms, whereas licenses can only be enforced through court action and thus invite people to ignore them when they are confident that this is unlikely." (Panton Principles FAQ)
"…when you federate a query from 50,000 databases (not now, perhaps, but definitely within the 70-year duration of copyright!) will you be liable to a lawsuit if you don’t formally attribute all 50,000 owners?" (Science Commons Database Protocol FAQ)
Similarly, we do not support the use of licenses that (questionably) assert copyright and add well-meaning, but potentially problematic, restrictions on reuse, such as “non-commercial”, “no derivative” and "share-alike" conditions.
"... given the potential for significantly negative unintended consequences of using copyright, the size of the public domain, and the power of norms inside science, we believe that copyright licenses and contractual restrictions are simply the wrong tool [for data], even if those licenses and contracts are used with the best of intentions." (Science Commons Database Protocol FAQ)
Dryad’s use of CC0 to make the terms of reuse explicit has some important advantages:
It is important to note that if you have data that, due to pre-existing agreements, cannot be released under the terms of CC0, please do not submit that data to Dryad. Journals that require data archiving as a condition of publication can make exceptions for such special cases.
Data packages in Dryad are replicated across multiple systems to support failover, improve access times, allow recovery from disk failures, and preserve bit integrity. The data packages are discoverable and backed up for long-term preservation within the DataONE network.
Dryad's content is not currently being replicated through the CLOCKSS digital library network. A CLOCKSS working group is developing a new and separate set of guidelines for databases; when these are in place, it may be possible for Dryad to enter into a new agreement with CLOCKSS.
Yes, we encourage researchers to include Dryad as part of their data management plan when applying for grants or developing a data management strategy for their institution. Data submitted to Dryad can help researchers meet expectations (e.g. for funders in the US and UK) that data be archived for long-term preservation, available for reuse at no cost and with open, explicit terms of reuse. We recommend consulting an online data management planning tool such as DMPTool or DMPOnline.
See Payment plans and Data Publishing Charges
All text on this page is available under a
CC-BY 3.0 license.
(opens in a new window)
We encourage organizations to: