1 of 2

Finding and Preparing Published Data

BETYdb is designed for both previously published data and 'primary' data. Most of this documentation assumes that you have already identified a data set that you want to upload, or have a set of papers from which you would like to extract data and summary statistics.

Meta-analyses

If you are planning to do a meta-analysis, even if this is not your first time, please read 'Uses and Misuses of Meta-analysis in Ecology" \cite{Koricheva_2014}. Many texts are available, but the recent "Handbook of Meta-analysis in Ecology and Evolution" is probably the most comprehensive and specific for plant sciences.

For a meta-analysis, the first step is to find papers that contain the target data.

The easiest approach to use a search engine such as , , or . Starting with queries such as "scientific name + trait", and allowing these results to guide further queries. Often, the references (particularly of meta-analyses and reviews) and forward citations will point to other studies.

Another starting point for the programmatically inclined - which aids in documenting searches - is to submit queries programmatically. Carl Davidson wrote a to search for citations based on species and trait name. In addition, the rOpenSci project has a .

Preparing Published Data

Mendeley

Mendeley provides a central location for the collection, annotation, and tracking of the journal articles that we use. Features of Mendeley that are useful to us include:

Collaborative annotation & notes sharing
- Text highlighter
- Sticky notes for comments in the text
- Notes field for text notes in the reference documentation
Read/ unread & favorites:
Papers can be marked as read or unread, and may be starred.
Groups
Tagging

Each project has two groups: "projectname" and "projectname_out" for the papers with data to be entered and for the papers with data that has been entered, respectively. Papers in the _out group may contain data for future entry (for example, traits that are not listed in Table \ref{tab:traits}).

Each project manager may have one or more projects and each project should have one group. Group names should refer to plant species, plant functional types, or another project specific name. Please make sure that David LeBauer is invited to join each project folder.

Open Mendeley desktop
Click Edit → New Group or Ctrl+Shift+M
Create group name following instructions above

Adding and Annotating Papers

When naming a group, tag folders so that instructions for a technician would include the folder and the tag to look for, e.g. "please enter data from projectx" or "please enter data from papers tagged y from project x". To access the full text and PDF of papers from off campus, use the service. If you are managing a Mendeley folder that undergraduates are actively entering data from, please plan to spend between 15 min and 1 hour per week maintaining it - enough to keep up with the work that the undergraduates are doing.

Adding a reference

If the DOI number is available (most articles since 2000)
1. Select project folder
2. Right click and select Add entry manually...

Annotating a Reference

Each week, please identify and prepare papers that you would like to be entered next by completing the following steps:

Use the star label to identify the papers that you want the student to focus on next.
- Start by keeping a minimum of 2 and a maximum of 5 highlighted
  at once so that students can focus on the ones that you want.
  Students have been entering 1-3 papers per week, once we get

Comment or Highlight the following information

Sample size
Covariates (see table \ref{tab:covariates})
Treatments

Finding a citation in Mendeley

To find a citation in Mendeley, go to the project folder. By default, data entry technicians should enter data from papers which have been indicated by a yellow star and in the order that they were added to the list. Information and data to be collected from a paper can be found under the 'Notes' tab and in highlighted sections of the paper.

Recording extracted data and transformations

Google Spreadsheets are used to keep a record of any data that is not entered directly from the original publication. Please share all spreadsheets with the user betydb@gmail.com in addition to any collaborators.

Any raw data that is not directly entered into the database but that
is used to derive data or stats using equations in Tables \ref{tab:conversions} or \ref{tab:stats}.
Any data extracted from figures, along with the figure number
Any calculations that were made. These calculations should be

Each project has a Google document spreadsheet with the title "project_data". In this spreadsheet, each reference should have a separate worksheet labeled with the citation key (authorYYYabc format). Do not enter data into excel first as this is prone to errors and information such as equations may be lost when uploading or copy-pasting.

Preparing Published Data

Mendeley

Mendeley provides a central location for the collection, annotation, and tracking of the journal articles that we use. Features of Mendeley that are useful to us include:

Collaborative annotation & notes sharing
- Text highlighter
- Sticky notes for comments in the text
- Notes field for text notes in the reference documentation
Read/ unread & favorites:
Papers can be marked as read or unread, and may be starred.
Groups
Tagging

Open Mendeley desktop
Click Edit → New Group or Ctrl+Shift+M
Create group name following instructions above

Adding and Annotating Papers

Adding a reference

If the DOI number is available (most articles since 2000)
1. Select project folder
2. Right click and select Add entry manually...

Annotating a Reference

Each week, please identify and prepare papers that you would like to be entered next by completing the following steps:

Use the star label to identify the papers that you want the student to focus on next.
- Start by keeping a minimum of 2 and a maximum of 5 highlighted
  at once so that students can focus on the ones that you want.
  Students have been entering 1-3 papers per week, once we get

Comment or Highlight the following information

Sample size
Covariates (see table \ref{tab:covariates})
Treatments

Finding a citation in Mendeley

Recording extracted data and transformations

Any raw data that is not directly entered into the database but that
is used to derive data or stats using equations in Tables \ref{tab:conversions} or \ref{tab:stats}.
Any data extracted from figures, along with the figure number
Any calculations that were made. These calculations should be

For each paper, use comment bubbles, notes field, and highlighter to indicate:

Name(s) of traits to be collected
Methods:
- Site name
- Location
- Number of replicates
- Statistics to collect
- Identify treatment(s) and control
- Indicate if study was conducted in greenhouse, pot, or growth chamber
Data to collect
- Identify figures number and the symbols to extract data from.
- Table number and columns with data to collect
Covariates
Management data (for yields)
Units in 'to' and 'from' fields used to convert data
Esoteric information that other scientists or technicians might not catch and that is not otherwise recorded in the database
Any data that may be useful at a later date but that can be skipped for now.

Finding and Preparing Published Data

hashtagMeta-analyses

Preparing Published Data

hashtagMendeley

hashtag Adding and Annotating Papers

hashtagAdding a reference

hashtagAnnotating a Reference

hashtagFinding a citation in Mendeley

hashtagRecording extracted data and transformations

Finding and Preparing Published Data

hashtagMeta-analyses

Preparing Published Data

hashtagMendeley

hashtag Adding and Annotating Papers

hashtagAdding a reference

hashtagAnnotating a Reference

hashtagFinding a citation in Mendeley

hashtagRecording extracted data and transformations

Meta-analyses

Mendeley

Adding and Annotating Papers

Adding a reference

Annotating a Reference

Finding a citation in Mendeley

Recording extracted data and transformations

Meta-analyses

Mendeley

Adding and Annotating Papers

Adding a reference

Annotating a Reference

Finding a citation in Mendeley

Recording extracted data and transformations