This was an adaptation created by Michelle Gaynor (University of Florida) as part of a BCEENET (https://bceenetwork.org/) workshop on data cleaning co-facilitated with Pam Soltis (University of Florida) using the species Shortia galacifolia, the Oconee bells or acony bell, which is a rare North American plant in the family Diapensiaceae found in the southern Appalachian Mountains.
Reference the original resource for more background information: https://qubeshub.org/publications/1899/
Students will clean an open sources herbarium dataset using best practices to accurately and clearly designate each step taken to collect, clean, and analyze open access biodiversity data. This exercise uses Excel or R.
Upon completion of this module, each student should be able to:
- Access biodiversity data from open sources.
- Use descriptive, retrievable, and consistent file names to manage datasets.
- Identify common problems with digital datasets
- Rectify common problems with digital datasets
- Apply disciplinary knowledge for smart data cleaning
- Explain the importance of reproducible data and cleaning steps
- Document data cleaning steps to provide reproducibility.
Access data and Excel and RStudio examples from GitHub: https://github.com/mgaynor1/BCEENET-DataCleaning