Update 3

April 25, 2021

Long Term Ecological Research (LTER) Datasets: R package

Project Sponsor: National Center for Ecological Analysis and Synthesis (NCEAS)
Mentors: Dr. Julien Brun, Dr. Allison Horst, Dr. Sang-Yun Oh
Team Members: Sam Guo, Adhitya Logan, Lia Ran, Sophia Sternberg, Karen Zhao

Goal

Build an R package and accompanying website that contains one sample dataset from each of the 28 Long Term Ecological Research (LTER) Network sites for use in data science education as an alternative to the popular but controversial iris dataset.

Background Info

The LTER network is a set of 28 National Science Foundation-funded research programs to observe long-term and large scale ecological effects at various sites across the United States. LTER datasets are archived in a centralized data repository managed by the Environmental Data Initiative (EDI). Our project sponsor, NCEAS, is an environmental science research center at UC Santa Barbara that operates the LTER Network Office.

Progress

  • Collaborated on the existing R package lterdatasets

  • Began adding our selected vignette with related files (documentation, code for processing raw data) to this package via pull requests on GitHub

  • Enabled GitHub Actions, which runs a CI/CD workflow to make sure our pull requests don’t introduce any breaking changes and are compatible with the existing package infrastructure. In the future, GitHub Actions will automatically deploy changes made to our website

  • Reviewed each other’s pull requests and merged them after confirming that there are no errors and that the GitHub Actions workflow passed successfully

  • Opened issues to create discussions with our teammates regarding prospective datasets that could be incorporated into the package

  • GitHub Projects helps us organize and automate the progress that we’ve made on our issues and pull requests, making our collaboration flow smoother

  • Researchers at the LTER office were asked about promising and charismatic datasets, and we started to explore datasets from the resulting list

Next steps

  • Continue to explore datasets and create vignettes to teach data science concepts and techniques

  • Refine vignettes using feedback from teammates in pull requests

  • Learn how to customize and build website for our package from Dr. Horst

  • Brainstormed Ideas: Can create a separate page for each dataset analyzed in the package, Offer interactivity within modifying visualizations, sample code terminal, etc., Customize dataset pages with their own themes and colors based on the dataset itself

  • Build website using pkgdown and GitHub Actions

  • Design a hex sticker for our package. Examples of other stickers can be found here