Data Carpentry’s aim is to teach researchers basic concepts, skills, and tools for working with data so that they can get more done in less time, and with less pain.
Interested in teaching these materials? We have an onboarding video available to prepare Instructors to teach these lessons. After watching this video, please contact team@carpentries.org so that we can record your status as an onboarded Instructor. Instructors who have completed onboarding will be given priority status for teaching at centrally-organized Data Carpentry Geospatial workshops.
Getting Started
Data Carpentry’s teaching is hands-on, so participants are encouraged to use their own computers to ensure the proper setup of tools for an efficient workflow. To most effectively use these materials, please make sure to download the data and install everything before working through this lesson.
This workshop assumes no prior experience with the tools covered in the workshop. However, learners with prior experience working with geospatial data may be able to skip the Geospatial Project Organization and Management lesson. Similarly, learners who have prior experience with the
R
programming language may wish to skip the Introduction to R for Geospatial Data lesson.To get started, follow the directions in the Setup tab to get access to the required software and data for this workshop.
Data
The data and lessons in this workshop were originally developed through a hackathon funded by the National Ecological Observatory Network (NEON) - an NSF funded observatory in Boulder, Colorado - in collaboration with Data Carpentry, SESYNC and CYVERSE. NEON is collecting data for 30 years to help scientists understand how aquatic and terrestrial ecosystems are changing. The data used in these lessons cover two NEON field sites:
- Harvard Forest (HARV) - Massachusetts, USA - fieldsite description
- San Joaquin Experimental Range (SJER) - California, USA - fieldsite description
The data for this workshop are available on Figshare under a CC-BY license. This is a reduced subset of the original data publication by Wasser and Jones (2016). You can download all of the data used in this workshop by clicking this download link. Clicking the download link will download all of the files as a single compressed (
.zip
) file. To expand this file, double click the folder icon in your file navigator application (for Macs, this is the Finder application).These data files represent teaching version of the data, with sufficient complexity to teach many aspects of data analysis and management, but with many complexities removed to allow students to focus on the core ideas and skills being taught.
Dataset Folder or file name Description Site layout shapefiles “vector” folder A set of shapefiles for the NEON’s Harvard Forest field site and US and (some) state boundary layers. Airborne remote sensing data “raster” folder LiDAR data collected by the NEON Airborne Observation Platform (AOP) and processed at NEON including a canopy height model, digital elevation model and digital surface model for NEON’s Harvard Forest and San Joaquin Experimental Range field sites. Point locations of field plots .csv files Two files that contain geographic coordinates of plot locations in the Harvard Forest field sites
Workshop Overview
Lesson | Overview |
---|---|
Introduction to Geospatial Concepts | Understand data structures and common storage and transfer formats for spatial data. |
Introduction to R for Geospatial Data | Import data into R, calculate summary statistics, and create publication-quality graphics. |
Introduction to Geospatial Raster and Vector Data with R | Open, work with, and plot vector and raster-format spatial data in R. |