• Skip to main content
itrc_logo

EDM

Home
Interactive Directory
Introduction and Overview
Introduction
Overview of Guidance Document
Data Management Planning
Data Management Planning Home
Data Management Planning Overview
Data Governance
Data Lifecycle
Data Access, Sharing, and Security
Data Storage, Documentation, and Discovery
Data Disaster Recovery
Data Quality
Data Quality Home
Data Quality Overview 
Analytical Data Quality Review: Verification, Validation, and Usability
Using Data Quality Dimensions to Assess and Manage Data Quality
Considerations for Choosing an Analytical Laboratory 
Active Quality Control During Screening-level Assessments
Field Data Collection
Field Data Collection Home
Introduction to and Overview of Field Data Collection Best Practices
Defining Field Data Categories and Collection Methods
Field Data Collection Process Development Considerations
Field Data Collection Quality Assurance and Quality Control (QA/QC)
Field Data Collection Training Best Practices
Field Data Collection Training Best Practices Training Development Checklist
Other Considerations for Field Data Collection
Data Exchange
Data Exchange Home
Data Exchange Overview
Valid Values
Electronic Data Deliverables and Data Exchange
Data Migration Best Practices
Traditional Ecological Knowledge
Traditional Ecological Knowledge Home
What is Traditional Ecological Knowledge?
Acquiring Traditional Ecological Knowledge Data
Using and Consuming Traditional Ecological Knowledge Data
Managing Traditional Ecological Knowledge Data
Geospatial Data
Geospatial Data Home
Overview of Best Practices for Management of Environmental Geospatial Data
Organizational Standards for Management of Geospatial Data
Geospatial Data Standards
Geospatial Data: GIS Hardware
Geospatial Metadata
Geospatial Data Software
Geospatial Data Collection Consistency
Geospatial Data Field Hardware
Geospatial Data Dissemination: Web Format
Geospatial Visualization of Environmental Data
Public Communications
Public Communications Home
Public Communication and Stakeholder Engagement
Environmental Data Management Systems
Environmental Data Management Systems Home
Environmental Data Management Systems
Case Studies
Case Studies Home
Historical Data Migration Case Study: Filling Minnesota’s Superfund Groundwater Data Accessibility Gap
Case Study: USGS Challenges with secondary use of multi-source water quality monitoring data
LEK Case Study: Collection and Application of Local Ecological Knowledge to Local Environmental Management in Duluth, Minnesota
TEK Case Study: Improving Coastal Resilience in Point Hope, Alaska
Case Study: Integration of Traditional Ecological Knowledge to the Remediation of Abandoned Uranium Sites
Case Study: Local Ecological Knowledge of Historic Anthrax in a Natural Gas Field
Rest in Peace? A Cautionary Tale of Failure to Consult with an Indigenous Community
Case Study: Use of Traditional Ecological Knowledge to Support Revegetation at a Former Uranium Mill Site
Additional Information
Supplemental Resources
References
Acronyms
Glossary
Acknowledgments
Team Contacts
Navigating this Website
Document Feedback

 

Environmental Data Management (EDM) Best Practices
HOME

Defining Field Data Categories and Collection Methods

ITRC has developed a series of fact sheets that summarizes the latest science, engineering, and technologies regarding environmental data management (EDM) best practices. This fact sheet describes:

  • defining categories of data collected in the field
  • low volume data
  • high volume data
  • unstructured data
  • metadata

Additional information related to field data collection is provided in the Introduction to and Overview of Field Data Collection Best Practices fact sheet, as well as fact sheets on Field Data Collection Process Development Considerations, Field Data Quality Assurance and Quality Control (QA/QC), Field Data Collection Training Best Management Practices, and Other Considerations for Field Data Collection.

1 OVERVIEW

Data collected in the field include a vast range of potential observations, measurements, and sample types depending upon the data and information needs of the project. Matrixes observed, measured, or sampled may include air, water, soil, sediment, tissue, etc. Data may be collected over wide-ranging space and time scales and may be recorded in hard copy, digital, or both formats. 

Before an organization can develop processes and implement field data collection programs, it is important to understand the basics of what categories of data you are collecting, what form is it in, and data quality objectives. The following sections will elaborate on the various categories of data and provide a framework for understanding your project before you begin the process of developing a field data collection program.

2 SAMPLE AND LOCATION IDENTIFIERS

Sample identification is a key element in the field data capture process and is fundamental to managing data. Proper sample identifier assignment ensures that laboratory results from each sample can be traced to their point of origin and throughout the data lifecycle and can be used to interpret the results correctly. Data management plans should have defined sample-naming procedures. Essentially, every sample collected or generated in the field must be assigned a unique sample identifier that is traceable to the field collection records. This applies to normal field samples, different fractions created in the field (for example, filters, filtrates), and quality control (QC) samples. Location identifier naming procedures should also be prescribed in data management plans when locations are revisited or if there is the possibility that sampling locations will be revisited. For more information on developing guidance on these naming conventions in a data management plan or governance document, see the Data Governance and Data Lifecycle fact sheets. Best practices for the collection and management of geospatial data can be found in the Geospatial Data Collection Consistency subtopic sheet. 

In addition to sample naming, the relationships between samples need to be documented. Data users will need to know, for example, that a sample was split in the field into two subsamples, or that multiple individual samples were composited in the field, or that two soil samples are from the same boring, and so forth. This is often done by using fields within sample identifiers to contain linking information, but in addition to this it is preferable to also record the relationship information in metadata collected in the field and with additional identifiers in the database. A soil boring, for example, would be assigned a unique boring identifier, and every soil horizon from that boring would have not only its own unique sample identifier but also (as metadata) the associated soil boring identifier. Additional information on metadata is included below in Section 3.4. 

Location identification is essential in the field data capture process for sensor data as well. When a handheld sensor internally records the location (using GPS) and date/time of a measurement, then location identifiers are of secondary importance. However, proper location identifier assignment ensures that field measurements made with manually recorded data and metadata can be traced to their point of origin. 

3 FIELD DATA CATEGORIES

The text and tables below describe a wide range of field data categories. They are not intended to be all inclusive but do provide a range of categories and examples. The data categories are grouped by the volume and kind of data they generate, that is, low-volume, high-volume, and unstructured data. Along with any measurement in the field, the metadata should also be recorded.

  • Low-volume data are discrete field or lab data collected at a single point in time during a field sampling event, or analysis of a field sample in the lab. The amount of data is low compared to ongoing monitoring done by sensors collecting data every 5–10 minutes.  
  • High-volume data are those data generated in short time intervals for continuous monitoring, thus producing a large data set of frequent measurements. High-volume data is stored on the sensor device and may be transmitted in real time to an environmental data management system (EDMS) using telemetry to provide database users near real-time data results and alerts. Otherwise, sensor data can be downloaded from sensor devices periodically and incorporated into an EDMS. 
  • Unstructured data are not organized in a structured database schema table. Examples include files that can be organized in a file folder structure, such as video or audio recording files, photograph files, and handwritten scanned documents or PDF files. 
  • Metadata are the who, what, where, when, and how low-volume, high-volume, or unstructured data are collected and recorded. 

Some unstructured data can be transcribed in various ways into structured data; for example, information recorded on a hard copy field data sheet form is transcribed into a spreadsheet, or numbers and species of trees are counted on aerial photographs and transcribed into a data table. Considerations for migration of unstructured data to a structured EDMS are discussed in the Data Migration Best Practices Fact Sheet. 

Tables 1, 2, and 3 provide examples of low-volume, high-volume, and unstructured data, respectively. Due to the many potential sources of data, a discussion regarding data sources is beyond the scope of this document. Figure 1 shows the general flow of field data.

Figure 1. Example flow of field data.
Source: Adapted from Geosyntec Consultants, Undated. Used with permission.

3.1 Low-Volume Data

3.1.1 Field Samples and Field Observations

When field samples are collected for later analysis (for example, laboratory samples), the field team members must capture information about the sample collection process. This includes a unique sample identifier (see Section 2 above), date/time, location, collection methods, collector, and preservation method on the sample chain of custody for each sample collected. Field observations can vary widely and may take the form of soil boing logs (lithology, color, odor, moisture), well construction logs (screen interval, filter pack, construction materials), water quality logs (temperature, pH, dissolved oxygen, turbidity), etc. Sample information and field observations may be recorded data by hand or digitally but should be sufficient to ensure that field data capture provides complete and consistent information. The importance of a well-designed field form cannot be overstressed in this context, and this topic is discussed at length in Field Data Collection Process Development Considerations.

3.1.1.1 Chain of Custody (COC) Data for Discrete Samples
Figure 2. Field technician completing a paper chain of custody form.

COC procedures provide a record of physical sample custody to track samples from the time of collection to delivery at a destination laboratory. COC documentation is the record that a sample was collected, handled according to method requirements (for example, preservation), shipped to a laboratory, and directly linked to the analytical results. Sample tracking can be simplified and expedited using bar-coded labels. A separate COC form of one or more pages may be prepared for each shipping container. When multiple shipping containers are used, it can be helpful to have the shipping containers numbered (for example, Cooler #1) and the number of bottles in each container listed on the COC. COCs are often designed and controlled by the laboratory. However, not all laboratories require COCs, and different laboratories may have different nomenclature for submittal. Communicate with laboratories to ensure that the data captured are sufficient. COCs can be completed digitally or as paper documents as shown in Figure 2. The following data may be included for each sample in the shipping container:

  • sample identifier 
  • date/time of collection 
  • number and type of containers 
  • preservation 
  • requested analyses, which should be specified using controlled vocabulary valid values to avoid miscommunication
  • sample turn-around time requirements 
  • signatures/dates of sample collectors and individuals involved in sample transfer
Figure 3. Field technician using low-volume point sensor.

3.1.2 Point Sensors

Point sensors from various manufacturers measure a plethora of parameters at a single point in space and in media such as air, water, pore water, confined spaces, and process liquid streams. One example is shown in Figure 3. Low-volume data sensor configurations are typically handheld devices measuring a single or limited number of parameters at one moment in time. With handheld devices the sensor readings are digitally displayed for the user to record on paper or to record in a mobile digital electronic form, or they are recorded in the device memory for later download into a structured electronic table or file.

3.1.2.1 Point Sensor Calibration and Calibration Data Handling

The instrument sensors should be maintained and calibrated as specified by the manufacturer. Best practice suggests that sensor calibration also be checked prior to going into the field using manufacturer recommended calibration procedures to verify accuracy. It is also recommended that the calibration fluids be taken to the field as well in case the instrument falls out of calibration during the course of the day. The calibration data for each instrument/sensor should be maintained in the project data archive along with the collected sensor data and metadata. During post-field data analysis, it is ordinarily necessary to use instrument calibration data to convert from instrument values/units to final engineering values/units.

3.1.2.2 Data Unpacking, Processing, and Archiving

When sensor data from low-volume sensors are recorded manually on log forms or in a field notebook, the data are often transcribed into electronic form. After transcription, QA/QC procedures should be applied to ensure no transcription errors occur. For more guidance on QA/QC of field data, see the Field Data Collection Quality Assurance and Quality Control (QA/QC) Fact Sheet.

Figure 4. Example folder structure that maintains context.

When sensor data from low-volume sensors are recorded electronically in instrument memory, data are downloaded from the instrument(s) using communications software provided by or recommended by the manufacturer. These sensor data may include metadata, measurements, and calibration data. Even if the data are copied into a database, the original files should be preserved in their native electronic format and should never be renamed to serve as a record of where the data originated. Files transferred from field recording devices should be copied to the directory where project files are maintained, typically a network server folder(s). Here the data files, metadata files, and sensor calibration data files will be used for data processing and will remain for archiving. These folders should be organized with future discovery and access in mind, in a structure that maintains context in the folder path names. (See Figure 4 for an example folder structure.) Complete records of the provenance of these files should be maintained.

All these practices are suggestions, and your individual process should apply these suggestions as applicable to meet your organization’s existing document and data archival policies and regulations. Additional information on this topic is available in the Data Storage, Documentation, and Discovery Fact Sheet.

Table 1. Examples of low-volume data

Category Description Spatial/ Temporal Extent Method Data QA/QC
Field Samples & Field Observations
Field Samples Collected for Further Analysis (For Example, Laboratory Samples) as Recorded on
Sample Chain of Custody Forms

Air, water, sediment, soil, rock, tissue collected for chemical analysis, for sample dating, for species
abundance, toxicity studies, etc.

Examples: Air samples for polychlorinated biphenyls (PCB) analysis;
tree ring cores for dating; benthic; sediment for invertebrate abundance; lithology

Single point space and time Physical collection (for example, box core, slurp gun, air puff) The data will be generated in the lab. Whether that is 1,000 miles away or in a trailer at the job site,
the data are not generated at the point of sampling.
Chain of custody, QC sampling (including field replicates, field blanks, trip blanks, etc., as
appropriate)
Subjective Observations

Biological, habitat

Examples: vegetation counts, bio index

A sample area, for example, a river reach or habitat grid Observation in the field Recorded observations Replicated observations and double-entry or read-back of recorded (audio, video, or other)
observationsVerification against maps or other information sources
Operation, Maintenance, and Monitoring Data relating to equipment used for environmental remediation, which may include equipment temperature,
pressure, flow, power consumption, discharges, etc.
Single point measurement(s) over time Programmable logic controllers (PLC), supervisory control and data acquisition (SCADA), or manual
readings
Recorded observations, parameter measurements Equipment calibrations
Point Sensors
Water Sensors

Sensors that measure physical, chemical, or biological parameters in water.

Examples: temperature,
turbidity, conductivity, biological oxygen demand (BOD), pH, dissolved oxygen, chlorophyll

Single measurement at a single point in space and time In situ point sensor measurement or measurement of collected water samples; typically handheld devices Parameter measurements Sensor calibration
Air Sensors

Sensors that measure chemical parameters in air or other gas environments

Examples: O2, CO,
H2S, VOCs, etc.

Single measurement at a single point in space and time In situ point sensor measurement; typically handheld devices Parameter measurements Sensor calibration
Other In Situ Sensors Examples: membrane interface probe, laser-induced fluorescence probe, etc. Single measurement at a single point in space and time In situ point sensor measurement Parameter measurements Sensor calibration

3.2 High-Volume Data

High-volume sensor data require the same information previously described for sample and location identifiers, low-volume point sensor calibration data, and data unpacking/storage. Data from high-volume sensors is often handled differently, however, due to its nature. High-volume sensor data may be recorded in instrument memory, as with low-volume sensors, or it may be transmitted in real time or near real-time to servers elsewhere using telemetry. In this case, data unpacking may be done manually as described previously for low-volume data, or software may be created/used to process the data in real or near real-time. In the case of real or near real-time data processing, software quality control systems should be in place to ensure software created to unpack and process transmitted data in real or near real-time are faultless and fault-tolerant. This includes software for unpacking data, deriving final measurement values by applying instrument calibrations or corrections for instrument drift, or any other real-time data processing needed. The final archive of data files should follow a similar project folder structure described above to maintain the data files in their original context.

Table 2. Examples of high-volume data

Category Description Spatial/ Temporal Extent Method Data QA/QC
Point Sensor Measurements
Water Sensors

Sensors that measure physical, chemical, or biological parameters in water

Examples: temperature, turbidity, conductivity, BOD, pH, dissolved oxygen, chlorophyll

Repeating measurements at a fixed location over time or repeating measurements while the sensor is moved
in space

In situ point sensor measurement.

Examples: sensor fixed on a mooring; conductivity, temperature, and depth (CTD) probe lowered thru the water column

Parameter measurements Sensor calibration and drift corrections
Air Sensors

Sensors that measure chemical parameters in air or other gas environments

Examples: O2, CO, H2S, VOCs, etc.

Repeating measurements at a fixed location over time

In situ point sensor measurement.

Examples: Fixed air sampling station

Parameter measurements Sensor calibration and drift corrections
Meteorological Sensors

Sensors that measure physical meteorological parameters

Examples: wind direction, wind speed, VOC atmospheric pressure, air temperature

Repeating measurements at a fixed location over time or repeating measurements while the sensor is moved
in space

In situ point sensor measurement.

Examples: Fixed weather station, weather balloon

Parameter measurements Sensor calibration
Remote Sensing
Air and Water Movement Sensors

Acoustic sensors used to measure water current velocities or wind speed profiling over a depth or height
range.

Example: acoustic Doppler current profiler (ADCP), sonic detection and ranging (SODAR)

Repeating measurements of a profile at a fixed location over time or repeating measurements of a profile
while the sensor is moved in space
Acoustic remote sensing measurement Parameter measurements Sensor calibration
Bathymetric Sensors

Acoustic sensors used to measure distance to the seafloor, lake, or riverbed

Examples: Echo sounder, multibeam, side scan sonar

Repeating profile acoustic measurements while the sensor is moved in space Single acoustic beam or multi acoustic beam measurements of distance to seafloor Time, location, elevation, distance to seafloor Sensor calibration
Water Level Sensors Includes transducers that measure pressure in groundwater monitoring wells, corrected for barometric pressure, to determine groundwater depths used to calculate groundwater elevations Repeating measurements at a fixed location over time or repeating measurements while the sensor is moved in space In situ point sensor measurement Parameter measurements Sensor calibration

3.3 Unstructured Data

Any relevant unstructured field data should be captured appropriately into the study data management system. Field team members should be prepared to manage the following unstructured data and their metadata and to archive them as appropriate:

  • photo documentation
  • video documentation
  • sound recordings
  • handwritten documentation

Hard copy handwritten documentation should be maintained with project documents in the organization’s project file folder structure, but these records contain information that is less usable to the project team and may become more and more difficult to find over time. Good practice is to uniquely number each loose form sheet and each field notebook or logbook. Scanning hard copy documentation and archiving the files in network server folders with chronological file names is recommended to help protect against loss of project data. Data files (for example, spreadsheets) created in the field should be named in such a way as to ensure their uniqueness. When scanning/photographing any field form, ensure that the digital version is readable. 

The following unstructured data should be captured and maintained electronically in the project folder file structure defined above. 

  • field logs including observations (that is, PDF copies of handwritten field logs)
  • field measurement data sheets (that is, PDF copies of handwritten data sheets)
  • instrument calibration sheets (that is, PDF copies of handwritten calibration sheets)
  • COC forms (that is, PDF copies of handwritten COC sheets)

The following information describes processing and archival considerations for unstructured data. As with sensor data files, unstructured data files (including photographs, videos, PDF log forms, and PDF COC forms) should never be renamed and should be preserved in their native electronic format. Files are typically transferred from field recording devices or hard copy scanners to network server folders where they should be organized in file folders whose structure maintains type and context in the folder path names. Complete records of the provenance of these files should be maintained.

Table 3. Examples of unstructured data

Category Description Spatial/ Temporal Extent Method Data QA/QC
Photographs

Photographs taken in the field.

Examples: field operations documentation, field samples, habitat surveys

Single point in space and time Photograph Photographic image files Verify accuracy of captions/metadata
Video

Video recorded in the field.

Examples: habitat surveys

Single point in space over time Video Video files Verify accuracy of captions/metadata
Scanned Field Notes (PDF or JPG) Notes that are written in field logbooks or completed data sheet field forms that are scanned with a
camera, creating a JPG, or using a tool that results in a PDF
Single point in space over time Scanning PDF Files Verify images of scanned field notes are legible and complete

3.4 Metadata

Metadata are the additional pieces of information that are critical and should be collected to make the data usable now and in the future as it pertains to the larger data set. Often these metadata, such as location, are recorded automatically by instruments, sensors, or photography/videography equipment. Where these metadata are not recorded automatically, simple methods such as writing the location identifier on a small whiteboard and including it in the photo frame can help ensure that the correct metadata are captured and remain associated with the data. It is also important to establish valid values for metadata where possible.

Table 4 provides examples of the metadata associated with low-volume, high-volume, and unstructured data. For information specific to geographic metadata, see the Geospatial Metadata subtopic sheet.

Table 4. Examples of metadata

Category Metadata
Low-Volume Data
Field Samples & Field Observations
Field Samples Collected for Further Analysis (For Example Laboratory Samples) as Recorded on Sample Chain of Custody forms Date/time (including time zone, if appropriate)
Location
Collector
Collection equipment
Collection method
Preservation method
Unique sample identifier
Sub-sample identifier (where splits or aliquots are created in the field)
Composite type and scope (where composite samples are created in the field)
Sample material (matrix)
Sample type (for example, normal field sample or quality control sample)
Number of sample containers
Subjective Observations and Operation, Maintenance, and Monitoring Date/Time (including time zone, if appropriate)
Location
Collector
Collection equipment
Collection method
Point Sensors
Water, Air, and Other In Situ Sensors Date/time (including time zone, if appropriate)
Location
Collector
Sensor/instrument serial number
Sensor/instrument settings
Sensor/instrument serial number
High-Volume Data
Point Sensors
Water and Air Sensors Date/time (deployed and recovered) (including time zone, if appropriate)
Location (fixed or moving)
Collector
Sensor/instrument model
Sensor/instrument serial number
Sensor/instrument settings
Meteorological Sensors Date/time (deployed and recovered) (including time zone, if appropriate)
Location (fixed or moving)
Collector
Sensor/instrument model
Sensor/instrument serial number
Sensor/instrument settings
Remote Sensors
Air and Water Movement Sensors Date/Time (deployed and recovered) (including time zone, if appropriate)
Location (fixed or moving)
Collector
Sensor/instrument model
Sensor/instrument serial number
Sensor/instrument settings
Bathymetric Sensors Date/Time (deployed and recovered) (including time zone, if appropriate)
Collector
Instrument serial number
Water Level Sensors Date/time (including time zone, if appropriate)
Location
Collector
Sensor/instrument model
Sensor/instrument serial number
Sensor/instrument settings
Unstructured Data
Photographs Asset file name
File type
Date/time (including time zone, if appropriate)
Photographer
Location
Direction
Photographer/videographer
Video
Scanned Field Notes (PDF or JPG) Asset file name
File type

4 REFERENCES AND ACRONYMS

The references cited in this fact sheet, and the other ITRC EDM Best Practices fact sheets, are included in one combined list that is available on the ITRC web site. The combined acronyms list is also available on the ITRC web site.

image_pdfPrint this page/section


EDM

Home
glossaryGlossary
referencesReferences
acronymsAcronyms
ITRC
Contact Us
About ITRC
Visit ITRC
social media iconsClick here to visit ITRC on FacebookClick here to visit ITRC on TwitterClick here to visit ITRC on LinkedInITRC on Social Media
about_itrc
Permission is granted to refer to or quote from this publication with the customary acknowledgment of the source (see suggested citation and disclaimer). This web site is owned by ITRC • 1250 H Street, NW • Suite 850 • Washington, DC 20005 • (202) 266-4933 • Email: [email protected] • Terms of Service, Privacy Policy, and Usage Policy ITRC is sponsored by the Environmental Council of the States.