Example wastewater dataset. — ww

A dataset containing the simulated wastewater concentrations (labeled here as log_genome_copies_per_ml) by sample collection date (date), the site where the sample was collected (site) and the lab where the samples were processed (lab). Additional columns that are required attributes needed for the model are the limit of detection for that lab on each day (labeled here as log_lod) and the population size of the wastewater catchment area represented by the wastewater concentrations in each site.

Usage

ww_data

Format

ww_data

A tibble with 102 rows and 6 columns

date: Sample collection date, formatted in ISO8601 standards as YYYY-MM-DD
site: The wastewater treatment plant where the sample was collected
lab: The lab where the sample was processed
log_genome_copies_per_ml: The natural log of the wastewater concentration measured on the date specified, collected in the site specified, and processed in the lab specified. The package expects this quantity in units of log estimated genome copies per mL.
log_lod: The log of the limit of detection in the site and lab on a particular day of the quantification device (e.g. PCR). This should be in units of log estimated genome copies per mL.
site_pop: The population size of the wastewater catchment area represented by the site variable
location: A string indicating the location that all of the data is coming from. This is not a necessary column, but instead is included to more realistically mirror a typical workflow

Source

vignette_data.R

Details

This data is generated via the default values in the generate_simulated_data() function. They represent the bare minumum required fields needed to pass to the model, and we recommend that users try to format their own data to match this format.

The variables are as follows: