Skip to contents

A dataset containing the simulated wastewater concentrations (labeled here as log_genome_copies_per_ml) by sample collection date (date), the site where the sample was collected (site) and the lab where the samples were processed (lab). Additional columns that are required attributes needed for the model are the limit of detection for that lab on each day (labeled here as log_lod) and the population size of the wastewater catchment area represented by the wastewater concentrations in each site.

Usage

ww_data

Format

ww_data

A tibble with 102 rows and 6 columns

date

Sample collection date, formatted in ISO8601 standards as YYYY-MM-DD

site

The wastewater treatment plant where the sample was collected

lab

The lab where the sample was processed

log_genome_copies_per_ml

The natural log of the wastewater concentration measured on the date specified, collected in the site specified, and processed in the lab specified. The package expects this quantity in units of log estimated genome copies per mL.

log_lod

The log of the limit of detection in the site and lab on a particular day of the quantification device (e.g. PCR). This should be in units of log estimated genome copies per mL.

site_pop

The population size of the wastewater catchment area represented by the site variable

location

A string indicating the location that all of the data is coming from. This is not a necessary column, but instead is included to more realistically mirror a typical workflow

Source

vignette_data.R

Details

This data is generated via the default values in the generate_simulated_data() function. They represent the bare minumum required fields needed to pass to the model, and we recommend that users try to format their own data to match this format.

The variables are as follows: