A dataset containing the simulated wastewater concentrations
(labeled here as log_genome_copies_per_ml
) by sample collection date
(date
), the site where the sample was collected (site
) and the lab
where the samples were processed (lab
). Additional columns that are
required attributes needed for the model are the limit of detection for
that lab on each day (labeled here as log_lod
) and the population size of
the wastewater catchment area represented by the wastewater concentrations
in each site
.
Format
ww_data
A tibble with 102 rows and 6 columns
- date
Sample collection date, formatted in ISO8601 standards as YYYY-MM-DD
- site
The wastewater treatment plant where the sample was collected
- lab
The lab where the sample was processed
- log_genome_copies_per_ml
The natural log of the wastewater concentration measured on the date specified, collected in the site specified, and processed in the lab specified. The package expects this quantity in units of log estimated genome copies per mL.
- log_lod
The log of the limit of detection in the site and lab on a particular day of the quantification device (e.g. PCR). This should be in units of log estimated genome copies per mL.
- site_pop
The population size of the wastewater catchment area represented by the site variable
- location
A string indicating the location that all of the data is coming from. This is not a necessary column, but instead is included to more realistically mirror a typical workflow
Details
This data is generated via the default values in the
generate_simulated_data()
function. They represent the bare minumum
required fields needed to pass to the model, and we recommend that users
try to format their own data to match this format.
The variables are as follows: