Project

General

Profile

Full Featured Beachwatch Data

A new Beachwatch dataset removes null columns and has better analysis features
Added by Eric Busboom 6 months ago

I've just released a new version of the Beachwatch data which include "features", new columns specifically designed for analysis:

https://data.sandiegodata.org/dataset/sandiegodata-org-beachwatch

These features include a "measure code" that groups together each unique combination of analyte/methodname/unit, so you can easily select a specific group for analysis. As shown in the example notebook, this will usually be 24, the Enterolert measurements of Enterococcus.

The dataset also has mean, median and quantile groups for each of the combinations of station and method code, so rather than analyzing the raw result measurements, you can work with "high" and "low" values, scaled to a particular measurement and station. Some of examples of this sort of analysis, using Logistic Regression, is in this example notebook:

https://github.com/san-diego-water-quality/water-datasets/blob/master/derived/sandiegodata.org-beachwatch/notebooks/Examples.ipynb


Comments