Everything researchers need to know about our platform, data, and collaboration opportunities
MyBeachBook is a citizen science platform for water quality monitoring, combining field assessments with satellite analysis.
MyBeachBook is a cross-platform mobile application (Android, iOS, Web) built with Flutter. It enables citizen scientists to collect GPS-tagged, timestamped, photo-documented water quality assessments. On the backend, we run satellite analysis pipelines for algal bloom prediction, temporal trend analysis, and environmental monitoring.
The platform is backed by Firebase (Firestore, Storage, Auth) with Google Maps integration, Google ML Kit for on-device image labeling, and iNaturalist API for species identification.
Each field assessment captures 40+ data categories:
All submissions are geohash-validated (precision 9 = ~100m) to confirm the contributor is physically at the location, and admin-moderated before inclusion.
Our data addresses a critical gap in remote sensing research: concurrent ground truth paired with satellite observations. Most remote sensing studies lack field-validated data collected at the same locations and time periods as satellite passes.
Our backend processes multi-source satellite imagery using STAC APIs, independent of Google Earth Engine.
Data is downloaded at 120m resolution (from native 10-20m) for computational efficiency, with cloud masking via SCL using progressive thresholds (20% → 40% → 60% → 80% → 95%).
We migrated from GEE to STAC APIs for platform independence and reproducibility. Our pipeline uses xarray + stackstac for lazy loading and chunked processing, with multi-level caching (in-memory → disk via NetCDF → STAC download). This eliminates dependency on Google's infrastructure and gives us full control over the processing chain.
Primary STAC endpoint: Element84, with automatic fallback to Planetary Computer and Copernicus.
Water-type-specific machine learning models trained on satellite-derived features and citizen science ground truth.
We train separate models for each water body type (lake, river, tidal) because the physics driving algal growth differ between environments. The pipeline:
The v2 model uses 33 input features organized into categories:
Engineered substrate features:
Each water body type gets 3 additional custom features that capture its unique physical dynamics:
For example, a lake's bloom risk depends on thermal stratification and nutrient retention in fine sediments, while a river's depends on upstream population pressure and flow-mediated nutrient loading. Generic models miss these physics.
The v2 training script compares four algorithms per water type and selects the best by 5-fold cross-validated R²:
XGBoost wins in most cases. Two targets are trained per water type: chlorophyll-a concentration and cyanobacteria index.
V2 model results (XGBoost + substrate features, 33 input features):
Satellite measurement accuracy:
Improving retrieval algorithm accuracy through better atmospheric correction and validation methodology is a key area where research collaboration would have the most impact.
Multi-year temporal analysis of algal bloom patterns from Sentinel-2 imagery.
Comprehensive environmental reports combining satellite analysis with citizen science data.
Each report is a 24+ page document (Word + PDF) covering 10 major sections:
Composite scoring:
Key technical decisions and system design.
Before running analysis on any location, we verify the sampling point is actually in permanent water using a 5-year NDWI timeseries. If the pin falls on land, an 8-directional search automatically adjusts toward the nearest deeper water pixel.
Climate zones for regional benchmarking are latitude-based: Arctic (>60°), Temperate Cold (45–60°), Temperate Warm (30–45°), Subtropical (15–30°), Tropical (<15°).
We actively seek academic partnerships to improve our satellite retrieval algorithms and validate models against ground truth.
We're seeking academic collaborators in:
We bring: a deployed platform, 355+ ground truth sites, satellite processing infrastructure, municipal relationships, and potential co-funding through Alberta Innovates and other programs.
Yes. The platform provides a natural fit for graduate research including:
We're happy to support HQP (Highly Qualified Personnel) components for NSERC or other grant applications.
We're always looking for research partners to push the boundaries of satellite-derived water quality monitoring
Get in Touch