Collection of public notebooks for all projects on BossDB
This repository generates Google Colab notebooks for every public BossDB project using nightly metadata exports from the BossDB metadata service. Each notebook includes dataset URIs and coordinate ranges for exploring BossDB data through intern.
- Downloads the nightly metadata file mongo-data.json from S3
- Fills a notebook template for each project
- Writes the generated notebooks to notebooks/
- Keeps them updated through nightly automation
- Provides a CLI (create-notebooks) for local regeneration
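The steps in the list above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual implementation: the record fields (name, uri, z, y, x), the function names, and the notebook layout are all assumptions made for the example, and the S3 bucket and key are left as caller-supplied arguments because the real locations are not stated here. The generated code cell uses intern's array interface, which indexes volumes in ZYX order by default.

```python
import json
from pathlib import Path


def download_metadata(bucket: str, key: str, dest: Path) -> Path:
    """Fetch the metadata export from S3 (bucket and key are caller-supplied)."""
    import boto3  # imported lazily so the rest of the sketch runs without it

    boto3.client("s3").download_file(bucket, key, str(dest))
    return dest


def render_notebook(project: dict) -> dict:
    """Fill a minimal nbformat-4 notebook for one (hypothetical) project record."""
    (z0, z1), (y0, y1), (x0, x1) = project["z"], project["y"], project["x"]
    # Code cell that opens the dataset with intern and pulls a ZYX cutout.
    code = (
        "from intern import array\n"
        f'data = array("{project["uri"]}")\n'
        f"cutout = data[{z0}:{z1}, {y0}:{y1}, {x0}:{x1}]\n"
    )
    return {
        "nbformat": 4,
        "nbformat_minor": 4,
        "metadata": {},
        "cells": [
            {"cell_type": "markdown", "metadata": {},
             "source": f"# {project['name']}"},
            {"cell_type": "code", "metadata": {}, "execution_count": None,
             "outputs": [], "source": code},
        ],
    }


def write_notebooks(metadata_path: Path, out_dir: Path) -> list:
    """Write one .ipynb per record; assumes the file holds a JSON list."""
    projects = json.loads(metadata_path.read_text())
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for project in projects:
        path = out_dir / f"{project['name']}.ipynb"
        path.write_text(json.dumps(render_notebook(project), indent=1))
        written.append(path)
    return written
```

Keeping the download, render, and write steps as separate functions means the template logic can be exercised against a local metadata file without touching S3.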
The workflow at .github/workflows/generate-notebooks.yml runs nightly at 05:20 UTC. It installs dependencies, runs create-notebooks, and commits any updated notebooks to notebooks/.
You can also trigger it manually via GitHub Actions → “Nightly Generate Colab Notebooks” → Run workflow.
The workflow relies on three secrets: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and GITHUB_TOKEN. The two AWS secrets are configured under Settings → Secrets and variables → Actions; GITHUB_TOKEN is provided automatically by GitHub Actions.
Notebook generation can also be run locally:
# Install dependencies and create .venv
uv sync
# Install CLI command so "create-notebooks" becomes available
uv tool install .
# Activate environment
source .venv/bin/activate
# Make sure `bossdb` AWS credentials are configured in your environment first
# Run the generator locally (downloads metadata and writes to notebooks/)
create-notebooks
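Before running the generator locally, it can help to confirm that the credentials are actually present. The sketch below assumes that "bossdb" refers to a named profile in the standard ~/.aws/credentials file; that is an interpretation of the note above rather than something this README states, and has_aws_profile is a hypothetical helper, not part of the CLI.

```python
import configparser
from pathlib import Path


def has_aws_profile(credentials_text: str, profile: str = "bossdb") -> bool:
    """Return True if the INI text defines both keys for the given profile."""
    parser = configparser.ConfigParser()
    parser.read_string(credentials_text)
    if not parser.has_section(profile):
        return False
    section = parser[profile]
    return "aws_access_key_id" in section and "aws_secret_access_key" in section


if __name__ == "__main__":
    # Assumed location: the default shared credentials file used by the AWS SDKs.
    path = Path.home() / ".aws" / "credentials"
    text = path.read_text() if path.exists() else ""
    print("bossdb profile found" if has_aws_profile(text) else "bossdb profile missing")
```

If the credentials are supplied through environment variables instead (as in the nightly workflow), this check does not apply.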