mirror of
https://github.com/dathere/ckan_geoconnex_bulk_runner.git
synced 2026-06-20 19:32:56 +00:00
📦 (WIP) Program that sets up a bulk integration between a CKAN water data hub's datasets and vector geospatial features with Geoconnex. Ran as a Docker container and built with Rust.
- Rust 98%
- Dockerfile 2%
| .github/workflows | ||
| bulk_loader | ||
| ckan_geoconnex_bulk_runner_py | ||
| generate_release | ||
| geoconnex_utils | ||
| .gitignore | ||
| Cargo.lock | ||
| Cargo.toml | ||
| README.md | ||
ckan_geoconnex_bulk_runner
https://github.com/user-attachments/assets/779fe866-d511-44f3-91a9-a1c2e1cfa189
Status: This codebase is currently a work in progress and more documentation is planned.
The ckan_geoconnex_bulk_runner codebase is part of a multi-service infrastructure to sync water data hubs using CKAN to the Geoconnex knowledge graph.
- geoconnex_utils: Helper functions used throughout the ckan_geoconnex_bulk_runner project including JSON-LD construction and JSON schema validation.
- geoconnex_release: Compatible CKAN datasets and vector geospatial features for all connected water data hubs are uploaded to a
ckan-geoconnex-web-resources.jsonlfile in the latest GitHub release. - bulk_loader: Requests and outputs the latest JSONL file from the latest GitHub release. This is ran as a Docker container by Geoconnex on a periodic frequency to upload all water data hub web resources to the Geoconnex knowledge graph following the Geoconnex bulk contribution specification.
- ckan_geoconnex_bulk_runner_py: Python library intended for usage by the ckanext-gztr and DataPusher+ CKAN extensions.
This runner is expected to be implemented for a water data hub with the relevant fields and/or ckanext-gztr (not open-source yet) and/or DataPusher+ enabled. For questions reach out to datHere, Center for Geospatial Solutions, or add an issue/discussion.