📦 (WIP) Program that sets up a bulk integration between a CKAN water data hub's datasets and vector geospatial features with Geoconnex. Ran as a Docker container and built with Rust.
Find a file
2026-05-05 19:35:13 -04:00
config feat: add/update schemas, config, README, and test 2026-05-05 16:08:44 -04:00
src refactor: limit, dataset name, also add validation function 2026-05-05 19:35:13 -04:00
tests feat: location schema, architecture, ckanaction update, new test 2026-05-05 17:32:27 -04:00
.gitignore build: add .gitignore 2026-05-04 16:19:16 -04:00
Cargo.lock feat: location schema, architecture, ckanaction update, new test 2026-05-05 17:32:27 -04:00
Cargo.toml feat: location schema, architecture, ckanaction update, new test 2026-05-05 17:32:27 -04:00
README.md feat: add/update schemas, config, README, and test 2026-05-05 16:08:44 -04:00

ckan_geoconnex_bulk_runner

Status: This codebase is currently a work in progress and more documentation is planned.

The ckan_geoconnex_bulk_runner codebase is meant to run as a container for a bulk integration of a CKAN instance's relevant datasets and vector geospatial features (e.g. for water data hubs) to the Geoconnex knowledge graph. The codebase ultimately runs as a program outputting to standard output JSON-LD on a new line for each approved dataset/location which the Geoconnex crawler then uses to update the Geoconnex knowledge graph.

Refer to the "Contributing via Bulk Containers" documentation here for more information: https://docs.geoconnex.us/contributing/bulk/

This runner is expected to be implemented for a water data hub with the relevant fields and/or DataPusher+ enabled. For questions reach out to datHere, Center for Geospatial Solutions, or add an issue/discussion.

Installation and setup

cargo run --release

Run tests

cargo test

To include print statements in test output, run:

cargo test -- --nocapture