A command-line interface (CLI) for working with fiboa.
In order to make working with fiboa easier we have developed command-line interface (CLI) tools such as inspection, validation and file format conversions.
You will need to have Python 3.9 or any later version installed.
Run pip install fiboa-cli in the CLI to install the validator.
Optional: To install additional dependencies for specific converters,
you can for example run: pip install fiboa-cli[xyz] with xyz being the converter name.
Note on versions:
- fiboa CLI >= 0.3.0 and < 0.11.0 works with fiboa version = 0.2.0
- fiboa CLI < 0.3.0 works with fiboa version = 0.1.0
After the installation you should be able to run the following command: fiboa
You should see usage instructions and available commands for the CLI.
fiboa CLI supports various commands to work with the files:
- fiboa CLI
- Getting Started
- Commands
- Validation
- Create fiboa GeoParquet from GeoJSON
- Create fiboa GeoJSON from GeoParquet
- Inspect fiboa GeoParquet file
- Merge fiboa GeoParquet files
- Create JSON Schema from fiboa Schema
- Validate a fiboa Schema
- Improve a fiboa Parquet file
- Update an extension template with new names
- Converter for existing datasets
- Development
To validate a fiboa GeoParquet or GeoJSON file, you can for example run:
- GeoJSON:
fiboa validate example.json --collection collection.json - GeoParquet:
fiboa validate example.parquet --data
Check fiboa validate --help for more details.
The validator also supports remote files.
http://orhttps://: no further configuration is needed.s3://:s3fsneeds to be installed (runpip install .[s3]) and you may need to set environment variables. Refer here for how to define credentials.gs://:gcsfsneeds to be installed (runpip install .[gcs]). By default,gcsfswill attempt to use your default gcloud credentials or, attempt to get credentials from the google metadata service, or fall back to anonymous access.
To create a fiboa-compliant GeoParquet for a fiboa-compliant set of GeoJSON files containing Features or FeatureCollections, you can for example run:
fiboa create-geoparquet geojson/example.json -o example.parquet -c geojson/collection.json
Check fiboa create-geoparquet --help for more details.
To create one or multiple fiboa-compliant GeoJSON file(s) for a fiboa-compliant GeoParquet file, you can for example run:
- GeoJSON FeatureCollection:
fiboa create-geojson example.parquet -o dest-folder - GeoJSON Features (with indentation and max. 100 features):
fiboa create-geojson example.parquet -o dest-folder -n 100 -i 2 -f
Check fiboa create-geoparquet --help for more details.
To look into a fiboa GeoParquet file to get a rough understanding of the content, the following can be executed:
fiboa describe example.parquet
Check fiboa describe --help for more details.
Merges multiple fiboa datasets to a combined fiboa dataset:
fiboa merge ec_ee.parquet ec_lv.parquet -o merged.parquet -e https://fiboa.github.io/hcat-extension/v0.1.0/schema.yaml -i ec:hcat_name -i ec:hcat_code -i ec:translated_name
Check fiboa merge --help for more details.
To create a JSON Schema for a fiboa Schema YAML file, you can for example run:
fiboa jsonschema example.json --id=https://fiboa.github.io/specification/v0.1.0/geojson/schema.json -o schema.json
Check fiboa jsonschema --help for more details.
To validate a fiboa Schema YAML file, you can for example run:
fiboa validate-schema schema/schema.yaml
Check fiboa validate-schema --help for more details.
Various "improvements" can be applied to a fiboa GeoParquet file. The commands allows to
- change the CRS (
--crs) - change the GeoParquet version (
-gp1) and compression (-pc) - add/fill missing perimeter/area values (
-sz) - fix invalid geometries (
-g) - rename columns (
-r)
Example:
fiboa improve file.parquet -o file2.parquet -g -sz -r old=new -pc zstd
Check fiboa improve --help for more details.
Once you've created and git cloned a new extension, you can use the CLI to update all template placeholders with proper names.
For example, if your extension is meant to have
- the title "Timestamps Extension",
- the prefix
ts(e.g. fieldts:createdorts:updated), - is hosted at
https://github.io/fiboa/timestamps-extension(organization:fiboa, repositorytimestamps-extension), - and you run fiboa in the folder of the extension.
Then the following command could be used:
fiboa rename-extension . -t Timestamps -p ts -s timestamps-extension -o fiboa
Check fiboa rename-extension --help for more details.
The CLI ships various converters for existing datasets.
To get a list of available converters/datasets with title, license, etc. run:
fiboa converters
Use any of the IDs from the list to convert an existing dataset to fiboa:
fiboa convert de_nrw
See Implement a converter for details about how to
To install in development mode run pip install -e . in this folder.
For the tests first run pip install -r requirements-dev.txt to install pytest.
Then you can run pytest to execute the tests.
All code is formatted with a specific ruff style, so all code looks the same. Code
will be formatted automatically. Your pull-requests will be tested for compliance.
Run pre-commit run --all-files to format your local code manually (or configure it in your IDE).
Install the pre-commit hook with pre-commit install, so you never commit incorrectly formatted code.
The following high-level description gives an idea how to implement a converter in fiboa CLI:
- Create a new file in
fiboa_cli/datasetsbased on thetemplate.py - Implement the
convert()function / test it / run it - Add missing dependencies into a separate dependency group in
setup.py - Add the converter to the list above
- Create a PR to submit your converter for review
An in-depth guide how to create a cloud-native fiboa dataset using fiboa CLI is available at: https://github.com/fiboa/data/blob/main/HOWTO.md