This page is experimental and still in development.
At this time, canonical sources are still hosted on data.ecoforecast.org.
Arrow provides an easy way to access remote parquet files from most languages widely used in data science. Here we access all forecasts submitted to a particular theme. (Users who need only a single model should specify it in the path for faster access. The STAC catalog can be used to explore available models.)
The examples below show 'cloud-native' connections to the data -- 'lazy' connections that do not download the entire asset, but allow us to filter, subset, and operate directly on the remote data product.
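As a minimal sketch of such a lazy connection from Python, pyarrow can open the remote parquet files as a dataset. The endpoint is the host named above; the bucket prefix, the theme name, and the hive-style `model_id=` directory are placeholders assumed for illustration and should be replaced with the real layout (the STAC catalog is the place to look it up).

```python
ENDPOINT = "data.ecoforecast.org"         # host named in the text above
BUCKET_PREFIX = "example-bucket/parquet"  # hypothetical; substitute the real bucket and prefix


def theme_path(theme, model_id=None):
    """Build the object-store path for one theme, optionally narrowed to one model."""
    path = f"{BUCKET_PREFIX}/{theme}"
    if model_id is not None:
        # Assumed hive-style partition directory; check the actual layout.
        path += f"/model_id={model_id}"
    return path


def open_forecasts(theme, model_id=None):
    """Return a pyarrow Dataset; only file listings and schema metadata are read here."""
    import pyarrow.dataset as ds
    from pyarrow import fs

    # Anonymous access is an assumption; add credentials if the bucket requires them.
    s3 = fs.S3FileSystem(endpoint_override=ENDPOINT, anonymous=True)
    return ds.dataset(
        theme_path(theme, model_id),
        filesystem=s3,
        format="parquet",
        partitioning="hive",
    )
```

Calling `open_forecasts("example_theme").head(5)` would then fetch only the first few rows rather than the whole asset.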
At this time, duckdb access is substantially faster than arrow.
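A rough Python sketch of the duckdb route is shown here. It assumes the `httpfs` extension can be installed, that the bucket allows unauthenticated reads, and that path-style URLs are appropriate for this endpoint; the bucket prefix and theme name are hypothetical placeholders.

```python
ENDPOINT = "data.ecoforecast.org"         # host named in the text above
BUCKET_PREFIX = "example-bucket/parquet"  # hypothetical; substitute the real bucket and prefix


def parquet_glob(theme):
    """Glob matching every parquet file under one theme (layout is assumed)."""
    return f"s3://{BUCKET_PREFIX}/{theme}/**/*.parquet"


def connect():
    """Open an in-memory duckdb connection pointed at the S3-compatible endpoint."""
    import duckdb

    con = duckdb.connect()
    con.execute("INSTALL httpfs")
    con.execute("LOAD httpfs")
    con.execute(f"SET s3_endpoint = '{ENDPOINT}'")
    con.execute("SET s3_url_style = 'path'")  # assumption: path-style addressing
    return con


def preview(theme, n=10):
    """Fetch the first n rows; the relation stays lazy until fetchall() runs."""
    con = connect()
    rel = con.sql(
        f"SELECT * FROM read_parquet('{parquet_glob(theme)}', hive_partitioning = true)"
    )
    return rel.limit(n).fetchall()
```

Filters added to the relation before fetching are evaluated by duckdb against the remote files, so only the matching data needs to be transferred.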
R users can get a dplyr-compatible lazy remote tibble as follows:
ibis provides a more Pythonic interface to SQL:
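One way this might look, as a sketch under stated assumptions: ibis with its duckdb backend, the endpoint named above, and a hypothetical bucket prefix, theme name, and `model_id` column that should be replaced with the real ones.

```python
ENDPOINT = "data.ecoforecast.org"         # host named in the text above
BUCKET_PREFIX = "example-bucket/parquet"  # hypothetical; substitute the real bucket and prefix


def parquet_glob(theme):
    """Glob matching every parquet file under one theme (layout is assumed)."""
    return f"s3://{BUCKET_PREFIX}/{theme}/**/*.parquet"


def forecasts_table(theme):
    """Return a lazy ibis table expression backed by duckdb."""
    import ibis

    con = ibis.duckdb.connect()
    # Point duckdb's S3 client at the endpoint before registering the files.
    con.raw_sql("INSTALL httpfs")
    con.raw_sql("LOAD httpfs")
    con.raw_sql(f"SET s3_endpoint = '{ENDPOINT}'")
    con.raw_sql("SET s3_url_style = 'path'")  # assumption: path-style addressing
    return con.read_parquet(parquet_glob(theme))


def one_model(theme, model_id, n=10):
    """Filter to a single model and materialize the first n rows as a pyarrow Table."""
    t = forecasts_table(theme)
    # `model_id` is an assumed column name; adjust to the actual schema.
    expr = t.filter(t["model_id"] == model_id).limit(n)
    return expr.to_pyarrow()
```

Nothing is downloaded when the expression is built; ibis compiles it to SQL and duckdb runs it when `to_pyarrow()` is called.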