Weekly statistics about Source Cooperative's data storage and usage, generated from S3 inventory reports.
This repository contains three types of CSV reports, updated weekly:
accounts/
)Storage metrics grouped by account (data contributor).
Filename format: accounts/YYYYMMDD.csv
Columns:
account
- Account identifierrepositories
- Number of repositories per accountobjects
- Total file countstorage_gb
- Storage used in gigabytesavg_object_size_mb
- Average file size in megabytesoldest_file
- Timestamp of oldest filenewest_file
- Timestamp of newest filerepositories/
)Detailed breakdown by individual repository.
Filename format: repositories/YYYYMMDD.csv
Columns:
account
- Account identifierrepository
- Repository nameobjects
- Total file countstorage_gb
- Storage used in gigabytesavg_object_size_mb
- Average file size in megabytesoldest_file
- Timestamp of oldest filenewest_file
- Timestamp of newest filesource/
)High-level metrics for the entire Source Cooperative platform.
Filename format: source/YYYYMMDD.csv
Columns:
metric
- Metric namevalue
- Metric valueMetrics included:
YYYYMMDD.csv
format based on the report generation date[account]/[repository]
folder structureThese statistics are automatically generated from S3 inventory reports using AWS Athena queries. The source code for the generation process is available at github.com/source-cooperative/source-stats.
For questions about this data or to report issues, please contact Source Cooperative support or create an issue in the source-stats repository.