Weekly statistics about Source Cooperative's data storage and usage, generated from S3 inventory reports.
This repository contains three types of CSV reports, updated weekly:
accounts/
Storage metrics grouped by account (data contributor).
Filename format: accounts/YYYYMMDD.csv
Columns:
  account - Account identifier
  repositories - Number of repositories per account
  objects - Total file count
  storage_gb - Storage used in gigabytes
  avg_object_size_mb - Average file size in megabytes
  oldest_file - Timestamp of oldest file
  newest_file - Timestamp of newest file

repositories/
Detailed breakdown by individual repository.
Filename format: repositories/YYYYMMDD.csv
Columns:
  account - Account identifier
  repository - Repository name
  objects - Total file count
  storage_gb - Storage used in gigabytes
  avg_object_size_mb - Average file size in megabytes
  oldest_file - Timestamp of oldest file
  newest_file - Timestamp of newest file

source/
High-level metrics for the entire Source Cooperative platform.
Filename format: source/YYYYMMDD.csv
Columns:
  metric - Metric name
  value - Metric value
Metrics included:
YYYYMMDD.csv format based on the report generation date
[account]/[repository] folder structure

These statistics are automatically generated from S3 inventory reports using AWS Athena queries. The source code for the generation process is available at github.com/source-cooperative/source-stats.
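As a rough illustration of how these reports might be consumed, the sketch below picks the most recent report from its YYYYMMDD.csv filename and loads the accounts and source reports with Python's standard library. It assumes each CSV starts with a header row using the column names listed above, and that `repositories` and `objects` are integers; neither is confirmed here. The function names, sample rows, and the `example_metric` name are hypothetical and are not real Source Cooperative data.

```python
import csv
import io
from datetime import date, datetime
from pathlib import PurePosixPath


def report_date(path: str) -> date:
    """Parse the generation date from a path such as 'accounts/20240108.csv'."""
    return datetime.strptime(PurePosixPath(path).stem, "%Y%m%d").date()


def latest_report(paths: list[str]) -> str:
    """Return the path whose filename carries the most recent date."""
    return max(paths, key=report_date)


def load_accounts(csv_text: str) -> list[dict]:
    """Read an accounts report, converting the numeric columns.

    Assumes a header row with the documented column names.
    Timestamps are left as strings because their format is not specified.
    """
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        row["repositories"] = int(row["repositories"])
        row["objects"] = int(row["objects"])
        row["storage_gb"] = float(row["storage_gb"])
        row["avg_object_size_mb"] = float(row["avg_object_size_mb"])
        rows.append(row)
    return rows


def load_source_metrics(csv_text: str) -> dict[str, str]:
    """Read a source report into a metric -> value mapping (values kept as strings)."""
    return {r["metric"]: r["value"] for r in csv.DictReader(io.StringIO(csv_text))}


# Hypothetical sample rows, not real figures.
SAMPLE_ACCOUNTS = """\
account,repositories,objects,storage_gb,avg_object_size_mb,oldest_file,newest_file
example-a,2,100,1.5,15.36,2023-01-01T00:00:00Z,2024-01-01T00:00:00Z
example-b,1,10,4.0,409.6,2023-06-01T00:00:00Z,2024-01-05T00:00:00Z
"""

if __name__ == "__main__":
    newest = latest_report(["accounts/20240101.csv", "accounts/20240108.csv"])
    print("latest report:", newest)

    accounts = load_accounts(SAMPLE_ACCOUNTS)
    largest = max(accounts, key=lambda r: r["storage_gb"])
    print("largest account:", largest["account"], largest["storage_gb"], "GB")
```

Because the filenames sort chronologically as plain strings within a single folder, `max(paths)` would also work there; parsing the date additionally rejects filenames that do not match the expected format.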
For questions about this data or to report issues, please contact Source Cooperative support or create an issue in the source-stats repository.