Provide a fast and simpler way to get up-to-date CSV file export #1670
Open
Labels
Data export
We export data nightly as CSV, MongoDB… See: https://world.openfoodfacts.org/data
environment
P2
🚅 Performance
Summary:
CSV file is an easy way to consume OFF data. As of today, we have to download the whole database (2 GB!). It's quite long and consuming precious OFF server resources.
Expected behaviour:
Quick download based on diff or decentralized resources (peer-to-peer).
To be investigated
zsync investigation
zsync looks promising:
Implementation:
apt install zsync # debian, ubuntu
zsyncmake en.openfoodfacts.org.products.csv # create en.openfoodfacts.org.products.csv.zsync (~4 Mb)
(implement in./script/export_database.pl
?)-z
option can save more bandwith but building the compressed file take up to 2 minutes at 99% cpuzsync https://static.openfoodfacts.org/data/en.openfoodfacts.org.products.csv.zsync
Part of
The text was updated successfully, but these errors were encountered: