Can anyone with GridFS experience tell me if data duplication is to be expected?
I’m having massive duplication issues. 250MB becomes 1,500MB in the DB
I’ve started with 250MB of CSV files, that I need to put in storage for Meteor to retrieve at a future date. After uploading my total database size is 1,500MB (before, my it was just 20MB).
The relevant collections:
cfs._tempstore.chunks - 40 KB
cfs.files.filerecord - 168 KB
cfs_gridfs._tempstore.chunks - 473 MB
cfs_gridfs._tempstore.files - 40 KB
cfs_gridfs.filesStore.chunks - 537 MB
cfs_gridfs.filesStore.files - 40 KB
1 - Where is the remaining +/- 500MB going?
2 - Why is tempstore not clearing the duplicate chunks?
3 - Why is cfs_gridfs.filesStore.chunks, double the size of the original files?
4 - Is this GridFS behavior or just a CollectionFS pattern?