79GB for all of the English articles minus the media. That's smaller than I would have guessed. You can fit this large slice of our culture on a $20.99 flash drive with 49GB left over. That seems like a good econo-cultural indicator: storage cost per Wikipedia. I wish I could short that index.
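Back-of-the-envelope version of that index, as a rough sketch. It assumes the drive is 128GB (the 79GB dump plus the 49GB left over) and that cost scales linearly with capacity; the variable names are just mine.

```python
# Rough "storage cost per Wikipedia" estimate from the figures in this comment.
DUMP_GB = 79          # size of the English dump quoted here
FREE_GB = 49          # space left over on the drive
DRIVE_PRICE = 20.99   # USD, price of the flash drive

# Assumption: drive capacity is simply used + free space (128GB).
drive_gb = DUMP_GB + FREE_GB

# Assumption: price scales linearly with capacity.
price_per_gb = DRIVE_PRICE / drive_gb
cost_per_wikipedia = price_per_gb * DUMP_GB

print(f"drive: {drive_gb}GB, ${price_per_gb:.3f}/GB")
print(f"cost per Wikipedia: ${cost_per_wikipedia:.2f}")
```

Under those assumptions it comes out to roughly $13 per Wikipedia.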
When thinking about this sort of thing I always find it fun to consider how we perceive information density. I could hand you a USB drive and it could either contain a significant chunk of the sum of human knowledge, taking you lifetimes to even skim through, or it could contain a 2.5-hour movie you'd think nothing of.
There are multiple layers at work there, of course, but that's what makes it fun to think about.
>79GB for all of the English articles minus the media.
I think that's an error on the GitHub page: wikipedia_en_all_novid is all text + pictures, just no videos. Text alone is ~15GB zipped. My 2014 media dump was ~76GB, so that 80GB for full text + media checks out.