Sort:  

Text is very compressible... if very repetitive, then it becomes almost a zero problem.

Currently blocks get compressed by itself...

I am not saying this will be a feature, but on storage technology, there are technologies advancements called similarity and deduplication. Once these get implemented on top of decentralized technologies like blockchains, these problems disappear.

Will take some time for this to be "standard" but eventually it will come because the hardware will enable it enough.