Sites like Project Gutenberg offer thousands of public domain books. Using their "mirror" sites allows researchers to download entire libraries in bulk.
In an era of complex file types, the humble .txt file remains king for three reasons:
They take up minimal disk space, making 10,000 files surprisingly lightweight.