This is great, thanks!
As this version reads from a file, I guess one don't have to import html2text?
And to read the path to the file I want to test I added:
test_file = sys.argv[1]
and changed the 'with open' line to
with open(test_file, 'r') as content_file:
That way its quick and easy to just run python read.py /path/to/testfile.txt
to check a file.
Quick, easy and fun way to check readability before publishing. Who knows, maybe it will improve my writing?