It does not quite work yet, as we wrongly pull in page 2 at the end of the
article due to yet-to-be-implemented duplicate avoidance.
Conflicts:
src/readability_lxml/readability.py
src/tests/gen_test.py
src/tests/regression.py
Sometimes you just want to generate the data files without the YAML
specification. This change lets you do that. In doing so, I switched to use
the argparse module for argument parsing.
Conflicts:
src/tests/gen_test.py
These test cases provide a baseline from which we can start improving the
readability algorithm and making sure that we do not horribly break anything.
Conflicts:
src/tests/regression.py