You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
python-readability/readability
Richard Harding edccec5d3b Work on why we have an empty <body/> tag
- Seems to come because the sanitizer ends up with two nodes, not one. The
first is an empty body, the second is the article div.
- Fix up the tabs so we can work with the file. Needs lots of pep8 love.
- Implement an initial hack that at least gets it working atm.
- Start to add test cases, sample html files we can test against, etc.
12 years ago
..
__init__.py Expose Document in readability package 13 years ago
cleaners.py Updated scoring algorithm to match readability.js v1.7.1 13 years ago
debug.py Fixed indentation, encoding issue and README bug. Thanks to Greg Jastrab. Bump version to 0.2.3 13 years ago
encoding.py Fixed indentation, encoding issue and README bug. Thanks to Greg Jastrab. Bump version to 0.2.3 13 years ago
htmls.py Sorted out unicode issues, thanks to Lee Semel. 13 years ago
readability.py Work on why we have an empty <body/> tag 12 years ago