4 Commits (master)

Author | SHA1 | Message | Date
anekos | 6842ea906e | Fix causing lxml error | 4 years ago
Yuri Baburov | dc648e7d0b | Added a test for issue #48 but can't reproduce it -- seems to work fine. | 9 years ago
Yuri Baburov | 1fac7e685a | Added a feature to allow more images per article (with a test) | 9 years ago
Richard Harding | edccec5d3b | Work on why we have an empty <body/> tag | 12 years ago
    - Seems to come because the sanitizer ends up with two nodes, not one. The
      first is an empty body, the second is the article div.
    - Fix up the tabs so we can work with the file. Needs lots of pep8 love.
    - Implement an initial hack that at least gets it working atm.
    - Start to add test cases, sample html files we can test against, etc.