Commit Graph

172 Commits (fix-sfgate)
 

Author SHA1 Message Date
Nicolas Perriault 2c5ba594dd Refs #209 - Increase score for elements containing large amount of text. 9 years ago
Gijs 8510106638 Merge pull request #211 from mozilla/add-support-for-wbr-tag
Added support for the wbr html tag to JSDOMParser.
9 years ago
Nicolas Perriault 8806e999d1 Added support for the wbr html tag to JSDOMParser. 9 years ago
Gijs a801846a45 Merge pull request #204 from mozilla/tweak-great-grandparent-scoring
Updated great grandparent node scoring.
9 years ago
Gijs 5bf56177be Merge pull request #207 from mozilla/better-dm
Improved embedded video elements detection.
9 years ago
Nicolas Perriault ae0833522c Improved embedded video elements detection. 9 years ago
Nicolas Perriault 46304bb5fe Updated great grandparent node scoring. 9 years ago
Nicolas Perriault 66071e573d Merge pull request #194 from mozilla/score-intermediary-headers
Fixes #180 - Score intermediary headings.
9 years ago
Nicolas Perriault 88ef3893b5 Fixes #180 - Score intermediary headings. 9 years ago
Nicolas Perriault 6344b3f736 Merge pull request #196 from mozilla/strip-related-contents
Refs #195 - Exclude nodes likely to be related content.
9 years ago
Nicolas Perriault dc1b2c9fa0 Refs #195 - Exclude nodes likely to be related content. 9 years ago
Margaret Leibovic affa0edbdd Merge pull request #197 from mozilla/support-dailymotion-videos
Ref #195 - Add support for dailymotion videos.
9 years ago
Nicolas Perriault cc18cb5787 Ref #195 - Add support for dailymotion videos. 9 years ago
Nicolas Perriault 4721837e27 Merge pull request #193 from mozilla/score-great-grandparent-nodes
Fixes #113 - Score great grandparent nodes.
9 years ago
Nicolas Perriault 9dbc009376 Fixes #113 - Recursive node ancestor scoring. 9 years ago
Gijs f71ec9ceae Merge pull request #191 from mozilla/preserve-list-items
Fixes #183 - Preserve list items.
9 years ago
Nicolas Perriault 44879722b6 Fixes #183 - Preserve list items. 9 years ago
Alexis Métaireau 5912e0c872 Add Firefox User-Agent when generating the test case. 9 years ago
Gijs 79aa2fca87 Merge pull request #189 from mozilla/dont-remove-headings
Fixes #150 - Keep article intermediary headings.
9 years ago
Margaret Leibovic af6da2a87d Merge pull request #190 from mozilla/improved-author-meta-extraction
Improved author metadata detection.
9 years ago
Nicolas Perriault 0d696051e9 Merge pull request #188 from gijsk/improve-isprobably-readerable
Make isProbablyReaderable include <pre>, and deal with long <br>-separat...
9 years ago
Nicolas Perriault 7aee44adb2 Improved author metadata detection. 9 years ago
Gijs Kruitbosch 5f184053cd Make isProbablyReaderable include <pre>, and deal with long <br>-separated paragraphs and/or shorter-than-5-paragraph text and such. 9 years ago
Gijs Kruitbosch d9a475e8d4 Fix benchmark script, add isProbablyReaderable benchmark 9 years ago
Nicolas Perriault 2451a07a7d Fixes #150 - Keep article intermediary headings. 9 years ago
Gijs 62f5d43c70 Merge pull request #187 from leibovic/classnames
Fixes #184 - Don't strip class names from article content
9 years ago
Margaret Leibovic 319a50b4f0 Fixes #184 - Don't strip class names from article content 9 years ago
Gijs 49e40768aa Merge pull request #185 from mozilla/score-section-tags-by-default
Fixes #139 #143: Added more weight to section tags.
9 years ago
Nicolas Perriault f6ffa6acde Fixes #139 #143: Added more weight to section tags. 9 years ago
Gijs 32d8a526f9 Merge pull request #175 from mozilla/improve-title-extraction
Fixes #174 - Remove aggressive article title formatting rule.
9 years ago
Nicolas Perriault 58cd789cd3 Improved title extraction 'algorithm'. 9 years ago
Gijs 647658a47b Merge pull request #172 from mozilla/js-beautify
Fixes #130 - Using js-beautify for HTML formatting.
9 years ago
Nicolas Perriault de89036cd5 Fixes #130 - Using js-beautify for HTML formatting. 9 years ago
Gijs b37ff08bc7 Merge pull request #169 from mozilla/clean-footer-tags
Fixes #163 - Avoid including footer tag contents.
9 years ago
Nicolas Perriault 12c6a11f67 Fixes #163 - Avoid including footer tag contents. 9 years ago
Gijs 87c0bc0144 Merge pull request #167 from mozilla/better-headline-extraction
Fixes #164 - Add support for title alt semantic metadata.
9 years ago
Nicolas Perriault 6eeabf90c1 Fixes #164 - Add support for title alt semantic metadata. 9 years ago
Margaret Leibovic eb7ec7231e Merge pull request #135 from gijsk/links
Bug 1147584 - Don't strip unlikely <a>s, and replace useless <a>s with textContent
9 years ago
Gijs Kruitbosch 0ff82de0f4 Implement createTextNode, do more relaxed escaping there, update testcase. 9 years ago
Margaret Leibovic 37a8cd4171 Bug 1147584 - Don't remove unlikely <a> tags, and replace <a> tags with their text content if they won't be useful links 9 years ago
Gijs a6014f5854 Merge pull request #132 from gijsk/heise-ad-prioritization
Don't look at banners and skyscrapers, remove <noscript> elements
9 years ago
Gijs Kruitbosch a6346a0ad4 Don't look at banners and skyscrapers, remove <noscript> elements 9 years ago
Nicolas Perriault 1fbed57832 Merge pull request #129 from mozilla/readability-constructor-options
Refs #128 - Add support for options to Readability constructor.
9 years ago
Nicolas Perriault 4424b0bad7 Refs #128 - Add support for options to Readability constructor. r=@gijsk 9 years ago
Nicolas Perriault d9fc21155e Merge pull request #123 from gijsk/fix-test-result-stream2
Fixed test result output being sent at once
9 years ago
Gijs Kruitbosch c53ca31907 Fixed test result output being sent at once 9 years ago
Gijs 0251bc0817 Merge pull request #120 from mozilla/drop-social-buttons
Refs #117 - Drop social/share buttons.
9 years ago
Nicolas Perriault 4d41f5e4ed Refs #117 - Drop social/share buttons. 9 years ago
Nicolas Perriault 56b45e3fa9 Merge pull request #100 from mozilla/fix-jsdomparser-localName-case
Fixes #99 - Fixed JSDOMParser tag name case handling.
9 years ago
Nicolas Perriault d725ebc953 Fixes #99: JSDOMParser tag name case handling. r=@gijsk 9 years ago