Commit Graph

  • d5af8e91a9 No more stderr logging, only raising exception. George Alegre 2016-12-13 19:00:46 -0300
  • 86e5a3f3d2 Update comment example Vahid Kharazi 2016-08-09 06:50:03 +0430
  • e4efc87a20 Update readability.py Yuri Baburov 2016-07-19 17:30:23 +0600
  • b20d5c15ef Improved Document class documentation Yuri Baburov 2016-07-19 17:25:52 +0600
  • 08f0fe6482 Merge cc5f0cbe9e into b6e5921f27 eromoe 2016-07-18 14:17:29 +0000
  • b6e5921f27 Merge pull request #85 from lwm/add-badge Yuri Baburov 2016-07-15 21:16:38 +0700
  • aafcf52e58 add travis badge Luke Murphy 2016-07-15 16:15:06 +0200
  • 5337adc590 Merge pull request #82 from lwm/fixup-readme Yuri Baburov 2016-07-15 21:11:51 +0700
  • 86cde20ba6 Merge pull request #84 from lwm/add-travis-file Yuri Baburov 2016-07-15 21:10:40 +0700
  • a1d6bbcd3f add travis file Luke Murphy 2016-07-14 22:00:40 +0200
  • 82837e4b5c makeover for the README [ci skip] Luke Murphy 2016-07-11 16:13:27 +0200
  • cc5f0cbe9e change to default td Mithril 2016-07-04 16:27:54 +0800
  • 58bf7188b6 calculate node score from deep to shallow Mithril 2016-07-04 15:51:29 +0800
  • cfc9d00783 Fixed an issue with passing a list ofkeywords in string format. Zamubafoo 2016-06-02 22:27:40 -0400
  • 75f2ea0d00 Version bump to 0.6.2 Yuri Baburov 2016-04-09 14:34:00 +0600
  • 47e473fb91 Merge pull request #73 from alphapapa/patch-1 Yuri Baburov 2016-04-09 14:29:07 +0600
  • 8443a87f5c Update readability.py alphapapa 2016-04-03 21:38:17 -0500
  • 5fc2d3684a Use Mozilla User-Agent alphapapa 2016-04-03 21:32:36 -0500
  • 65d1ebb06d Fixed #70 and added xpath option Yuri Baburov 2015-09-29 18:36:25 +0200
  • fae95bad45 Bump to v0.6.1 -- fixed logging 0.6.1 Yuri Baburov 2015-08-26 16:14:04 +0600
  • c0d794fdd8 Update readability.py Yuri Baburov 2015-08-26 15:11:12 +0500
  • 398f6ad748 Bump to 0.6.0.5 0.6.0.5 Yuri Baburov 2015-07-27 12:00:58 +0600
  • 8ff11e68a6 Debugging improvements. Bump to 0.6.0.5 Yuri Baburov 2015-07-27 11:59:17 +0600
  • fcdbe563a5 Fixed #49. Bump to 0.6.0.4 Yuri Baburov 2015-07-27 10:06:28 +0600
  • c51886d923 Bump to 0.6.0.3 Fixed installation. 0.6.0.3 Yuri Baburov 2015-07-27 09:38:44 +0600
  • 24bb20c761 Added dev branch features. Bumped to version 0.6 0.6 Yuri Baburov 2015-07-26 19:29:50 +0600
  • e8f86bdcf9 Several updates from dev version. dev Yuri Baburov 2015-07-26 15:17:43 +0600
  • 154658798b Merge pull request #64 from martinth/master Yuri Baburov 2015-07-26 14:11:37 +0500
  • 13c6b4cf0e Merge e634273705 into 83a7ce67c1 dlarochelle 2015-07-24 11:40:05 +0000
  • 83a7ce67c1 Merge pull request #68 from digitaldavenyc/python3 Yuri Baburov 2015-07-24 11:47:41 +0500
  • 1ac3e019bd fix for setup, convert print to python 3 compatible format Dave Padovano 2015-07-23 14:05:12 -0400
  • 1aabdb3d27 Merge pull request #67 from horva/fix-logging-config Yuri Baburov 2015-07-13 09:10:35 +0500
  • f0ff9b2425 Move logging.basicConfig to main function Marko Horvatic 2015-06-24 16:21:04 +0200
  • e2bc1ea055 Improved #65 which has given warning, added cssselect lib, bumped to 0.5.1 Yuri Baburov 2015-05-06 14:33:14 +0600
  • 77f8e3d5ed Merge 2686e3df7c into 1cb17d919b Asad Dhamani 2015-05-06 07:47:07 +0000
  • 1cb17d919b Merge pull request #65 from avalanchy/best_elem_is_root Yuri Baburov 2015-05-06 12:44:41 +0500
  • bf9e7404fa Failure if best_elem is root (fix #58) Mariusz Osiecki 2015-05-06 09:34:55 +0200
  • 386e48d29b Fixes checking of declared encodings in get_encoding. Martin Thurau 2015-04-30 11:47:32 +0200
  • 046d2c10c3 Fixes regex declaration in get_encoding. Martin Thurau 2015-04-29 23:36:50 +0200
  • ce7ca26835 Adds compatibility `raise_with_traceback` method to support different `raise` syntax Martin Thurau 2015-04-29 23:35:18 +0200
  • 3ac56329e2 Corrects some things were 2to3 did to much. Martin Thurau 2015-04-29 19:33:43 +0200
  • aa4132f57a Adds Python 3.4 support. Martin Thurau 2015-04-29 16:18:21 +0200
  • 13cca1dd19 Adds tox configuration. Martin Thurau 2015-04-29 14:55:03 +0200
  • 1d4ee9d421 Releasing as version 0.5 Yuri Baburov 2015-04-27 16:00:08 +0600
  • 987570bef0 Updated package links for Python 2.7 and Python 3 support Yuri Baburov 2015-04-27 15:59:31 +0600
  • dc648e7d0b Added a test for issue #48 but can't reproduce it -- seems to work fine. Yuri Baburov 2015-04-27 15:58:39 +0600
  • c715426584 Releasing as version 0.4 Yuri Baburov 2015-04-27 14:54:13 +0600
  • 1fac7e685a Added a feature to allow more images per article (with a test) Yuri Baburov 2015-04-27 14:35:00 +0600
  • c6796195a7 Fixed makefile testing. Yuri Baburov 2015-04-27 14:32:40 +0600
  • d04d41b749 Insert text inside iframe for correct output Miguel Galves 2014-02-07 18:27:15 -0200
  • be2a1c4646 Let width and height attributes Miguel Galves 2014-02-07 18:26:47 -0200
  • f1759c1404 Allows iframes containing youtube or vimeo videos. People like them Miguel Galves 2014-02-07 18:04:15 -0200
  • 40e430c27d Makefile updates Yuri Baburov 2015-04-27 13:10:36 +0600
  • 0a082ff020 Fix for Mac OS X 10.10 Yuri Baburov 2015-04-27 13:10:21 +0600
  • 8048160d66 WIP: update to support python2 and python3 Yuri Baburov 2015-04-19 15:05:50 +0600
  • 2686e3df7c Switch to UnicodeDammit for encoding detection Asad Dhamani 2015-04-04 07:31:58 +0530
  • 2abc32d3d2 Merge 5a4974496f into 332ad810de Anders Norås 2015-03-25 22:29:19 +0000
  • d203b65486 Merge 6a1c779edd into 332ad810de Alexander Sapronov 2015-03-17 07:02:25 +0000
  • 332ad810de Bumped to 0.3.0.6 0.3.0.6 Yuri Baburov 2015-03-16 21:38:17 +0500
  • e4bcbe57d7 Fixes #53 Yuri Baburov 2015-03-16 22:19:36 +0600
  • 6a1c779edd Fix. Add tox warmonger 2015-02-23 20:59:29 +0600
  • 68be580610 Add parenthesis for `print` to make it Python 3 compatible Sumin Byeon 2015-02-15 22:38:44 +0900
  • 297b8cc6a3 refactoring warmonger 2015-02-12 17:21:34 +0600
  • 2ceec5d140 refactoring warmonger 2015-02-12 17:19:48 +0600
  • 950366ac71 refactoring warmonger 2015-02-12 17:16:53 +0600
  • 6f1b969f21 refactoring warmonger 2015-02-12 17:10:58 +0600
  • 3002ff5dc8 refactoring warmonger 2015-02-12 17:08:28 +0600
  • 057868406a refactoring warmonger 2015-02-12 17:04:23 +0600
  • b1b1e99abc autopep8, + .travis warmonger 2015-02-12 17:03:14 +0600
  • e634273705 Parameterize numerous hard coded numbers. David Larochelle 2015-01-23 11:16:34 -0500
  • ea28266265 Parametarize CONTENT_SCORE_DIV_BONUS, CONTENT_SCORE_PRE_TD_BONUS, CONTENT_SCORE_ADDRESS_OL_PENALTY, CONTENT_SCORE_HEADER_PENALTY David Larochelle 2015-01-21 16:46:44 -0500
  • 1923f4d1a7 Parameterize SIBLING_CONTENT_LENGTH_SUM David Larochelle 2015-01-21 16:30:33 -0500
  • 0956179894 Parameterize MIN_EMBED_COMMENT_LENGTH David Larochelle 2015-01-21 15:07:10 -0500
  • 0242c2a5b9 Replace hard coded constant with class variables for LOW_WEIGHT_LINK_DENSITY_THRESHOLD and HIGH_WEIGHT_LINK_DENSITY_THRESHOLD David Larochelle 2015-01-15 15:38:29 -0500
  • 15703186d5 Replace hard coded constant with class variables for P_TO_INPUT_RATIO and HEADER_LINK_DENSITY_THRESHOLD David Larochelle 2015-01-15 15:31:01 -0500
  • e69c1d09a2 Replace hard coded constant with class variables for COMMA_COUNT David Larochelle 2015-01-15 15:00:14 -0500
  • f1386e2805 Replace hard coded constant with class variables for LONG_NODE_LINK_DENSITY_THRESHOLD and LONG_NODE_LENGTH David Larochelle 2015-01-15 14:46:27 -0500
  • 74ff927fbc Replace hard coded constants with class variables for MIN_SIBLING_SCORE_THRESHOLD and BEST_SCORE_MULTIPLIER_THRESHOLD David Larochelle 2015-01-15 14:37:18 -0500
  • 287f902d34 Merge a355c6ea72 into aeb4f4c782 Telofy 2015-01-14 06:10:46 +0000
  • e3e02393e7 Make sure the result is deterministic if 2 candidates have the same content_score. David Larochelle 2015-01-13 13:44:35 -0500
  • aeb4f4c782 Merge pull request #59 from seomoz/mac_10_10 Yuri Baburov 2015-01-13 17:41:30 +0500
  • c8c2f8809c Fix mac version comparison in setup.py for 10.10 Matthew Peters 2015-01-12 22:19:09 -0800
  • 2d4cfdb2c8 Merge pull request #56 from nathanathan/patch-1 Yuri Baburov 2014-12-20 02:11:53 +0500
  • 75e2e0cb3a Defaulting to utf-8 when chardet returns None Nathan Breit 2014-12-18 18:48:22 -0800
  • 71294f094f Encoding improvements Yuri Baburov 2014-12-05 23:10:10 +0100
  • a355c6ea72 Resolved problem with title.text being None Telofy 2014-11-06 14:51:04 +0100
  • 28923e8a73 Merge 44d61e557f into 0c2f29ed0d palkeo 2014-09-27 22:43:33 +0000
  • 0c2f29ed0d Version bump. Yuri Baburov 2014-09-22 15:32:46 +0700
  • 638f73f6a2 Fix for #52: <input type="hidden"> are not counted any more for "form removal" heuristic. Yuri Baburov 2014-09-22 15:31:31 +0700
  • 5855beb32a WIP; Backported features from stable branch Yuri Baburov 2014-09-22 15:29:49 +0700
  • ae1f1adfff Switched to use python logging module. Added xpath option (undocumented yet). Yuri Baburov 2014-08-25 09:36:40 +0700
  • 44d61e557f Python3 port. palkeo 2014-07-16 22:42:15 -0300
  • 2fab5ffa6b Merge pull request #48 from mperdomo1/master Yuri Baburov 2014-05-18 15:27:06 +0700
  • 3a43a3fe7e Added code to check declared encodings first and check them from kennethreitz/requests/utils.py. Also I added some superset encodings I have found in Chinese pages that are mishandled by chardet/character declarations. Mark Perdomo 2014-05-13 15:09:47 +0800
  • 1a4d3697bc Allow latest lxml on Mac OS X 10.9, see issue #39 for comments and setup instructions 0.3.0.3 Yuri Baburov 2014-04-02 15:16:19 +0700
  • f7704e3e0e Insert text inside iframe for correct output Miguel Galves 2014-02-07 18:27:15 -0200
  • 0c6ed25028 Let width and height attributes Miguel Galves 2014-02-07 18:26:47 -0200
  • 8d12e33281 Allows iframes containing youtube or vimeo videos. People like them Miguel Galves 2014-02-07 18:04:15 -0200
  • c410eb110d Ignores .Python dir created by virtualenv Miguel Galves 2014-02-07 18:00:21 -0200
  • 5a4974496f Increased the number of images allowed compared to the number of paragraphs. New algorithm multiplies the number of paragraphs by two, in essences allowing twice as many images as paragraphs. This should be a better fit for blog posts with many images, such as recipes. anoras 2014-01-09 13:58:25 +0100