d5af8e91a9
No more stderr logging; only raise an exception.
George Alegre
2016-12-13 19:00:46 -0300
86e5a3f3d2
Update comment example
Vahid Kharazi
2016-08-09 06:50:03 +0430
e4efc87a20
Update readability.py
Yuri Baburov
2016-07-19 17:30:23 +0600
b20d5c15ef
Improved Document class documentation
Yuri Baburov
2016-07-19 17:25:52 +0600
08f0fe6482
Merge cc5f0cbe9e into b6e5921f27
eromoe
2016-07-18 14:17:29 +0000
b6e5921f27
Merge pull request #85 from lwm/add-badge
Yuri Baburov
2016-07-15 21:16:38 +0700
aafcf52e58
add travis badge
Luke Murphy
2016-07-15 16:15:06 +0200
5337adc590
Merge pull request #82 from lwm/fixup-readme
Yuri Baburov
2016-07-15 21:11:51 +0700
86cde20ba6
Merge pull request #84 from lwm/add-travis-file
Yuri Baburov
2016-07-15 21:10:40 +0700
a1d6bbcd3f
add travis file
Luke Murphy
2016-07-14 22:00:40 +0200
82837e4b5c
makeover for the README [ci skip]
Luke Murphy
2016-07-11 16:13:27 +0200
cc5f0cbe9e
change to default td
Mithril
2016-07-04 16:27:54 +0800
58bf7188b6
calculate node score from deep to shallow
Mithril
2016-07-04 15:51:29 +0800
cfc9d00783
Fixed an issue with passing a list of keywords in string format.
Zamubafoo
2016-06-02 22:27:40 -0400
75f2ea0d00
Version bump to 0.6.2
Yuri Baburov
2016-04-09 14:34:00 +0600
47e473fb91
Merge pull request #73 from alphapapa/patch-1
Yuri Baburov
2016-04-09 14:29:07 +0600
8443a87f5c
Update readability.py
alphapapa
2016-04-03 21:38:17 -0500
5fc2d3684a
Use Mozilla User-Agent
alphapapa
2016-04-03 21:32:36 -0500
65d1ebb06d
Fixed #70 and added xpath option
Yuri Baburov
2015-09-29 18:36:25 +0200
fae95bad45
Bump to v0.6.1 -- fixed logging
0.6.1
Yuri Baburov
2015-08-26 16:14:04 +0600
c0d794fdd8
Update readability.py
Yuri Baburov
2015-08-26 15:11:12 +0500
398f6ad748
Bump to 0.6.0.5
0.6.0.5
Yuri Baburov
2015-07-27 12:00:58 +0600
8ff11e68a6
Debugging improvements. Bump to 0.6.0.5
Yuri Baburov
2015-07-27 11:59:17 +0600
fcdbe563a5
Fixed #49. Bump to 0.6.0.4
Yuri Baburov
2015-07-27 10:06:28 +0600
c51886d923
Bump to 0.6.0.3 Fixed installation.
0.6.0.3
Yuri Baburov
2015-07-27 09:38:44 +0600
24bb20c761
Added dev branch features. Bumped to version 0.6
0.6
Yuri Baburov
2015-07-26 19:29:50 +0600
e8f86bdcf9
Several updates from dev version.
dev
Yuri Baburov
2015-07-26 15:17:43 +0600
154658798b
Merge pull request #64 from martinth/master
Yuri Baburov
2015-07-26 14:11:37 +0500
13c6b4cf0e
Merge e634273705 into 83a7ce67c1
dlarochelle
2015-07-24 11:40:05 +0000
83a7ce67c1
Merge pull request #68 from digitaldavenyc/python3
Yuri Baburov
2015-07-24 11:47:41 +0500
1ac3e019bd
fix for setup, convert print to python 3 compatible format
Dave Padovano
2015-07-23 14:05:12 -0400
1aabdb3d27
Merge pull request #67 from horva/fix-logging-config
Yuri Baburov
2015-07-13 09:10:35 +0500
f0ff9b2425
Move logging.basicConfig to main function
Marko Horvatic
2015-06-24 16:21:04 +0200
e2bc1ea055
Improved #65, which gave a warning; added cssselect lib; bumped to 0.5.1
Yuri Baburov
2015-05-06 14:33:14 +0600
77f8e3d5ed
Merge 2686e3df7c into 1cb17d919b
Asad Dhamani
2015-05-06 07:47:07 +0000
1cb17d919b
Merge pull request #65 from avalanchy/best_elem_is_root
Yuri Baburov
2015-05-06 12:44:41 +0500
bf9e7404fa
Failure if best_elem is root (fix #58)
Mariusz Osiecki
2015-05-06 09:34:55 +0200
386e48d29b
Fixes checking of declared encodings in get_encoding.
Martin Thurau
2015-04-30 11:47:32 +0200
046d2c10c3
Fixes regex declaration in get_encoding.
Martin Thurau
2015-04-29 23:36:50 +0200
ce7ca26835
Adds compatibility `raise_with_traceback` method to support different `raise` syntax
Martin Thurau
2015-04-29 23:35:18 +0200
3ac56329e2
Corrects some things where 2to3 did too much.
Martin Thurau
2015-04-29 19:33:43 +0200
aa4132f57a
Adds Python 3.4 support.
Martin Thurau
2015-04-29 16:18:21 +0200
13cca1dd19
Adds tox configuration.
Martin Thurau
2015-04-29 14:55:03 +0200
1d4ee9d421
Releasing as version 0.5
Yuri Baburov
2015-04-27 16:00:08 +0600
987570bef0
Updated package links for Python 2.7 and Python 3 support
Yuri Baburov
2015-04-27 15:59:31 +0600
dc648e7d0b
Added a test for issue #48 but can't reproduce it -- seems to work fine.
Yuri Baburov
2015-04-27 15:58:39 +0600
c715426584
Releasing as version 0.4
Yuri Baburov
2015-04-27 14:54:13 +0600
1fac7e685a
Added a feature to allow more images per article (with a test)
Yuri Baburov
2015-04-27 14:35:00 +0600
c6796195a7
Fixed makefile testing.
Yuri Baburov
2015-04-27 14:32:40 +0600
d04d41b749
Insert text inside iframe for correct output
Miguel Galves
2014-02-07 18:27:15 -0200
be2a1c4646
Let width and height attributes
Miguel Galves
2014-02-07 18:26:47 -0200
f1759c1404
Allows iframes containing youtube or vimeo videos. People like them
Miguel Galves
2014-02-07 18:04:15 -0200
40e430c27d
Makefile updates
Yuri Baburov
2015-04-27 13:10:36 +0600
0a082ff020
Fix for Mac OS X 10.10
Yuri Baburov
2015-04-27 13:10:21 +0600
8048160d66
WIP: update to support python2 and python3
Yuri Baburov
2015-04-19 15:05:50 +0600
2686e3df7c
Switch to UnicodeDammit for encoding detection
Asad Dhamani
2015-04-04 07:31:58 +0530
2abc32d3d2
Merge 5a4974496f into 332ad810de
Anders Norås
2015-03-25 22:29:19 +0000
d203b65486
Merge 6a1c779edd into 332ad810de
Alexander Sapronov
2015-03-17 07:02:25 +0000
332ad810de
Bumped to 0.3.0.6
0.3.0.6
Yuri Baburov
2015-03-16 21:38:17 +0500
e4bcbe57d7
Fixes #53
Yuri Baburov
2015-03-16 22:19:36 +0600
6a1c779edd
Fix. Add tox
warmonger
2015-02-23 20:59:29 +0600
68be580610
Add parentheses for `print` to make it Python 3 compatible
Sumin Byeon
2015-02-15 22:38:44 +0900
297b8cc6a3
refactoring
warmonger
2015-02-12 17:21:34 +0600
2ceec5d140
refactoring
warmonger
2015-02-12 17:19:48 +0600
950366ac71
refactoring
warmonger
2015-02-12 17:16:53 +0600
6f1b969f21
refactoring
warmonger
2015-02-12 17:10:58 +0600
3002ff5dc8
refactoring
warmonger
2015-02-12 17:08:28 +0600
057868406a
refactoring
warmonger
2015-02-12 17:04:23 +0600
b1b1e99abc
autopep8, + .travis
warmonger
2015-02-12 17:03:14 +0600
e634273705
Parameterize numerous hard coded numbers.
David Larochelle
2015-01-23 11:16:34 -0500
ea28266265
Parameterize CONTENT_SCORE_DIV_BONUS, CONTENT_SCORE_PRE_TD_BONUS, CONTENT_SCORE_ADDRESS_OL_PENALTY, CONTENT_SCORE_HEADER_PENALTY
David Larochelle
2015-01-21 16:46:44 -0500
1923f4d1a7
Parameterize SIBLING_CONTENT_LENGTH_SUM
David Larochelle
2015-01-21 16:30:33 -0500
0956179894
Parameterize MIN_EMBED_COMMENT_LENGTH
David Larochelle
2015-01-21 15:07:10 -0500
0242c2a5b9
Replace hard-coded constants with class variables for LOW_WEIGHT_LINK_DENSITY_THRESHOLD and HIGH_WEIGHT_LINK_DENSITY_THRESHOLD
David Larochelle
2015-01-15 15:38:29 -0500
15703186d5
Replace hard-coded constants with class variables for P_TO_INPUT_RATIO and HEADER_LINK_DENSITY_THRESHOLD
David Larochelle
2015-01-15 15:31:01 -0500
e69c1d09a2
Replace hard-coded constant with a class variable for COMMA_COUNT
David Larochelle
2015-01-15 15:00:14 -0500
f1386e2805
Replace hard-coded constants with class variables for LONG_NODE_LINK_DENSITY_THRESHOLD and LONG_NODE_LENGTH
David Larochelle
2015-01-15 14:46:27 -0500
74ff927fbc
Replace hard coded constants with class variables for MIN_SIBLING_SCORE_THRESHOLD and BEST_SCORE_MULTIPLIER_THRESHOLD
David Larochelle
2015-01-15 14:37:18 -0500
287f902d34
Merge a355c6ea72 into aeb4f4c782
Telofy
2015-01-14 06:10:46 +0000
e3e02393e7
Make sure the result is deterministic if 2 candidates have the same content_score.
David Larochelle
2015-01-13 13:44:35 -0500
aeb4f4c782
Merge pull request #59 from seomoz/mac_10_10
Yuri Baburov
2015-01-13 17:41:30 +0500
c8c2f8809c
Fix mac version comparison in setup.py for 10.10
Matthew Peters
2015-01-12 22:19:09 -0800
2d4cfdb2c8
Merge pull request #56 from nathanathan/patch-1
Yuri Baburov
2014-12-20 02:11:53 +0500
75e2e0cb3a
Defaulting to utf-8 when chardet returns None
Nathan Breit
2014-12-18 18:48:22 -0800
71294f094f
Encoding improvements
Yuri Baburov
2014-12-05 23:10:10 +0100
a355c6ea72
Resolved problem with title.text being None
Telofy
2014-11-06 14:51:04 +0100
28923e8a73
Merge 44d61e557f into 0c2f29ed0d
palkeo
2014-09-27 22:43:33 +0000
0c2f29ed0d
Version bump.
Yuri Baburov
2014-09-22 15:32:46 +0700
638f73f6a2
Fix for #52: <input type="hidden"> elements are no longer counted by the "form removal" heuristic.
Yuri Baburov
2014-09-22 15:31:31 +0700
5855beb32a
WIP; Backported features from stable branch
Yuri Baburov
2014-09-22 15:29:49 +0700
ae1f1adfff
Switched to the Python logging module. Added xpath option (not yet documented).
Yuri Baburov
2014-08-25 09:36:40 +0700
44d61e557f
Python3 port.
palkeo
2014-07-16 22:42:15 -0300
2fab5ffa6b
Merge pull request #48 from mperdomo1/master
Yuri Baburov
2014-05-18 15:27:06 +0700
3a43a3fe7e
Added code to check declared encodings first; the checks come from kennethreitz/requests/utils.py. Also added some superset encodings found in Chinese pages that are mishandled by chardet/character declarations.
Mark Perdomo
2014-05-13 15:09:47 +0800
1a4d3697bc
Allow latest lxml on Mac OS X 10.9, see issue #39 for comments and setup instructions
0.3.0.3
Yuri Baburov
2014-04-02 15:16:19 +0700
f7704e3e0e
Insert text inside iframe for correct output
Miguel Galves
2014-02-07 18:27:15 -0200
0c6ed25028
Let width and height attributes
Miguel Galves
2014-02-07 18:26:47 -0200
8d12e33281
Allows iframes containing youtube or vimeo videos. People like them
Miguel Galves
2014-02-07 18:04:15 -0200
c410eb110d
Ignores .Python dir created by virtualenv
Miguel Galves
2014-02-07 18:00:21 -0200
5a4974496f
Increased the number of images allowed relative to the number of paragraphs. The new algorithm multiplies the number of paragraphs by two, in essence allowing twice as many images as paragraphs. This should be a better fit for blog posts with many images, such as recipes.
anoras
2014-01-09 13:58:25 +0100