Commit Graph

  • d8595b7103 Quickfix for #41 0.3.0.2 Yuri Baburov 2013-10-10 13:47:58 +0700
  • 318f25c577 Minor fix in encoding guessing. Claiming it v0.3.0.1 Yuri Baburov 2013-10-10 02:57:53 +0700
  • 08658d1d31 Released v 0.3, and uploaded to the pypi. Yuri Baburov 2013-10-10 02:39:37 +0700
  • ba4cc34af6 Merge 6c0e08d686 into 4e3192f5ab Francis Tseng 2013-09-18 10:58:28 -0700
  • 6c0e08d686 gitignore cleanup Francis Tseng 2013-09-18 13:52:43 -0400
  • d58d563299 encoding regex fix Francis Tseng 2013-09-18 13:50:51 -0400
  • 0104d4ba87 python 3 port, retaining 2.7 compatibility Francis Tseng 2013-09-18 13:47:39 -0400
  • 1540b9690b Merge 25fe58ab0d into 4e3192f5ab Francis Tseng 2013-08-11 14:18:27 -0700
  • 25fe58ab0d py3 relative import fix Francis Tseng 2013-08-11 17:18:21 -0400
  • 0e33b26432 python 3 update Francis Tseng 2013-08-05 17:15:13 -0400
  • e93f694574 Merge 9c9b09e274 into 4e3192f5ab Miguel Galves 2013-05-24 07:55:02 -0700
  • 9c9b09e274 Retorna Galves, Miguel 2013-05-24 11:54:52 -0300
  • e5ca0a3545 REmocao Galves, Miguel 2013-05-24 11:53:12 -0300
  • e2fb87bfc7 Atualizações Galves, Miguel 2013-05-24 11:50:20 -0300
  • f127e87e11 Fixes get_title when title.text is Non (<title></title>) Galves, Miguel 2013-05-24 11:30:28 -0300
  • 4e3192f5ab Merge pull request #29 from hush-hush/master Yuri Baburov 2012-10-17 00:28:02 -0700
  • 36b0262e2b Merge e2e78e4d55 into c923995606 hush-hush 2012-10-17 00:27:47 -0700
  • e2e78e4d55 Make lxml clean tree available for user modifications. hush-hush 2012-09-13 17:09:14 +0200
  • c923995606 Merge pull request #27 from sunlightlabs/master Yuri Baburov 2012-08-29 20:11:04 -0700
  • 88f9855ec9 Merge fdba8d9e11 into 9cd5fb6226 dvogel 2012-08-28 11:07:16 -0700
  • fdba8d9e11 Added check on title.text to avoid a TypeError on None. Drew Vogel 2012-08-28 13:57:52 -0400
  • 77e9707f7f pulled from buriy Nathan Nifong 2012-07-18 10:48:37 -0700
  • e880edee71 re-introduced part of the code I removed that defined the title variable, oops Nathan Nifong 2012-07-17 16:37:25 -0700
  • fe3bd56b16 Removed offending code that broke short_title() Nathan Nifong 2012-07-17 16:29:41 -0700
  • 9cd5fb6226 Bump to 0.2.6.1 Yuri Baburov 2012-07-17 19:35:47 +0700
  • 44915518d3 Merge pull request #24 from zacharydenton/master Yuri Baburov 2012-07-17 05:32:23 -0700
  • 2cbb83a5b7 Merge 0843d9cdf2 into 8aefc6175f GitHub Merge Button 2012-07-09 12:12:16 -0700
  • 0843d9cdf2 Explicitly check if title is None. fixes #22 Zach Denton 2012-07-09 05:07:53 -0400
  • 8aefc6175f Updated README with 0.2.6 changes. Yuri Baburov 2012-06-21 22:00:47 +0700
  • 20d5f3a73a Bump to 0.2.6 Yuri Baburov 2012-06-21 21:42:57 +0700
  • 2e49e34e11 Merge pull request #20 from andreypopp/master Yuri Baburov 2012-06-07 03:10:16 -0700
  • 52665e7a8c Merge 95852d5c18 into 274b60cdb1 GitHub Merge Button 2012-06-07 03:09:06 -0700
  • 95852d5c18 readability.htmls: some docs do not have title elem Andrey Popp 2012-06-07 14:08:09 +0400
  • 274b60cdb1 Merge pull request #19 from EvaSDK/master Yuri Baburov 2012-06-04 01:09:29 -0700
  • 510dd7784d Merge ea6afd3d49 into a19e766900 GitHub Merge Button 2012-06-02 14:17:35 -0700
  • ea6afd3d49 Make sure code is actually distributed Gilles Dartiguelongue 2012-06-02 23:11:27 +0200
  • d708744822 Clean up tests/changes to merge into 0.3.0.dev 0.3.0.dev Richard Harding 2012-04-22 00:04:14 -0400
  • eefb8e1125 Implement duplicate page detection Jerry Charumilind 2011-07-22 11:34:24 -0700
  • c931a80ba8 Tweak tests post merging Richard Harding 2012-04-21 19:46:02 -0400
  • 883a02ad5d Add a regression for a multi-page nytimes article Jerry Charumilind 2011-07-21 14:41:01 -0700
  • cfc6f94634 Fix test for the multipage test with actual content Richard Harding 2012-04-21 19:33:48 -0400
  • 816c66482e Improve unit test for basic multi-page handling Jerry Charumilind 2011-07-21 10:38:10 -0700
  • 99d5fc0a87 Update for merge with Jerry Checkpoint multi-page readability work Richard Harding 2012-04-21 18:12:19 -0400
  • f02fe79840 Checkpoint multi-page readability work Jerry Charumilind 2011-07-21 09:56:04 -0700
  • 5cb4b8b8c0 Tweaks after the code reorg Richard Harding 2012-04-21 17:38:41 -0400
  • f8315d011c Checkpoint multi-page readability work Jerry Charumilind 2011-07-21 09:56:04 -0700
  • 99efa5c10b PEP8 again ... Richard Harding 2012-04-21 14:26:07 -0400
  • a012fd2362 urlfetch is in src Richard Harding 2012-04-21 14:22:27 -0400
  • 3fe416a5d1 Refactor code for easier testing Jerry Charumilind 2011-07-19 17:46:51 -0700
  • 8cadc4a958 Fix links in the regression test set Richard Harding 2012-04-21 13:34:27 -0400
  • 9765d13e90 Garden Richard Harding 2012-04-21 13:28:39 -0400
  • 32d1764e83 Add scoring of next page link ancestry and href Jerry Charumilind 2011-07-15 17:08:56 -0700
  • 0951647c8e Complete move from test_data/output to regression_test* Richard Harding 2012-04-21 13:13:26 -0400
  • ace51a6819 Combine our tests with the new regresssion_test stuff Richard Harding 2012-04-21 13:11:21 -0400
  • 2505c78e5b Jerry Merge: First working find_next_page_link case Jerry Charumilind 2011-07-15 16:31:18 -0700
  • edc0e4d4c6 Move tests to testfile Richard Harding 2012-04-21 12:51:29 -0400
  • 6abc6f7ef2 Add cleaning of short segments Jerry Charumilind 2011-07-14 11:15:36 -0700
  • 1e30e33302 Move the tests to the testfile Jerry Charumilind 2011-07-14 10:32:44 -0700
  • e8a6250605 Clean up merge, put tests in right place, adjust imports Richard Harding 2012-04-21 12:27:08 -0400
  • 62df35570d Checkpoint of multi-page article work Jerry Charumilind 2011-07-13 17:31:51 -0700
  • 29fceeb4b1 Fix regression to run with metadata Richard Harding 2012-04-21 12:14:26 -0400
  • 6f8184be27 Doh, move the tests to the right dir Richard Harding 2012-04-21 12:11:54 -0400
  • 9aef5e36b7 Move the test data into the tests/test_data dir Richard Harding 2012-04-21 12:10:47 -0400
  • 8988b6b767 Add comment for read_orig Jerry Charumilind 2011-07-13 10:40:35 -0700
  • 7d097d5f11 Add subcommand parsing to gen_test Jerry Charumilind 2011-07-12 17:37:25 -0700
  • b04f75239c Add option to not generate yaml file Jerry Charumilind 2011-07-12 16:32:11 -0700
  • c21f00b1ee Reorganize constants Jerry Charumilind 2011-07-12 14:48:39 -0700
  • 9fec245ae4 garden Richard Harding 2012-04-21 11:37:19 -0400
  • 6af808bc14 Add docstring briefly describing gen_test program Jerry Charumilind 2011-07-12 14:36:09 -0700
  • 7980ca84c9 Add regression tests for readability results Jerry Charumilind 2011-07-12 14:32:20 -0700
  • a700bb8bd4 Update makefile regression test helper to open html results Richard Harding 2012-04-21 11:22:19 -0400
  • bf203b5a4b Add summary page for test results Jerry Charumilind 2011-07-11 17:47:57 -0700
  • 65989b538a Remove obsolete code Jerry Charumilind 2011-07-11 17:46:19 -0700
  • 9b7e5bb327 Jerry Merge: Remove obsolete code Jerry Charumilind 2011-07-11 14:21:47 -0700
  • 068eba19ae Jerry Merge: Add reading of test information from YAML file Jerry Charumilind 2011-07-11 14:10:12 -0700
  • 6d3ad559f6 Move test_data, add regression_test make command Richard Harding 2012-04-21 11:06:57 -0400
  • 5222ed0628 Jerry Merge: Initial regression test data Jerry Charumilind 2011-07-08 16:18:38 -0700
  • 6454fb3f37 Clean up merge bits a little bit Richard Harding 2012-04-21 11:01:50 -0400
  • 9366436861 Merge Jerry: pull in initial set of regression tests Richard Harding 2012-04-21 10:55:16 -0400
  • 7dc373e9c5 Add the title and the short title to the metadata set. Richard Harding 2012-04-21 10:08:00 -0400
  • b1966df1c3 Fix docs for changed method Richard Harding 2012-04-21 08:17:04 -0400
  • 57694cb352 Remove the get_ in method name, doesn't fit rest of api Richard Harding 2012-04-21 08:16:18 -0400
  • b78d7e8501 Merge Jerry: pull in the ability to get back confidence score as well as the processed html Jerry Charumilind 2011-07-05 13:35:36 -0700
  • a2b17e757c Update readme for the build location Richard Harding 2012-04-20 06:38:21 -0400
  • 3347f16d93 Fix the flipped nature of the <html> wrapping setting Richard Harding 2012-04-20 06:33:42 -0400
  • 93ac1111a1 Add try it out to the readable server Richard Harding 2012-04-19 22:07:17 -0400
  • 08660f6f0c PEP8 linting, so so close Richard Harding 2012-04-19 16:16:02 -0400
  • 35792e7a59 garden Richard Harding 2012-04-19 16:03:07 -0400
  • aa51283dff work on doing some more pep8 work on things Richard Harding 2012-04-19 15:16:49 -0400
  • a4b6957be2 Update html to be a property with a getter Richard Harding 2012-04-18 22:50:07 -0400
  • b0063ffb3c garden docs Richard Harding 2012-04-18 21:43:14 -0400
  • 8091a75f00 fix my rst Richard Harding 2012-04-18 21:38:38 -0400
  • 8f420bd950 Fix setup.py to pull the rst readme Richard Harding 2012-04-18 21:35:58 -0400
  • 58c69651d3 Update README to be a rst file and clean up a little bit. Richard Harding 2012-04-18 21:31:42 -0400
  • 8b0210c4dc Add a license file Richard Harding 2012-04-18 21:31:11 -0400
  • 0f9da8ace4 Start some lower level unit tests Richard Harding 2012-04-18 21:01:51 -0400
  • dc86283d83 Add a sample articler tester and a nyt sample to process Richard Harding 2012-04-18 20:12:54 -0400
  • 2ee2fe9536 Throw some checking aroud the build_doc Richard Harding 2012-04-18 15:02:01 -0400
  • ac5ef73e71 Update cli client commands, add debugging to test server Richard Harding 2012-04-18 14:45:46 -0400
  • f5451356ee Make sure we update both version strings until we can figure out how to pull it into the setup.py by magic Richard Harding 2012-04-17 22:05:27 -0400