Commit Graph

  • 6a6ad7d2c3 [MRG] fix list index out of range and float division by zero bug chengguangbing 2020-12-16 18:11:06 +0800
  • ba5be43005
    Merge pull request #3 from anakin87/anakin87-patch-2 anakin87 2020-12-08 18:58:21 +0100
  • 5c3a686ebe
    Introduce Faq anakin87 2020-12-08 18:57:41 +0100
  • 644e17edec
    Merge pull request #2 from camelot-dev/master anakin87 2020-12-08 18:37:55 +0100
  • 2760ff845d
    Merge 6612a2f58d into 7709e58d64 Vinayak Mehta 2020-12-06 07:04:57 +0530
  • 6612a2f58d
    Add support for table areas and regions add-ocr Vinayak Mehta 2020-12-06 07:04:47 +0530
  • 1287abeeaf
    Remove test Vinayak Mehta 2020-12-06 05:50:26 +0530
  • eecc0df1ac
    Add support for spanning cells Vinayak Mehta 2020-12-06 05:48:23 +0530
  • 0d9edc0ff3
    Merge 477632bee8 into 7709e58d64 Aman Verma 2020-12-02 00:37:31 +0800
  • 0183f8f462
    Update validation list Vinayak Mehta 2020-11-18 18:51:09 +0530
  • 674b5f4336
    Add LatticeOCR Vinayak Mehta 2020-11-09 04:35:28 +0530
  • 48a3151723
    Merge 140aa0dae9 into 7709e58d64 rawsh-bt 2020-11-01 11:18:57 -0600
  • bd0894fdf5
    Merge 03a04d2f45 into 7709e58d64 jess 2020-10-28 13:46:24 +0000
  • 7709e58d64
    Merge pull request #206 from edugonza/fix-15 Vinayak Mehta 2020-10-28 14:44:35 +0530
  • 7695d35449 Fix #15 extraction of cell data discarding overlapping text boxes Eduardo Gonzalez Lopez de Murillas 2020-10-27 17:51:24 +0100
  • 692b8fcf57
    Merge 0962b8f4d4 into 8ca30f3a3c Jose Vargas 2020-10-25 23:33:15 +0900
  • 8ca30f3a3c
    Merge pull request #202 from tchx84/close-streams-explicitly Vinayak Mehta 2020-10-25 05:34:38 +0530
  • 9161ef3822 Update docs for process_color_background Evgeni Petrov 2020-10-23 09:32:27 +0300
  • 2ebd6073c0 Add saturation thresholding option Evgeni Petrov 2020-10-23 09:27:32 +0300
  • 13a50e2ba2 handlers: Close file streams explicitly Martin Abente Lahaye 2020-10-22 11:00:00 -0300
  • d17dc43ab2
    Merge pull request #196 from jimhall/gs-install-deps Vinayak Mehta 2020-10-18 03:14:24 +0530
  • de6faa7af1
    Add new checks Vinayak Mehta 2020-10-18 03:13:21 +0530
  • 468512a8cd Language added to confirm proper installation of ghostscript libraries Jim Hall 2020-10-08 08:21:34 -0400
  • 4edca28c53
    Merge branch 'master' of github.com:camelot-dev/camelot Vinayak Mehta 2020-09-08 00:35:57 +0530
  • 2a7a4f5b34
    Update README and index.rst Vinayak Mehta 2020-09-08 00:35:32 +0530
  • 0a3944e54d Add bug report template Vinayak Mehta 2020-09-07 23:39:49 +0530
  • 6b42094db5
    Update year Vinayak Mehta 2020-08-28 17:52:21 +0530
  • 937185412a
    Merge pull request #189 from camelot-dev/fix-179 Vinayak Mehta 2020-08-25 23:03:20 +0530
  • 5d20d56e48
    Prevent taking max of an empty set Vinayak Mehta 2020-08-25 22:50:31 +0530
  • 9087429501
    Merge pull request #188 from anakin87/master Vinayak Mehta 2020-08-25 19:16:50 +0530
  • cc905ff2d9
    Merge pull request #186 from pevisscher/patch-1 Vinayak Mehta 2020-08-25 19:14:58 +0530
  • eadc54ad25
    Merge pull request #1 from anakin87/anakin87-patch-1 anakin87 2020-08-25 15:28:48 +0200
  • 579bc16be5
    Update core.py anakin87 2020-08-25 15:27:17 +0200
  • fe13764026
    Prevent taking the max of an empty set Paul Visscher 2020-08-25 13:40:51 +0200
  • aae2c6b3d4
    use correct re.sub signature pevisscher 2020-08-24 16:51:06 +0200
  • dcae630351
    Merge ba1604ee40 into 705473198f Idan David 2020-08-16 10:17:34 +0530
  • 705473198f
    Merge pull request #121 from jedie/patch-2 Vinayak Mehta 2020-08-14 02:36:28 +0530
  • b741c0a9e9
    Check for none and return none Vinayak Mehta 2020-08-14 02:35:50 +0530
  • a6bee88053
    Merge pull request #119 from jedie/patch-1 Vinayak Mehta 2020-08-14 02:27:49 +0530
  • 1e050e1960
    Remove plt.show() usage Vinayak Mehta 2020-08-14 02:27:07 +0530
  • 28371817db
    Fix doc link Vinayak Mehta 2020-08-14 02:09:56 +0530
  • 7ab5db39d0
    Update .readthedocs.yml and remove requirements.txt Vinayak Mehta 2020-08-04 04:37:37 +0530
  • 9a5c4b6865
    Merge pull request #175 from camelot-dev/revert-0-8-1 v0.8.2 Vinayak Mehta 2020-07-27 17:56:48 +0530
  • fbe576ffcb
    Revert the changes in v0.8.1 Vinayak Mehta 2020-07-27 17:38:14 +0530
  • 140aa0dae9 Merge remote-tracking branch 'upstream/master' Robert Washboourne 2020-07-23 17:20:08 -0500
  • b8acb61be6 [ TextEdges ] allow single non-empty char textline Pushkar Nimkar 2020-04-11 15:18:21 +0530
  • fcad5067b9
    Fix failing test Vinayak Mehta 2020-07-23 00:54:41 +0530
  • 1b8ce1d560
    Bump requirement versions Vinayak Mehta 2020-07-23 00:39:53 +0530
  • 16beb15c43
    Bump version and update HISTORY.md v0.8.1 Vinayak Mehta 2020-07-21 21:48:29 +0530
  • be25e6dbdb
    Merge pull request #171 from camelot-dev/fix-169 Vinayak Mehta 2020-07-21 21:30:14 +0530
  • a13e2f6f1f
    Change error name and update pdfminer.six version Vinayak Mehta 2020-07-21 21:21:01 +0530
  • d392000a5f
    Merge 42f8321c8c into 4b08165328 FrancoisHuet 2020-07-21 13:15:53 +0200
  • 86b675e920
    Merge d34f8645f7 into 4b08165328 Fakabbir Amin 2020-07-20 23:41:21 +0000
  • 4b08165328
    Merge pull request #166 from stevestock/patch-1 Vinayak Mehta 2020-07-20 16:00:43 +0530
  • e5b143d9a8
    Update install instructions Vinayak Mehta 2020-07-20 15:59:42 +0530
  • 8e5a8e6712
    Update install.rst Steven Stockhamer 2020-07-19 20:44:16 -0400
  • 42f8321c8c Clean up notebooks, address review comments Frh 2020-07-03 18:28:24 -0700
  • 032dabde68 replace ghostscript with pdf2image Robert Washboourne 2020-06-26 15:04:52 -0500
  • 71805f9333 Fix issues following pass across most test cases Frh 2020-06-16 13:04:53 -0700
  • 9c971a18f0 Linting Frh 2020-06-14 12:36:24 -0700
  • 92322e1545 Address post-merge linting issues. Frh 2020-06-14 12:21:01 -0700
  • b43aca8ff5 Merge branch 'master' into hybrid-parser Frh 2020-06-14 08:53:43 -0700
  • 4fb1e93efd Bump dev libraries requirements to avoid conflicts Frh 2020-06-12 18:23:06 -0700
  • 4145361907 Merge branch 'hybrid-parser' of https://github.com/FrancoisHuet/camelot into hybrid-parser Frh 2020-06-12 17:32:39 -0700
  • 1813b80b8a Merge fix Frh 2020-06-12 17:12:24 -0700
  • 529ea36904 Updated comparison notebook Frh 2020-06-11 17:11:56 -0700
  • 9abdd00cec Enable process_background option for hybrid Frh 2020-05-08 15:08:12 -0700
  • 63adfd5468 Hybrid parser fixes Frh 2020-05-04 18:52:11 -0700
  • 7fae107560 Add baseline test for hybrid Frh 2020-05-04 17:41:57 -0700
  • 4a761611bf WIP: Introduce actual hybrid parser Frh 2020-05-04 16:27:01 -0700
  • edad1efd1b Rename WIP parser "network", actual Hybrid to come Frh 2020-05-02 16:14:03 -0700
  • 2867aecb5e Raise tolerance of plot differences Frh 2020-04-30 17:06:45 -0700
  • 9e385bf8fc Fix plotting unit tests Frh 2020-04-30 16:54:37 -0700
  • 4b3eee4b05 Linting Frh 2020-04-29 13:52:58 -0700
  • 55fd459634 Minor linting Frh 2020-04-29 12:31:02 -0700
  • ada4809a59 Improve column detection for hybrid flavor Frh 2020-04-29 11:46:40 -0700
  • e31e978ebe Fix off by one error in column identification Frh 2020-04-29 09:45:55 -0700
  • 21dc6a46a0 Improve hybrid table body discovery algo Frh 2020-04-28 22:43:55 -0700
  • a04e7702b2 Create notebook to help debug hybrid parser algo Plot vertical col anchors found by hybrid parser Include vertical text in col/row generation Frh 2020-04-28 12:26:12 -0700
  • 8f5e2bba4d Prep for vertical text improvements Frh 2020-04-28 11:46:12 -0700
  • e1572a10c9 Linting Frh 2020-04-25 22:47:23 -0700
  • f7aafcd05c Add parser comparizon notebook Frh 2020-04-25 21:55:21 -0700
  • 90f8d11d47 Add Parser comparison notebook to help visualizing Frh 2020-04-25 21:55:01 -0700
  • 15d99b1d00 Remove another f-string Frh 2020-04-25 21:33:15 -0700
  • 9eb4f65fc9 Remove f-strings, fix url based unit tests Frh 2020-04-25 21:14:56 -0700
  • 81de841ca0 Plot improvements, address 132 Frh 2020-04-25 20:51:00 -0700
  • dbaab66e43 Rename member for clarity, fixed unit test Frh 2020-04-25 17:15:16 -0700
  • a0e46916e2 Improve edgeplot for hybrid Frh 2020-04-25 13:31:10 -0700
  • c9a73a1ad7 Further refactoring Frh 2020-04-24 21:11:31 -0700
  • 18581640be Common parent TextBaseParser for Stream and Hybrid Frh 2020-04-24 15:54:58 -0700
  • a401d33fd9 Refactor out _text_bbox Frh 2020-04-24 15:18:38 -0700
  • 87d95a098c Further simplification Frh 2020-04-24 12:48:51 -0700
  • 22b6e33efa Enforce text_edge as subcase of text_alignment Frh 2020-04-24 12:42:13 -0700
  • 2d97fbc036 Define TextEdge as a bounded TextAlignment Frh 2020-04-23 18:26:55 -0700
  • 0b8aac977a Update test to reflect different order of edges Frh 2020-04-23 14:45:35 -0700
  • 8903ef77d4 More refactoring across stream and hybrid. Frh 2020-04-23 14:42:13 -0700
  • 92c8abdca3 Refactoring TextEdges code across hybrid and stream Frh 2020-04-23 12:55:09 -0700
  • 7ad5b843ab Move generic code to utils Frh 2020-04-22 19:08:06 -0700
  • 14cd328644 Refactor common code hybrid / stream Frh 2020-04-22 17:33:15 -0700
  • bfc2719aff Address last unit test Frh 2020-04-22 16:02:49 -0700