Commit Graph

225 Commits (4dd1e7fb15f768adf3c872197885cd93d915483d)

Author SHA1 Message Date
Vinayak Mehta 4dd1e7fb15
Call pdftopng in subprocess 2021-07-04 18:52:38 +05:30
Vinayak Mehta 36dcfe99d8
Split tests for lattice and stream, and fix test_common reprs 2021-06-28 05:32:48 +05:30
Vinayak Mehta 3ddc02b2f2
Fix fallback key 2021-06-28 03:24:15 +05:30
Vinayak Mehta 0c5aff60b4
Remove newline
Delete cache
2021-06-28 03:19:35 +05:30
Vinayak Mehta 3e4e848a09
Add fallbacks to image conversion 2021-06-28 03:16:54 +05:30
Vinayak Mehta a96702987f
Raise error if ghostscript not installed 2021-06-28 02:20:44 +05:30
Vinayak Mehta 4cebd684ba
Remove ext.ghostscript 2021-06-28 02:06:16 +05:30
Vinayak Mehta 8563a09544
Add image conversion backends 2021-06-28 01:58:45 +05:30
Vinayak Mehta fdade4502e
Fix pdftopng usage
Delete cache
2021-06-28 01:10:16 +05:30
Vinayak Mehta 57a26fabcb
Add pdftopng 2021-06-28 00:47:05 +05:30
Vinayak Mehta acb8f005c2
Merge branch 'master' into format-markdown 2021-06-28 00:32:00 +05:30
Vinayak Mehta c647f573d8
Bump version 2021-06-15 03:58:30 +05:30
Vinayak Mehta ec21904595
Merge pull request #219 from Arnie97/master
[MRG] Add line_overlap and boxes_flow to LAParams
2021-06-15 03:53:40 +05:30
Tiago Samaha Cordeiro 3a8f988740
use resolution argument to generate image with GS 2021-06-15 03:46:07 +05:30
Vinayak Mehta 2c59e7b0f7
Blacken code 2021-06-15 03:29:35 +05:30
Lucas Cimon 955e4b62d0
New export format: markdown 2021-01-13 06:31:30 +01:00
Arnie97 0dee385578 Add line_overlap and boxes_flow to LAParams 2020-12-17 22:12:24 +08:00
Eduardo Gonzalez Lopez de Murillas 7695d35449 Fix #15 extraction of cell data discarding overlapping text boxes 2020-10-27 18:06:57 +01:00
Martin Abente Lahaye 13a50e2ba2 handlers: Close file streams explicitly
No harm in closing these streams explicitly. Best case
scenario, this prevents descriptors leaks, worse case
scenario, it reduces the amount of  messages like the
following during tests:

ResourceWarning: unclosed file
2020-10-22 11:43:01 -03:00
Vinayak Mehta 5d20d56e48
Prevent taking max of an empty set 2020-08-25 22:50:31 +05:30
Vinayak Mehta 9087429501
Merge pull request #188 from anakin87/master
[MRG] Add encoding kwarg to camelot.core.Table.to_html method
2020-08-25 19:16:50 +05:30
anakin87 579bc16be5
Update core.py
Correct method camelot.core.Table.to_html
2020-08-25 15:27:17 +02:00
pevisscher aae2c6b3d4
use correct re.sub signature
`text_strip` currently passes the regex flags as the count parameters, which is hardcoded to `re.UNICODE` (value 32), and thus only replaces the first 32 values.

see https://docs.python.org/3/library/re.html#re.sub for the signature
2020-08-24 16:51:06 +02:00
Vinayak Mehta 705473198f
Merge pull request #121 from jedie/patch-2
[MRG] Save plot when filename is specified
2020-08-14 02:36:28 +05:30
Vinayak Mehta b741c0a9e9
Check for none and return none 2020-08-14 02:35:50 +05:30
Vinayak Mehta fbe576ffcb
Revert the changes in v0.8.1 2020-07-27 17:38:14 +05:30
Vinayak Mehta 16beb15c43
Bump version and update HISTORY.md 2020-07-21 21:48:29 +05:30
Vinayak Mehta a13e2f6f1f
Change error name and update pdfminer.six version 2020-07-21 21:21:01 +05:30
Vinayak Mehta d5d6a5962b
Bump version and update HISTORY.md 2020-05-24 18:36:13 +05:30
Vinayak Mehta a22fa63c4e
Fix syntax errors 2020-05-24 18:19:48 +05:30
Vinayak Mehta 52b2a595b4
Add f-strings and remove python3.5 test job 2020-05-24 18:14:43 +05:30
Vinayak Mehta f725f04223
Remove future imports 2020-05-24 17:33:13 +05:30
Vinayak Mehta 3afb72b872
Fix read_pdf(url) and test data 2020-05-24 17:26:52 +05:30
Jens Diemer f8b6181988
Fix #120 - Save plot 2020-03-15 13:20:27 +01:00
Dimiter Naydenov b2929a9e92
Merge pull request #34 from KOLANICH/win_ghostscript_callback_fix
Fixed calling convention of callback functions
2019-07-24 13:39:18 +03:00
KOLANICH 5687fbc8b2 Fixed calling convention of callback functions 2019-07-16 21:08:34 +03:00
KOLANICH 9e356b1b0a Fixed library discovery on Windows 2019-07-16 21:07:23 +03:00
Vinayak Mehta 0efb3ca1b0 Update HISTORY.md and bump version 2019-07-07 16:07:28 +05:30
Vinayak Mehta a97b50ef21 Update flavor kwargs 2019-07-06 22:59:51 +05:30
Dimiter Naydenov 0f8cda4793
Merge pull request #5 from camelot-dev/fix-cli-group-name
[MRG] No need to monkey-patch Click.HelpFormatter
2019-07-04 18:26:35 +03:00
Dimiter Naydenov 13616c2fb4 No need to monkey-patch Click.HelpFormatter 2019-07-04 13:13:32 +03:00
Dimiter Naydenov 240ea6c411 Fixed strip_text argument getting ignored 2019-07-04 12:12:52 +03:00
Vinayak Mehta 16ddd10644
Update image_processing.py 2019-07-04 00:06:46 +05:30
Vinayak Mehta 2115a0e177 Blacken code 2019-07-03 23:47:42 +05:30
Vinayak Mehta de3281c1b6 Add test 2019-05-27 22:18:23 +05:30
Vinayak Mehta b2a8348f13 Fix #312 2019-05-26 17:13:59 +05:30
Vinayak Mehta 355ae818a0
Merge branch 'master' into fix-split-bug 2019-04-20 21:06:47 +05:30
Vinayak Mehta ce727d9558 Fix split text bug 2019-03-22 02:28:29 +05:30
Sym Roe 8446271aa4
Always sort TableList after reading PDF 2019-02-25 09:48:47 +00:00
Sym Roe c019e582bf
Add __lt__ to Table to allow sorting
Refs #277
2019-02-25 09:20:09 +00:00