Frh
49d3f0f3aa
Rename table_bbox (singular) to table_areas
...
The object is an index of bounding boxes, in some cases supplied by users.
It is already called areas in one section of the code; this rename makes the naming consistent.
2020-04-10 16:34:30 -07:00
Frh
270c76a3e7
Pylinting of data file
2020-04-10 16:16:37 -07:00
Frh
467c4a3de0
Moved duplicated common code to base objects
...
* Move table initialization common areas to BaseParser
* Stop relying on intermediate file name for source page index
* Create table comparison utility function to help in debugging
* Generate pdf as images in stream mode plots
* Fix pylint errors
2020-04-10 16:02:00 -07:00
Frh
dff9f5cd82
Further linting fixes flagged by DeepSource
2020-04-06 18:41:10 -07:00
Francois Huet
f1d4a32bc6
Merge branch 'streaming-improvements' of https://www.github.com/FrancoisHuet/camelot into streaming-improvements
2020-04-06 14:39:11 -07:00
Francois Huet
52e67a7f7c
Fix plotting unit tests impacted by Matplotlib
2020-04-06 14:38:47 -07:00
Francois Huet
b66ae79647
Minor commit to trigger code check
2020-04-06 13:23:24 -07:00
Francois Huet
f54e1563e1
Lint and address PDFMiner version impact on tests
2020-04-06 12:47:23 -07:00
Francois Huet
f0b2cffb17
Replace constant padding with expansion heuristic
...
Fixed all unit tests.
Removed constant padding added around tables in the last step of the
initial discovery mode of the stream algorithm.
Replaced it with a heuristic that attempts to expand the table up while
respecting columns identified so far.
Updated unit tests to reflect new behavior, improved rejection of
extraneous information in a few cases.
Added unit test covering a use case where the header has vertical text.
Made improvements to better support vertical text in tables.
2020-04-05 17:05:06 -07:00
Francois Huet
00d5d2ede4
[WIP] Remove heuristic of 5* row height
...
Removed the heuristic that pads height by 5x the row height.
Updated the 4 unit tests that got better results based on it.
Still to do: fix the 6 unit tests that got broken, plus my new target.
2020-04-04 14:09:12 -07:00
Francois Huet
912efd2c9b
Add failing test case for vertical headers
...
The unit test represents an issue I'm trying to address.
2020-04-04 13:45:16 -07:00
Francois Huet
fafdebb02e
Updated reference plots for unit tests to pass
...
Reviewed differences; all appear minor.
2020-04-04 13:39:26 -07:00
Francois Huet
0af212c483
Fix layout_kwargs unit test
2020-04-04 13:22:21 -07:00
Francois Huet
0f17658f48
Make unit test stream_split_text pass
...
TODO: the expectations of the test were and are still wrong.
It shouldn't include the header.
2020-04-04 12:44:51 -07:00
Francois Huet
73bedef7b7
Fix expectations for two tables unit test
...
Removed extraneous header and footer expectations.
Fixed a minor space discrepancy that's inconsequential.
TODO: the expectation of the test is still wrong. It shouldn't include the heading paragraph.
2020-04-04 12:38:45 -07:00
Francois Huet
50fea25567
Fix expectations for health pdf test.
...
What: Removed the page header from the test expectation.
Why: the page header isn't part of the table.
2020-04-04 12:19:36 -07:00
Francois Huet
394172bbe9
Ignore mac DS_Store files
2020-04-04 12:17:33 -07:00
Vinayak Mehta
7d4c9e53c6
Update README
2020-03-21 18:07:29 +05:30
Vinayak Mehta
44193e0d26
Add deepsource badge to docs
2019-12-24 13:08:46 +05:30
Vinayak Mehta
a9918a78cf
Add deepsource badge
2019-12-24 13:07:11 +05:30
Vinayak Mehta
47bb839d7a
Create .deepsource.toml
2019-12-24 13:03:45 +05:30
Vinayak Mehta
1b30f8ecf9
Merge pull request #94 from miltonArango/improving-coverage
...
[MRG] Unit tests for the version generation
2019-11-15 10:50:42 +05:30
Milton Arango
8e28a0cac0
Moved the version tests to test_common PR #94
...
Applied black formatting
2019-11-14 20:26:20 -05:00
Vinayak Mehta
eb2badbbd0
Merge pull request #91 from vasantvohra/patch-1
...
[MRG] Update how-it-works.rst
2019-11-15 03:36:02 +05:30
Milton Arango
0d1db4b09e
Unit Tests for the Version Generation
...
Unit tests for the __version__.py generate_version method.
2019-10-26 15:41:41 -05:00
Vasant Vohra
167ee9ac69
Update how-it-works.rst
...
minor typo
2019-10-17 14:04:18 +05:30
Vinayak Mehta
83f816f104
Merge pull request #48 from jnothman/assert_frame_equal
...
[MRG] Use assert_frame_equal for more informative errors in tests
2019-10-15 15:51:26 +05:30
Vinayak Mehta
f1879726d9
Merge pull request #86 from pravarag/add-opencollective
...
[MRG] Update python-tk link
2019-10-15 12:19:07 +05:30
Pravar Agrawal
56f3b54f62
[PyConIndia] Update python-tk link in README
2019-10-15 11:25:57 +05:30
Vinayak Mehta
11fadb16fd
Merge pull request #65 from pravarag/add-opencollective
...
[MRG] Add opencollective url
2019-10-14 23:58:12 +05:30
Vinayak Mehta
2340833bb8
Merge branch 'master' into add-opencollective
2019-10-14 23:57:40 +05:30
Vinayak Mehta
7ce4cb5050
Merge pull request #57 from dcorriveau/master
...
Update README.md
2019-10-14 23:56:40 +05:30
Vinayak Mehta
78e5dd1f4e
Fix #56
2019-10-14 23:55:51 +05:30
Vinayak Mehta
857f68ef6b
Update README.md
2019-10-14 23:52:38 +05:30
Vinayak Mehta
14c3cb49b9
Merge pull request #70 from kishvanchee/fix67
...
[MRG] Add 3.7 version to installation docs
2019-10-14 23:50:44 +05:30
Kishore Vancheeshwaran
a6d32ecddb
added 3.7 version to installation docs
2019-10-14 12:52:51 +05:30
Pravar Agrawal
e5e02401da
[DevSprint] convert camelot to uppercase in README
2019-10-14 12:51:09 +05:30
Pravar Agrawal
ea3eac3c40
[DevSprint] push new branch, fix typo in index.rst change
2019-10-14 12:33:13 +05:30
Pravar Agrawal
28a8112c6d
[DevSprint] Fix typo in index.rst for opencollective link
2019-10-14 12:25:17 +05:30
Pravar Agrawal
45384106c8
[DevSprint] Add opencollective link to README and Docs
2019-10-14 12:21:09 +05:30
Dylan Corriveau
81729f57cc
Update README.md
2019-10-13 16:00:04 -04:00
Joel Nothman
9eb15c09dc
Use assert_frame_equal for more informative errors in tests
2019-08-06 11:38:44 +10:00
Vinayak Mehta
7ecfcad239
Update HISTORY.md
2019-07-28 21:46:55 +10:00
Dimiter Naydenov
b2929a9e92
Merge pull request #34 from KOLANICH/win_ghostscript_callback_fix
...
Fixed calling convention of callback functions
2019-07-24 13:39:18 +03:00
Dimiter Naydenov
6d33c7ff1e
Merge pull request #32 from KOLANICH/ghostscript_discovery_win
...
Fixed library discovery on Windows
2019-07-24 13:38:51 +03:00
KOLANICH
5687fbc8b2
Fixed calling convention of callback functions
2019-07-16 21:08:34 +03:00
KOLANICH
9e356b1b0a
Fixed library discovery on Windows
2019-07-16 21:07:23 +03:00
Vinayak Mehta
f7b94b3e57
Add black badge to index.rst
2019-07-07 16:15:04 +05:30
Vinayak Mehta
0efb3ca1b0
Update HISTORY.md and bump version
2019-07-07 16:07:28 +05:30
Vinayak Mehta
098f7c6727
Merge pull request #26 from camelot-dev/fix-25
...
[MRG] Update flavor kwargs
2019-07-06 23:42:09 +05:30