14 KiB
Executable File
Release History
master
Bugfixes
- #15 Fix duplicate strings being assigned to the same cell. #206 by Eduardo Gonzalez Lopez de Murillas.
- Save plot when filename is specified. #121 by Jens Diemer.
- Close file streams explicitly. #202 by Martin Abente Lahaye.
- Use correct re.sub signature. #186 by pevisscher.
- #183 Fix UnicodeEncodeError when using Stream flavor by adding encoding kwarg to
to_html. #188 by Stefano Fiorucci. - #179 Fix
max() arg is an empty sequenceerror on PDFs with blank pages. #189 by Vinayak Mehta.
Improvements
- Add bug report template.
- Move from Travis to GitHub Actions.
- Update
.readthedocs.ymland remove requirements.txt
Documentation
- #193 Add better checks to confirm proper installation of ghostscript. #196 by jimhall.
- Update
advanced.rstplotting examples. #119 by Jens Diemer.
0.8.2 (2020-07-27)
- Revert the changes in
0.8.1.
0.8.1 (2020-07-21)
Bugfixes
0.8.0 (2020-05-24)
Improvements
- Drop Python 2 support!
- Remove Python 2.7 and 3.5 support.
- Replace all instances of
.formatwith f-strings. - Remove all
__future__imports. - Fix HTTP 403 forbidden exception in read_pdf(url) and remove Python 2 urllib support.
- Fix test data.
Bugfixes
- Fix library discovery on Windows. #32 by KOLANICH.
- Fix calling convention of callback functions. #34 by KOLANICH.
0.7.3 (2019-07-07)
Improvements
Bugfixes
- Fix Click.HelpFormatter monkey-patch. #5 by Dimiter Naydenov.
- Fix strip_text argument getting ignored. #4 by Dimiter Naydenov.
- #25 edge_tol skipped in read_pdf. #26 by Vinayak Mehta.
- Fix pytest deprecation warning. #2 by Vinayak Mehta.
- #293 Split text ignores all text to the right of last cut. #294 by Vinayak Mehta.
- #277 Sort TableList by order of tables in PDF. #283 by Sym Roe.
- #312
table_regionsthrowsValueErrorwhenflavor='stream'. #332 by Vinayak Mehta.
0.7.2 (2019-01-10)
Bugfixes
0.7.1 (2019-01-06)
Bugfixes
- Move ghostscript import to inside the function so Anaconda builds don't fail.
0.7.0 (2019-01-05)
Improvements
- #240 Add support to analyze only certain page regions to look for tables. #243 by Vinayak Mehta.
- You can use
table_regionsinread_pdf()to specify approximate page regions which may contain tables. - Kwarg
line_size_scalingis now calledline_scale.
- You can use
- #212 Add support to export as sqlite database. #244 by Vinayak Mehta.
- #239 Raise warning if PDF is image-based. #240 by Vinayak Mehta.
Documentation
Note: The python wrapper to Ghostscript's C API is now vendorized under the ext module. This was done due to unavailability of the ghostscript package on Anaconda. The code should be removed after we submit a recipe for it to conda-forge. With this release, the user doesn't need to ensure that the Ghostscript executable is available on the PATH variable.
0.6.0 (2018-12-24)
Improvements
- #91 Add support to read from url. #236 by Vinayak Mehta.
- #229, #230 and #233 New configuration parameters. #234 by Vinayak Mehta.
strip_text: To define characters that should be stripped from each string.edge_tol: Tolerance parameter for extending textedges vertically.resolution: Resolution used for PDF to PNG conversion.- Check out the advanced docs for usage details.
- #170 Add option to pass pdfminer layout kwargs. #232 by Vinayak Mehta.
- Keyword arguments for pdfminer.layout.LAParams can now be passed using
layout_kwargsinread_pdf(). - The
marginskeyword argument inread_pdf()is now deprecated.
- Keyword arguments for pdfminer.layout.LAParams can now be passed using
0.5.0 (2018-12-13)
Improvements
- #207 Add a plot type for Stream text edges and detected table areas. #224 by Vinayak Mehta.
- #204
suppress_warningsis now calledsuppress_stdout. #225 by Vinayak Mehta.
Bugfixes
Documentation
- Add pdfplumber comparison and update Tabula (stream) comparison. Check out the wiki page.
0.4.1 (2018-12-05)
Bugfixes
- Add chardet to
install_requiresto fix #210. More details in pdfminer.six#213.
0.4.0 (2018-11-23)
Improvements
- #102 Detect tables automatically when Stream is used. #206 Add implementation of Anssi Nurminen's table detection algorithm by Vinayak Mehta.
0.3.2 (2018-11-04)
Improvements
- #186 Add
_bboxattribute to table. #193 by Vinayak Mehta.- You can use
table._bboxto get coordinates of the detected table.
- You can use
0.3.1 (2018-11-02)
Improvements
- Matplotlib is now an optional requirement. #190 by Vinayak Mehta.
- You can install it using
$ pip install camelot-py[plot].
- You can install it using
- #127 Add tests for plotting. Coverage is now at 87%! #179 by Suyash Behera.
0.3.0 (2018-10-28)
Improvements
- #162 Add password keyword argument. #180 by rbares.
- An encrypted PDF can now be decrypted by passing
password='<PASSWORD>'toread_pdfor--password <PASSWORD>to the command-line interface. (Limited encryption algorithm support from PyPDF2.)
- An encrypted PDF can now be decrypted by passing
- #139 Add suppress_warnings keyword argument. #155 by Jonathan Lloyd.
- Warnings raised by Camelot can now be suppressed by passing
suppress_warnings=Truetoread_pdfor--quietto the command-line interface.
- Warnings raised by Camelot can now be suppressed by passing
- #154 The CLI can now be run using
python -m. Trypython -m camelot --help. #159 by Parth P Panchal. - #165 Rename
table_areatotable_areas. #171 by Parth P Panchal.
Bugfixes
- Raise error if the ghostscript executable is not on the PATH variable. #166 by Vinayak Mehta.
- Convert filename to lowercase to check for PDF extension. #169 by Vinicius Mesel.
Files
- #114 Add Makefile and make codecov run only once. #132 by Vaibhav Mule.
- Add .editorconfig. #151 by KOLANICH.
- Downgrade numpy version from 1.15.2 to 1.13.3.
- Add requirements.txt for readthedocs.
Documentation
- Add "Using conda" section to installation instructions.
- Add readthedocs badge.
0.2.3 (2018-10-08)
- Remove hard dependencies on requirements versions.
0.2.2 (2018-10-08)
Bugfixes
- Move opencv-python to extra_requires. #134 by Vinayak Mehta.
0.2.1 (2018-10-05)
Bugfixes
Improvements
- #123 Make PEP8 compatible. #125 by Oshawk.
- #110 Add more tests. Coverage is now at 84%!
- Add tests for
__repr__. #128 by Vaibhav Mule. - Add tests for CLI. #122 by Vaibhav Mule and #117 by Vinayak Mehta.
- Add tests for errors/warnings. #113 by Vinayak Mehta.
- Add tests for output formats and parser kwargs. #126 by Vinayak Mehta.
- Add tests for
- Add Python 3.5 and 3.7 support. #119 by Vinayak Mehta.
- Add logging and warnings.
Documentation
- Copyedit all documentation. #112 by Christine Garcia.
- #115 Update issue labels in contributor's guide. #116 by Johnny Metz.
- Update installation instructions for Windows. #124 by Vinayak Mehta.
Note: This release also bumps the version for numpy from 1.13.3 to 1.15.2 and adds a MANIFEST.in. Also, openpyxl==2.5.8 is a new requirement and pytest-cov==2.6.0 is a new dev requirement.
0.2.0 (2018-09-28)
Improvements
0.1.2 (2018-09-25)
Improvements
- #85 Add Travis and Codecov.
0.1.1 (2018-09-24)
Documentation
- Add documentation fixes.
0.1.0 (2018-09-24)
- Rebirth!