Commit Graph

340 Commits (2dc48f43d6e1a86a2716f7d2cbd6703e9a6a539f)

Author SHA1 Message Date
Vinayak Mehta 52a2876ab1 Fix tarea type conversion 2016-10-04 19:57:53 +05:30
Vinayak Mehta 4b8e96a86a Update docs
* Update README

* Update index.rst

* Update docstrings

* Fix typo

* Edit docs

* Add error messages
2016-10-04 17:50:48 +05:30
Vinayak Mehta d46eeeab1a Change jpg to png 2016-09-27 18:37:38 +05:30
Vinayak Mehta 75c7deffaa Minor Stream fix 2016-09-27 17:27:34 +05:30
Vinayak Mehta 79afb45e2e Support for vertical tables in Stream
* Change var names

* Add test pdf

* Add tests for Lattice rotation

* Add support for vertical tables in Stream, test pdfs

* Add tests for Stream rotation
2016-09-15 20:51:59 +05:30
Vinayak Mehta 8ce7b74671 Replace imagemagick with ghostscript
* Replace imagemagick with ghostscript

* Add quiet option

* Avoid repetition

* Remove Wand requirement

* Replace jpeg with png
2016-09-13 17:35:07 +05:30
Vinayak Mehta 757ba0444a Remove jtol 2016-09-13 17:28:21 +05:30
Vinayak Mehta 439059817d Update tests with new API
* Update Lattice tests with new API

* Update Stream tests with new API, fix CLI

* Add table_area test, Stream fixes
2016-09-09 16:56:25 +05:30
Vinayak Mehta a94c350a7b Fix param flow
* Fix param flow

* Add check for None
2016-09-09 14:52:38 +05:30
Vinayak Mehta 766260d5d9 Remove hybrid.py 2016-09-08 21:17:24 +05:30
Vinayak Mehta 98f47d1bd7 Fix table_bbox when no tarea is given 2016-09-05 21:26:16 +05:30
Vinayak Mehta d86630e70b Add table_area
[MRG] Add table_area
2016-09-05 18:51:59 +05:30
Vinayak Mehta 0bb6ce0bf9 CLI debug fix 2016-09-01 02:16:58 +05:30
Vinayak Mehta b2dd5f68fe Fix vertical text detection in cells
* Fix vertical text detection in cells

* Add Cell instance method

* Change var names
2016-09-01 01:42:27 +05:30
Vinayak Mehta 8d56f15130 Add negative tolerance 2016-08-31 22:25:33 +05:30
Vinayak Mehta 2a55621d05 Fix magic grid extension 2016-08-31 21:06:41 +05:30
Vinayak Mehta 552f9cf422 Add various metrics to score the quality of a parse
Add various metrics to score the quality of a parse
2016-08-30 14:52:49 +05:30
Vinayak Mehta 43a009dab4 Add flow images 2016-08-24 16:53:03 +05:30
Vinayak Mehta d834faeac8 Fix README
Fix README
2016-08-09 18:36:43 +05:30
Vinayak Mehta 7e5804f87d Adds documentation
[MRG] Adds documentation
2016-08-09 17:23:50 +05:30
Vinayak Mehta dda809b286 Fix Makefile spaces to tabs 2016-08-08 17:26:54 +05:30
Vinayak Mehta 8ff04391b7 Add coveragerc and update Makefile 2016-08-08 17:24:13 +05:30
Vinayak Mehta 814d7b6939 Add Makefile 2016-08-08 16:32:05 +05:30
Vinayak Mehta 13568865b5 Add verbose 2016-08-03 13:14:19 +05:30
Vinayak Mehta 57917426e8 Fix docstrings 2016-08-03 13:14:11 +05:30
Vinayak Mehta 050107b63d Minor fix 2016-07-29 21:47:20 +05:30
Vinayak Mehta e9602bb353 Create python package
Add version support

Add new test file

[RFC] First phase

[RFC] Second phase

[RFC] Third phase

Add logging

Update README

Add debug

Add debug, fixes

Add pep8 changes

Add fix

Rename CLI tool

Add csv fix

Update README

Add fix for numpages

Update README

Update requirements.txt

Use yield

Add tuple unpacking fix

Fix n00b mistake

Add check for None

Fix check for None

Fix unicode

Add relative imports
2016-07-29 21:09:39 +05:30
Vinayak Mehta c612692c42 Remove pdfseparate subprocess call 2016-07-20 16:26:42 +05:30
Vinayak Mehta 85ffb00239 Remove imagemagick subprocess call 2016-07-20 15:40:01 +05:30
Vinayak Mehta 7aebcee7e3 Minor fix 2016-07-20 14:21:56 +05:30
Vinayak Mehta d350dc0bdb Improve log flow 2016-07-19 22:17:53 +05:30
Vinayak Mehta 271d4cafd6 Modify command line tool
Precompute globs

Replace argparse with docopt

Fix CLI

Update .gitignore

Add docstrings

Update README

Fix typo

Replace zip subprocess call

Use tempfile

Fix newline
2016-07-19 17:02:07 +05:30
Vinayak Mehta 3045a92630 Add support for pdfminer LAParams 2016-07-19 17:02:07 +05:30
Vinayak Mehta 2ef3cc7651 Update README 2016-07-19 17:02:07 +05:30
Vinayak Mehta cef1764f5b Update README 2016-07-19 17:02:07 +05:30
Vinayak Mehta 0845e33064 Improve README 2016-07-19 17:02:07 +05:30
Vinayak Mehta b87d2350dc Make code PEP8 compliant 2016-07-19 17:02:07 +05:30
Vinayak Mehta f6869a9af4 Improve grid detection and add more options 2016-07-19 17:02:03 +05:30
Vinayak Mehta 47da8606a6 Remove .pyc files 2016-07-19 17:02:03 +05:30
Vinayak Mehta eef07a86c6 First commit 🔥 2016-07-19 17:02:03 +05:30