Update README and requirements

pull/2/head
Vinayak Mehta 2018-09-02 19:04:24 +05:30
parent d65ee180e5
commit ae64264d3e
2 changed files with 13 additions and 17 deletions

View File

@ -55,11 +55,7 @@ See 'camelot <method> -h' for more information on a specific method.
Currently, camelot works under Python 2.7.
The required dependencies include [numpy](http://www.numpy.org/), [OpenCV](http://opencv.org/) and [ImageMagick](http://www.imagemagick.org/script/index.php).
### Optional
You'll need to install [Tesseract](https://github.com/tesseract-ocr/tesseract) if you want to extract tables from image based pdfs. Also, you'll need a tesseract language pack if your pdf isn't in english.
The required dependencies include [numpy](http://www.numpy.org/), [OpenCV](http://opencv.org/) and [ghostscript](https://www.ghostscript.com/).
## Installation
@ -71,26 +67,26 @@ pip install -U pip setuptools
### Installing dependencies
numpy can be install using `pip`. OpenCV and imagemagick can be installed using your system's default package manager.
numpy can be install using `pip`. OpenCV and ghostscript can be installed using your system's default package manager.
#### Linux
* Arch Linux
<pre>
sudo pacman -S opencv imagemagick
sudo pacman -S opencv ghostscript
</pre>
* Ubuntu
<pre>
sudo apt-get install libopencv-dev python-opencv imagemagick
sudo apt-get install libopencv-dev python-opencv python-tk ghostscript
</pre>
#### OS X
<pre>
brew install homebrew/science/opencv imagemagick
brew install homebrew/science/opencv ghostscript
</pre>
Finally, `cd` into the project directory and install by

View File

@ -1,8 +1,8 @@
docopt
matplotlib
nose
pdfminer
pyexcel-xlsx
Pillow
PyPDF2
Sphinx
docopt==0.6.2
matplotlib==2.2.3
nose==1.3.7
pdfminer==20140328
pyexcel-xlsx==0.5.6
Pillow==5.2.0
PyPDF2==1.26.0
Sphinx==1.8.0b1