From b8c55383eab4257efd2500ff4a40a6493e167241 Mon Sep 17 00:00:00 2001 From: Vinayak Mehta Date: Sat, 6 Jul 2019 04:28:32 +0530 Subject: [PATCH] Update docs --- CONTRIBUTING.md | 6 +++--- README.md | 12 ++++++------ docs/_templates/sidebarintro.html | 8 ++++---- docs/_templates/sidebarlogo.html | 4 ++-- docs/conf.py | 4 ++-- docs/dev/contributing.rst | 8 ++++---- docs/index.rst | 10 +++++----- docs/user/advanced.rst | 12 ++++++------ docs/user/install.rst | 2 +- docs/user/intro.rst | 2 +- 10 files changed, 34 insertions(+), 34 deletions(-) diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md index a5937d6..115ee21 100644 --- a/CONTRIBUTING.md +++ b/CONTRIBUTING.md @@ -16,7 +16,7 @@ As the [Requests Code Of Conduct](http://docs.python-requests.org/en/master/dev/ ## Your first contribution -A great way to start contributing to Camelot is to pick an issue tagged with the [help wanted](https://github.com/socialcopsdev/camelot/labels/help%20wanted) tag or the [good first issue](https://github.com/socialcopsdev/camelot/labels/good%20first%20issue) tag. If you're unable to find a good first issue, feel free to contact the maintainer. +A great way to start contributing to Camelot is to pick an issue tagged with the [help wanted](https://github.com/camelot-dev/camelot/labels/help%20wanted) tag or the [good first issue](https://github.com/camelot-dev/camelot/labels/good%20first%20issue) tag. If you're unable to find a good first issue, feel free to contact the maintainer. ## Setting up a development environment @@ -36,7 +36,7 @@ $ pip install ".[dev]" ### Submit a pull request -The preferred workflow for contributing to Camelot is to fork the [project repository](https://github.com/socialcopsdev/camelot) on GitHub, clone, develop on a branch and then finally submit a pull request. Here are the steps: +The preferred workflow for contributing to Camelot is to fork the [project repository](https://github.com/camelot-dev/camelot) on GitHub, clone, develop on a branch and then finally submit a pull request. Here are the steps: 1. Fork the project repository. Click on the ‘Fork’ button near the top of the page. This creates a copy of the code under your account on the GitHub. @@ -106,7 +106,7 @@ The function docstrings are written using the [numpydoc](https://numpydoc.readth ## Filing Issues -We use [GitHub issues](https://github.com/socialcopsdev/camelot/issues) to keep track of all issues and pull requests. Before opening an issue (which asks a question or reports a bug), please use GitHub search to look for existing issues (both open and closed) that may be similar. +We use [GitHub issues](https://github.com/camelot-dev/camelot/issues) to keep track of all issues and pull requests. Before opening an issue (which asks a question or reports a bug), please use GitHub search to look for existing issues (both open and closed) that may be similar. ### Questions diff --git a/README.md b/README.md index 6ee15fa..4ecb665 100644 --- a/README.md +++ b/README.md @@ -16,7 +16,7 @@ --- -**Here's how you can extract tables from PDF files.** Check out the PDF used in this example [here](https://github.com/atlanhq/camelot/blob/master/docs/_static/pdf/foo.pdf). +**Here's how you can extract tables from PDF files.** Check out the PDF used in this example [here](https://github.com/camelot-dev/camelot/blob/master/docs/_static/pdf/foo.pdf).
 >>> import camelot
@@ -57,7 +57,7 @@ There's a [command-line interface](https://camelot-py.readthedocs.io/en/master/u
 - Each table is a **pandas DataFrame**, which seamlessly integrates into [ETL and data analysis workflows](https://gist.github.com/vinayak-mehta/e5949f7c2410a0e12f25d3682dc9e873).
 - **Export** to multiple formats, including JSON, Excel, HTML and Sqlite.
 
-See [comparison with other PDF table extraction libraries and tools](https://github.com/atlanhq/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools).
+See [comparison with other PDF table extraction libraries and tools](https://github.com/camelot-dev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools).
 
 ## Installation
 
@@ -82,7 +82,7 @@ $ pip install camelot-py[cv]
 After [installing the dependencies](https://camelot-py.readthedocs.io/en/master/user/install.html#using-pip), clone the repo using:
 
 
-$ git clone https://www.github.com/atlanhq/camelot
+$ git clone https://www.github.com/camelot-dev/camelot
 
and install Camelot using pip: @@ -105,7 +105,7 @@ The [Contributor's Guide](https://camelot-py.readthedocs.io/en/master/dev/contri You can check the latest sources with:
-$ git clone https://www.github.com/atlanhq/camelot
+$ git clone https://www.github.com/camelot-dev/camelot
 
### Setting up a development environment @@ -126,10 +126,10 @@ $ python setup.py test ## Versioning -Camelot uses [Semantic Versioning](https://semver.org/). For the available versions, see the tags on this repository. For the changelog, you can check out [HISTORY.md](https://github.com/atlanhq/camelot/blob/master/HISTORY.md). +Camelot uses [Semantic Versioning](https://semver.org/). For the available versions, see the tags on this repository. For the changelog, you can check out [HISTORY.md](https://github.com/camelot-dev/camelot/blob/master/HISTORY.md). ## License -This project is licensed under the MIT License, see the [LICENSE](https://github.com/atlanhq/camelot/blob/master/LICENSE) file for details. +This project is licensed under the MIT License, see the [LICENSE](https://github.com/camelot-dev/camelot/blob/master/LICENSE) file for details. diff --git a/docs/_templates/sidebarintro.html b/docs/_templates/sidebarintro.html index bd71d7c..e121100 100644 --- a/docs/_templates/sidebarintro.html +++ b/docs/_templates/sidebarintro.html @@ -4,13 +4,13 @@

-

Useful Links

\ No newline at end of file +
  • Issue Tracker
  • + diff --git a/docs/_templates/sidebarlogo.html b/docs/_templates/sidebarlogo.html index 3c6b800..b68edea 100644 --- a/docs/_templates/sidebarlogo.html +++ b/docs/_templates/sidebarlogo.html @@ -4,6 +4,6 @@

    - -

    \ No newline at end of file +

    diff --git a/docs/conf.py b/docs/conf.py index 4c333bf..ee9f57f 100644 --- a/docs/conf.py +++ b/docs/conf.py @@ -63,7 +63,7 @@ master_doc = 'index' # General information about the project. project = u'Camelot' -copyright = u'2018, SocialCops' +copyright = u'2019, Camelot Developers' author = u'Vinayak Mehta' # The version info for the project you're documenting, acts as replacement for @@ -137,7 +137,7 @@ html_theme = 'alabaster' # documentation. html_theme_options = { 'show_powered_by': False, - 'github_user': 'socialcopsdev', + 'github_user': 'camelot-dev', 'github_repo': 'camelot', 'github_banner': True, 'show_related': False, diff --git a/docs/dev/contributing.rst b/docs/dev/contributing.rst index 21cdb36..2fe37c9 100644 --- a/docs/dev/contributing.rst +++ b/docs/dev/contributing.rst @@ -29,8 +29,8 @@ Your first contribution A great way to start contributing to Camelot is to pick an issue tagged with the `help wanted`_ or the `good first issue`_ tags. If you're unable to find a good first issue, feel free to contact the maintainer. -.. _help wanted: https://github.com/socialcopsdev/camelot/labels/help%20wanted -.. _good first issue: https://github.com/socialcopsdev/camelot/labels/good%20first%20issue +.. _help wanted: https://github.com/camelot-dev/camelot/labels/help%20wanted +.. _good first issue: https://github.com/camelot-dev/camelot/labels/good%20first%20issue Setting up a development environment ------------------------------------ @@ -51,7 +51,7 @@ Submit a pull request The preferred workflow for contributing to Camelot is to fork the `project repository`_ on GitHub, clone, develop on a branch and then finally submit a pull request. Here are the steps: -.. _project repository: https://github.com/socialcopsdev/camelot +.. _project repository: https://github.com/camelot-dev/camelot 1. Fork the project repository. Click on the ‘Fork’ button near the top of the page. This creates a copy of the code under your account on the GitHub. @@ -134,7 +134,7 @@ Filing Issues We use `GitHub issues`_ to keep track of all issues and pull requests. Before opening an issue (which asks a question or reports a bug), please use GitHub search to look for existing issues (both open and closed) that may be similar. -.. _GitHub issues: https://github.com/socialcopsdev/camelot/issues +.. _GitHub issues: https://github.com/camelot-dev/camelot/issues Questions ^^^^^^^^^ diff --git a/docs/index.rst b/docs/index.rst index 4c2bf07..550eaed 100644 --- a/docs/index.rst +++ b/docs/index.rst @@ -8,15 +8,15 @@ Camelot: PDF Table Extraction for Humans Release v\ |version|. (:ref:`Installation `) -.. image:: https://travis-ci.org/socialcopsdev/camelot.svg?branch=master - :target: https://travis-ci.org/socialcopsdev/camelot +.. image:: https://travis-ci.org/camelot-dev/camelot.svg?branch=master + :target: https://travis-ci.org/camelot-dev/camelot .. image:: https://readthedocs.org/projects/camelot-py/badge/?version=master :target: https://camelot-py.readthedocs.io/en/master/ :alt: Documentation Status -.. image:: https://codecov.io/github/socialcopsdev/camelot/badge.svg?branch=master&service=github - :target: https://codecov.io/github/socialcopsdev/camelot?branch=master +.. image:: https://codecov.io/github/camelot-dev/camelot/badge.svg?branch=master&service=github + :target: https://codecov.io/github/camelot-dev/camelot?branch=master .. image:: https://img.shields.io/pypi/v/camelot-py.svg :target: https://pypi.org/project/camelot-py/ @@ -81,7 +81,7 @@ Why Camelot? See `comparison with other PDF table extraction libraries and tools`_. .. _ETL and data analysis workflows: https://gist.github.com/vinayak-mehta/e5949f7c2410a0e12f25d3682dc9e873 -.. _comparison with other PDF table extraction libraries and tools: https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools +.. _comparison with other PDF table extraction libraries and tools: https://github.com/camelot-dev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools The User Guide -------------- diff --git a/docs/user/advanced.rst b/docs/user/advanced.rst index 28204f7..d331b76 100644 --- a/docs/user/advanced.rst +++ b/docs/user/advanced.rst @@ -224,12 +224,12 @@ Table areas that you want Camelot to analyze can be passed as a list of comma-se .. csv-table:: :file: ../_static/csv/table_areas.csv -.. note:: ``table_areas`` accepts strings of the form x1,y1,x2,y2 where (x1, y1) -> top-left and (x2, y2) -> bottom-right in PDF coordinate space. In PDF coordinate space, the bottom-left corner of the page is the origin, with coordinates (0, 0). +.. note:: ``table_areas`` accepts strings of the form x1,y1,x2,y2 where (x1, y1) -> top-left and (x2, y2) -> bottom-right in PDF coordinate space. In PDF coordinate space, the bottom-left corner of the page is the origin, with coordinates (0, 0). Specify table regions --------------------- -However there may be cases like `[1] <../_static/pdf/table_regions.pdf>`__ and `[2] `__, where the table might not lie at the exact coordinates every time but in an approximate region. +However there may be cases like `[1] <../_static/pdf/table_regions.pdf>`__ and `[2] `__, where the table might not lie at the exact coordinates every time but in an approximate region. You can use the ``table_regions`` keyword argument to :meth:`read_pdf() ` to solve for such cases. When ``table_regions`` is specified, Camelot will only analyze the specified regions to look for tables. @@ -316,7 +316,7 @@ In this case, the text that `other tools`_ return, will be ``24.912``. This is r You can solve this by passing ``flag_size=True``, which will enclose the superscripts and subscripts with ````, based on font size, as shown below. -.. _other tools: https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools +.. _other tools: https://github.com/camelot-dev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools :: @@ -340,7 +340,7 @@ You can solve this by passing ``flag_size=True``, which will enclose the supersc Strip characters from text -------------------------- -You can strip unwanted characters like spaces, dots and newlines from a string using the ``strip_text`` keyword argument. Take a look at `this PDF `_ as an example, the text at the start of each row contains a lot of unwanted spaces, dots and newlines. +You can strip unwanted characters like spaces, dots and newlines from a string using the ``strip_text`` keyword argument. Take a look at `this PDF `_ as an example, the text at the start of each row contains a lot of unwanted spaces, dots and newlines. :: @@ -366,7 +366,7 @@ You can strip unwanted characters like spaces, dots and newlines from a string u Improve guessed table areas --------------------------- -While using :ref:`Stream `, automatic table detection can fail for PDFs like `this one `_. That's because the text is relatively far apart vertically, which can lead to shorter textedges being calculated. +While using :ref:`Stream `, automatic table detection can fail for PDFs like `this one `_. That's because the text is relatively far apart vertically, which can lead to shorter textedges being calculated. .. note:: To know more about how textedges are calculated to guess table areas, you can see pages 20, 35 and 40 of `Anssi Nurminen's master's thesis `_. @@ -626,7 +626,7 @@ We don't need anything else. Now, let's pass ``copy_text=['v']`` to copy text in Tweak layout generation ----------------------- -Camelot is built on top of PDFMiner's functionality of grouping characters on a page into words and sentences. In some cases (such as `#170 `_ and `#215 `_), PDFMiner can group characters that should belong to the same sentence into separate sentences. +Camelot is built on top of PDFMiner's functionality of grouping characters on a page into words and sentences. In some cases (such as `#170 `_ and `#215 `_), PDFMiner can group characters that should belong to the same sentence into separate sentences. To deal with such cases, you can tweak PDFMiner's `LAParams kwargs `_ to improve layout generation, by passing the keyword arguments as a dict using ``layout_kwargs`` in :meth:`read_pdf() `. To know more about the parameters you can tweak, you can check out `PDFMiner docs `_. diff --git a/docs/user/install.rst b/docs/user/install.rst index fc9fc82..eb328f3 100644 --- a/docs/user/install.rst +++ b/docs/user/install.rst @@ -38,7 +38,7 @@ After :ref:`installing the dependencies `, you can install from th 1. Cloning the GitHub repository. :: - $ git clone https://www.github.com/socialcopsdev/camelot + $ git clone https://www.github.com/camelot-dev/camelot 2. Then simply using pip again. :: diff --git a/docs/user/intro.rst b/docs/user/intro.rst index 4ec50d7..fad2bf4 100644 --- a/docs/user/intro.rst +++ b/docs/user/intro.rst @@ -27,7 +27,7 @@ Here is a `comparison`_ of Camelot's output with outputs from other open-source .. _pdf-table-extract: https://github.com/ashima/pdf-table-extract .. _PDFTables: https://pdftables.com/ .. _Smallpdf: https://smallpdf.com -.. _comparison: https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools +.. _comparison: https://github.com/camelot-dev/camelot/wiki/Comparison-with-other-PDF-Table-Extraction-libraries-and-tools What's in a name? -----------------