From 75f667ccd57733d490cc9b184cd40505cd698d49 Mon Sep 17 00:00:00 2001
From: Vinayak Mehta
Date: Sun, 23 Sep 2018 21:45:20 +0530
Subject: [PATCH] Add jupyter notebook link

---
 README.md      | 2 +-
 docs/index.rst | 3 ++-
 2 files changed, 3 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 0673550..790b2b2 100644
--- a/README.md
+++ b/README.md
@@ -44,7 +44,7 @@ There's a [command-line interface]() too!
 
 - **You are in control**: Unlike other libraries and tools which either give a nice output or fail miserably (with no in-between), Camelot gives you the power to tweak table extraction. (Since everything in the real world, including PDF table extraction, is fuzzy.)
 - **Metrics**: *Bad* tables can be discarded based on metrics like accuracy and whitespace, without ever having to manually look at each table.
-- Each table is a **pandas DataFrame**, which enables seamless integration into data analysis workflows.
+- Each table is a **pandas DataFrame**, which enables seamless integration into [ETL and data analysis workflows](https://gist.github.com/vinayak-mehta/e5949f7c2410a0e12f25d3682dc9e873).
 - **Export** to multiple formats, including json, excel and html.
 - Simple and Elegant API, written in **Python**!
 
diff --git a/docs/index.rst b/docs/index.rst
index 09ff68f..418bbb5 100644
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -59,12 +59,13 @@ Why Camelot?
 
 - **You are in control**: Unlike other libraries and tools which either give a nice output or fail miserably (with no in-between), Camelot gives you the power to tweak table extraction. (Since everything in the real world, including PDF table extraction, is fuzzy.)
 - **Metrics**: *Bad* tables can be discarded based on metrics like accuracy and whitespace, without ever having to manually look at each table.
-- Each table is a **pandas DataFrame**, which enables seamless integration into data analysis workflows.
+- Each table is a **pandas DataFrame**, which enables seamless integration into `ETL and data analysis workflows`_.
 - **Export** to multiple formats, including json, excel and html.
 - Simple and Elegant API, written in **Python**!
 
 See `comparison with other PDF parsing libraries and tools`_.
 
+.. _ETL and data analysis workflows: https://gist.github.com/vinayak-mehta/e5949f7c2410a0e12f25d3682dc9e873
 .. _comparison with other PDF parsing libraries and tools: https://github.com/socialcopsdev/camelot/wiki/Comparison-with-other-PDF-Table-Parsing-libraries-and-tools
 
 The User Guide