From 28371817dba33cd177e3a2c709bb76ca88030c15 Mon Sep 17 00:00:00 2001 From: Vinayak Mehta Date: Fri, 14 Aug 2020 02:09:56 +0530 Subject: [PATCH] Fix doc link --- docs/user/how-it-works.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/user/how-it-works.rst b/docs/user/how-it-works.rst index 9c198ba..27bd97f 100644 --- a/docs/user/how-it-works.rst +++ b/docs/user/how-it-works.rst @@ -16,7 +16,7 @@ Stream can be used to parse tables that have whitespaces between cells to simula 1. Words on the PDF page are grouped into text rows based on their *y* axis overlaps. -2. Textedges are calculated and then used to guess interesting table areas on the PDF page. You can read `Anssi Nurminen's master's thesis `_ to know more about this table detection technique. [See pages 20, 35 and 40] +2. Textedges are calculated and then used to guess interesting table areas on the PDF page. You can read `Anssi Nurminen's master's thesis `_ to know more about this table detection technique. [See pages 20, 35 and 40] 3. The number of columns inside each table area are then guessed. This is done by calculating the mode of number of words in each text row. Based on this mode, words in each text row are chosen to calculate a list of column *x* ranges.