KPV and Table extraction in PDF File

Discussions about machine vision support in GdPicture.
Post Reply
jloizagah
Posts: 29
Joined: Tue Mar 17, 2009 2:45 pm

KPV and Table extraction in PDF File

Post by jloizagah » Tue Oct 17, 2023 2:26 pm

Hi all.

I am starting using KPV and Table extraction new funcionalities, following your examples, and it seems that these new funtions rely on the OCR library. Even in your exaples, if you want to extract key par values or tables from a PDF file, the pdf is rasterized and an OCR proccess is performed. Is this always necesary?. If I am using a PDF file with its own text layer, why I have to convert it to images and perform an OCR?.

Best regards.

lindamat
Posts: 1
Joined: Wed Nov 08, 2023 11:00 am

Re: KPV and Table extraction in PDF File

Post by lindamat » Wed Nov 08, 2023 11:07 am

Hello, I think if you have a PDF file with an embedded text layer and the library supports extracting text directly from the PDF's text layer, OCR and rasterization may not be necessary. The library should be able to access the text information directly and extract the desired key-value pairs or tables without the need for OCR. geometry dash subzero

Post Reply

Who is online

Users browsing this forum: No registered users and 2 guests