As you can read in my post, I'm already using GetPageText with the latest release. But I never get a run of more than one space.
So the visual space is not "filled" with space characters.
- Tue Sep 18, 2018 3:06 pm
- Forum: PDF
- Topic: Get text from pdf including spaces in the PDF document
- Replies: 3
- Views: 3272
- Wed Sep 12, 2018 9:15 am
- Forum: PDF
- Topic: Get text from pdf including spaces in the PDF document
- Replies: 3
- Views: 3272
Get text from pdf including spaces in the PDF document
Hey,
is it possible to get the formatted text from a PDF? For example, a line in the PDF looks like:
Code: Select all
1 Test t3 3,5 14
But the result of "GetPageText" is:
Code: Select all
1 Test t3 3,5 14
I need the space information to split a line into columns.
Thanks a lot!
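When extracted text collapses runs of spaces, one possible workaround is to rebuild the spacing from word positions rather than from the text itself. The following is only a rough Python sketch under stated assumptions: the `words` list (left x coordinate plus text for each word) and the `char_width` value are hypothetical inputs, not something GetPageText returns, and how to obtain them depends on the library.

```python
import re

def rebuild_line(words, char_width):
    """Place each word on a character grid according to its x position.

    words: list of (x, text) tuples; hypothetical input, x in page units
    char_width: assumed average width of one character, same units as x
    """
    line = ""
    for x, text in sorted(words):
        col = round(x / char_width)
        if line:
            # Pad up to the target column, but keep at least one space
            # so neighbouring words never merge.
            pad = max(col - len(line), 1)
        else:
            pad = max(col, 0)
        line += " " * pad + text
    return line

def split_columns(line):
    """Split a rebuilt line wherever two or more spaces occur."""
    return re.split(r" {2,}", line.strip())

# Made-up positions for the example line from the question.
words = [(0, "1"), (40, "Test t3"), (120, "3,5"), (160, "14")]
line = rebuild_line(words, char_width=5)
columns = split_columns(line)
```

Splitting on two or more spaces keeps a value such as "Test t3" together as one column, which a plain split on whitespace would not.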
- Wed Aug 08, 2018 12:01 pm
- Forum: OCR, MRC & Document understanding
- Topic: NullReferenceException when doing PDF OCR
- Replies: 11
- Views: 16893
Re: NullReferenceException when doing PDF OCR
I got this issue when disposing of the imaging and PDF instances before the OCR process had finished. My solution for now was to set the sync option to true (the last parameter here): pdfInstance.OcrPages("*", 0, language, GdPictureHelper.OCRDirectory, "", resolution, 0, true); Complete co...
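The failure described here, releasing a resource while background work is still using it, is a general pattern and not specific to one library. As a minimal Python sketch, with a hypothetical `FakeEngine` class standing in for the imaging/PDF instances (nothing below is GdPicture API), waiting for the job before disposing avoids the error:

```python
from concurrent.futures import ThreadPoolExecutor
import time

class FakeEngine:
    """Hypothetical stand-in for a resource that must outlive its jobs."""
    def __init__(self):
        self.buffer = ["page 1", "page 2"]

    def ocr(self):
        time.sleep(0.05)  # simulate slow background work
        # Raises TypeError if dispose() has already cleared the buffer.
        return [p.upper() for p in self.buffer]

    def dispose(self):
        self.buffer = None

def run_safely():
    engine = FakeEngine()
    with ThreadPoolExecutor(max_workers=1) as pool:
        future = pool.submit(engine.ocr)
        result = future.result()  # block until the job is done
    engine.dispose()              # released only after the job finished
    return result

def run_too_early():
    engine = FakeEngine()
    with ThreadPoolExecutor(max_workers=1) as pool:
        future = pool.submit(engine.ocr)
        engine.dispose()          # released while the job is still running
        try:
            return future.result()
        except TypeError as exc:
            return exc            # the job failed on the disposed resource
```

Running the work synchronously, as the sync option above presumably does, has the same effect as calling `future.result()` before `dispose()`.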