Approach question going from tiffs to PDF/A
Posted: Sun May 31, 2020 12:39 am
Hello!
I have a need for a Blazor Server app and several WPF clients to pull images from a repository of single page 300 dpi bitonal tiffs, apply redactions, combine them into a searchable PDF/A document that looks and prints exactly like the original tiffs and potentially add a digital cert.
So I created a Standard Library (2.1) and started working from the the sample code that goes from a multipage tiff to a searchable pdf. I'm most of the way writing the one (simple to call) function I intend to share amongst the clients when I noticed that the resulting pages being added to the pdf appear to be being made from the OCR text results. I haven't gotten to the point of actually testing it, but it doesn't look like the original tiff image is being folded into the new PDF pages...
So since it has to look exactly like the original, should I try a completely different approach? Maybe:
1. use GdPictureDocumentConverter to convert the single page tiffs to a PDF1_5 document.
2. loop through each page and use the AddRedactionRegion method then the OCRPage method (or vice versa?)
3. Then convert it to PDF_A_1b document
4. Then apply a digital cert if necessary
Anyhow I'm not sure if i'm over or under thinking this.
The samples created from the online PDF/A conversion engine were pretty much perfect I would like to emulate that if possible.
Thanks in advance for any direction!
Kurt
I have a need for a Blazor Server app and several WPF clients to pull images from a repository of single page 300 dpi bitonal tiffs, apply redactions, combine them into a searchable PDF/A document that looks and prints exactly like the original tiffs and potentially add a digital cert.
So I created a Standard Library (2.1) and started working from the the sample code that goes from a multipage tiff to a searchable pdf. I'm most of the way writing the one (simple to call) function I intend to share amongst the clients when I noticed that the resulting pages being added to the pdf appear to be being made from the OCR text results. I haven't gotten to the point of actually testing it, but it doesn't look like the original tiff image is being folded into the new PDF pages...
So since it has to look exactly like the original, should I try a completely different approach? Maybe:
1. use GdPictureDocumentConverter to convert the single page tiffs to a PDF1_5 document.
2. loop through each page and use the AddRedactionRegion method then the OCRPage method (or vice versa?)
3. Then convert it to PDF_A_1b document
4. Then apply a digital cert if necessary
Anyhow I'm not sure if i'm over or under thinking this.
The samples created from the online PDF/A conversion engine were pretty much perfect I would like to emulate that if possible.
Thanks in advance for any direction!
Kurt