Summarzing comments, extracting highlighted text
Posted: Wed Dec 18, 2019 12:58 pm
Hello everybody,
I followed the discussion and would like to join with a question. I have the same problem, namely that the highlighted text is not recognized. Therefore I use the codelines:
Dim Options As PDFXOCR_Funcs.PXO_Options = New PDFXOCR_Funcs.PXO_Options
Options.blacklist = ""
Options.whitelist = ""
Options.raster_dpi = m_DPI
Options.ImageFlags = PDFXOCR_Funcs.OCR_ImageProcessingFlags.OCR_Image_EdgeRefine
Options.DataPath = m_Datapath
Options.lang = m_Language
Options.RegionMode = PDFXOCR_Funcs.OCR_RegionMode.OCR_Line
Options.reserved = 0
But when I open the PDf with the PDF XChange Editor, select the text recognition, then set it to 600 dpi and high, it will be recognized well. Now the question is how do I implement this in the code. I set the m_DPI to 600 dpi. But it does not work.
I am sorry, if this is the wrong forum, but the topic fits so well.
Greets, Yvonne
[Moderator note: this topic was split from this original post.]
I followed the discussion and would like to join with a question. I have the same problem, namely that the highlighted text is not recognized. Therefore I use the codelines:
Dim Options As PDFXOCR_Funcs.PXO_Options = New PDFXOCR_Funcs.PXO_Options
Options.blacklist = ""
Options.whitelist = ""
Options.raster_dpi = m_DPI
Options.ImageFlags = PDFXOCR_Funcs.OCR_ImageProcessingFlags.OCR_Image_EdgeRefine
Options.DataPath = m_Datapath
Options.lang = m_Language
Options.RegionMode = PDFXOCR_Funcs.OCR_RegionMode.OCR_Line
Options.reserved = 0
But when I open the PDf with the PDF XChange Editor, select the text recognition, then set it to 600 dpi and high, it will be recognized well. Now the question is how do I implement this in the code. I set the m_DPI to 600 dpi. But it does not work.
I am sorry, if this is the wrong forum, but the topic fits so well.
Greets, Yvonne
[Moderator note: this topic was split from this original post.]