Searching for words and phrases that span lines  SOLVED

Forum for the PDF-XChange Editor - Free and Licensed Versions

rakunavi
User
Posts: 870
Joined: Sat Sep 11, 2021 5:04 am

Searching for words and phrases that span lines

Post by rakunavi »

=== UPDATE ===================================================================
The issue reported below has been resolved in Ver 9.5 build 365.
I appreciate all the hard work and efforts of the support and development team.
==============================================================================


Hello all,

I have found that when I search for a phrase that spans two lines in Japanese, the phrase does not appear in the search results. Japanese, like Chinese and Thai, is written without spaces between words. The issue may arise because this characteristic is not taken into account in the program's implementation. Adobe Acrobat Reader and Foxit Editor Pro both find words that span lines.

Imagine the following example.

   xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx x
   xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx Japan
   ese
xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xx
   xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx x

If you search the above example, the keyword "Japan" is found, but the keyword "Japanese" is not. I have used English words only to illustrate the point; the actual issue occurs with Japanese text. This type of issue does not surface in English because word-wrapping rules generally do not split a word across lines.
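
To illustrate the behaviour described above, here is a minimal Python sketch. It is not the implementation of any of the products mentioned; it assumes the page text is available as a list of line strings, and the function names find_per_line and find_across_lines are hypothetical. Searching each line in isolation misses "Japanese", while searching the lines joined without a separator finds it.

```python
from bisect import bisect_right
from itertools import accumulate


def find_per_line(lines, keyword):
    """Return (line_index, column) hits, searching each line in isolation."""
    hits = []
    for i, line in enumerate(lines):
        col = line.find(keyword)
        while col != -1:
            hits.append((i, col))
            col = line.find(keyword, col + 1)
    return hits


def find_across_lines(lines, keyword):
    """Return (line_index, column) of each hit's first character,
    searching the lines joined with no separator (an assumption that
    suits text written without spaces between words)."""
    text = "".join(lines)
    # starts[i] is the offset of line i within the joined text.
    starts = [0] + list(accumulate(len(line) for line in lines))[:-1]
    hits = []
    pos = text.find(keyword)
    while pos != -1:
        i = bisect_right(starts, pos) - 1  # line containing offset pos
        hits.append((i, pos - starts[i]))
        pos = text.find(keyword, pos + 1)
    return hits


lines = ["xxxx xxxx Japan", "ese xxxx xxxx"]
print(find_per_line(lines, "Japanese"))      # []
print(find_across_lines(lines, "Japanese"))  # [(0, 10)]
```

For space-delimited languages a line break would normally count as a space, so a real implementation would have to decide per script how to join lines; joining with no separator is only an assumption made for this sketch.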

I have attached a verification video for your reference.


I searched for the same word "操作方法" in PDF-XChange Editor, Adobe Acrobat Reader, and Foxit Editor Pro. PDF-XChange Editor gives a total of 5 hits, while Acrobat and Foxit each give 6. The one-hit difference is an occurrence where the word crosses a line break.

The difference is whether or not each software recognizes the "操作方法" between lines 3 and 4 on page 6.

All software was left at its default settings, with no options changed. The PDF file being searched in the video is the PDF manual published by Jungle, a distributor of the PDF-XChange Editor in Japan. It can be downloaded from the following URL.

   https://www.junglejapan.com/biz/pdfx/pdf/manual.pdf

Hoping that the above information will be of some help to you.
Thank you so much for your continued support.

Best regards,
rakunavi

- PDF-XChange Editor Plus Version:9.3 build 361.0
- Adobe Acrobat Reader DC (64bit) 2022.001.20117 | 64bit
- Foxit PDF Editor Pro Version: 11.2.1.53537
- OS Version: Windows 10 Home/Pro 21H2 Build 19044.1706
- PC Model: Lenovo IdeaPad C340-15IWL / HP ProDesk 600G1
Last edited by rakunavi on Tue Nov 29, 2022 7:15 am, edited 1 time in total.
TOP desires for PDFXCE
forum.pdf-xchange.com/viewtopic.php?t=39665 LassoTool
forum.pdf-xchange.com/viewtopic.php?t=38554 CmtGarbled
forum.pdf-xchange.com/viewtopic.php?t=37353 FulScrMultiMon
forum.pdf-xchange.com/viewtopic.php?t=41002 DisableTouchSelect
TrackerSupp-Daniel
Site Admin
Posts: 8436
Joined: Wed Jan 03, 2018 6:52 pm

Re: Searching for words and phrases that span lines  SOLVED

Post by TrackerSupp-Daniel »

Hello, rakunavi

Thank you for the report, I have reproduced this issue and forwarded it to our Dev team for rectification. The ticket number in question today is:
RT#6135: Japanese text "search" does not find multi-line results.

Kind regards,
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address have changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
rakunavi
User
Posts: 870
Joined: Sat Sep 11, 2021 5:04 am

Re: Searching for words and phrases that span lines

Post by rakunavi »

Hi Daniel,

Thank you for taking the time to look into this in detail.
I am relieved to have been able to share my situation with you.
I'm looking forward to future updates.

Best regards,
rakunavi
Tracker Supp-Stefan
Site Admin
Posts: 17810
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Searching for words and phrases that span lines

Post by Tracker Supp-Stefan »

:)
rakunavi
User
Posts: 870
Joined: Sat Sep 11, 2021 5:04 am

Re: Searching for words and phrases that span lines

Post by rakunavi »

Hello all,

Regarding what I reported above, I have found that the "Find and Redact" feature in the "Redact" group on the "Protect" tab also fails for words or phrases that span lines. The following Japanese words on the first page of the PDF file at the URL below are examples. You can try them in both the search feature from my first report and the Find and Redact feature reported here.

  https://www.junglejapan.com/biz/pdfx/pdf/PDFXChangeEditor_document.pdf (1,633 KiB)
  This PDF file is a PDF-XChange Editor brochure published by Jungle, the distributor of PDF-XChange Editor in Japan.

  1. 高速
  2. セキュリティ
  3. 社員
  4. ライセンス
  5. 日本
  6. 使用
  7. スタンプ
Guide.png

Failing to find a word in a search is a serious issue, but failing to redact content that should be protected is even more serious. As mentioned above, this issue seems to affect not only Japanese but also other languages written without spaces between words, such as Chinese and Thai.
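
As a rough illustration of why a cross-line match matters for redaction, the following Python sketch splits one match into per-line segments, each of which would need its own redaction mark. It is only a sketch under assumptions, not how Find and Redact is implemented: the page text is assumed to be a list of line strings, the function name split_match_by_line is hypothetical, and the sample lines are made up for the example rather than taken from the brochure.

```python
def split_match_by_line(lines, keyword):
    """Return, for each hit in the joined text, a list of
    (line_index, start_col, end_col) segments that cover the hit."""
    text = "".join(lines)
    # Offset of each line's first character in the joined text.
    starts = []
    offset = 0
    for line in lines:
        starts.append(offset)
        offset += len(line)

    results = []
    pos = text.find(keyword)
    while keyword and pos != -1:
        end = pos + len(keyword)
        segments = []
        for i, line in enumerate(lines):
            # Clip the hit [pos, end) to the extent of line i.
            seg_start = max(pos, starts[i])
            seg_end = min(end, starts[i] + len(line))
            if seg_start < seg_end:
                segments.append((i, seg_start - starts[i], seg_end - starts[i]))
        results.append(segments)
        pos = text.find(keyword, pos + 1)
    return results


# Made-up sample: the word "セキュリティ" is split after "セキュ".
lines = ["高速で安全なセキュ", "リティ機能"]
print(split_match_by_line(lines, "セキュリティ"))
```

A search that only looks inside single lines would return no segments for this keyword, so nothing would be marked for redaction even though the word is visible on the page.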

As a side note, another search-related feature is "Create Links from Bookmarks", which works perfectly well even for words or phrases that span lines.

Hoping that the above information will be of some help to you.
Thank you so much for your continued support.

Best regards,
rakunavi
Paul - Tracker Supp
Site Admin
Posts: 6829
Joined: Wed Mar 25, 2009 10:37 pm
Location: Chemainus, Canada

Re: Searching for words and phrases that span lines

Post by Paul - Tracker Supp »

Thanks Rakunavi.

I have made a note on the ticket to pay attention to Find and Redact as well as the regular search.

Thanks for bringing this to our attention.
Best regards

Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
rakunavi
User
Posts: 870
Joined: Sat Sep 11, 2021 5:04 am

Re: Searching for words and phrases that span lines

Post by rakunavi »

Hello all,

This is just a follow-up message.

Regarding the multi-line search issue reported above, mainly for Japanese: the search results in PDF-XChange Viewer (2.5.322.10) are correct, as in Acrobat and Foxit. Perhaps the source code of PDF-XChange Viewer would be helpful.

Best regards,
rakunavi
TrackerSupp-Daniel
Site Admin
Posts: 8436
Joined: Wed Jan 03, 2018 6:52 pm

Re: Searching for words and phrases that span lines

Post by TrackerSupp-Daniel »

Hello, rakunavi

Thank you for that, rakunavi! We will be sure to compare the two and see what we can find.

Kind regards,
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD
