This forum has been archived. All content is frozen. Please use KDE Discuss instead.

Nepomuk 4.10.2 inconsistent content index results for PDF?

Tags: None
(comma "," separated)
molecule-eye
Registered Member
Posts
402
Karma
0
OS
I somehow got it working. In Dolphin, going to a pdf's properties -> information -> configure and checking "content" revealed that the contents of various pdfs were indexed. One showed nothing so I manually ran nepomukindexer on it, went back to properties -> information in Dolphin and indexed content showed up. Then magically ALL indexed content showed up! Apparently my pdfs were indexed but Dolphin wasn't tapping into the database. I'll see if it continues to work on reboot.
User avatar
Ignacio Serantes
Registered Member
Posts
453
Karma
1
OS
molecule-eye wrote:I somehow got it working. In Dolphin, going to a pdf's properties -> information -> configure and checking "content" revealed that the contents of various pdfs were indexed. One showed nothing so I manually ran nepomukindexer on it, went back to properties -> information in Dolphin and indexed content showed up. Then magically ALL indexed content showed up! Apparently my pdfs were indexed but Dolphin wasn't tapping into the database. I'll see if it continues to work on reboot.

When you select a file with Dolphin is indexed if is not indexed.


Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
User avatar
pumrel
Registered Member
Posts
8
Karma
0
OS
molecule-eye wrote:I somehow got it working. In Dolphin, going to a pdf's properties -> information -> configure and checking "content" revealed that the contents of various pdfs were indexed. One showed nothing so I manually ran nepomukindexer on it, went back to properties -> information in Dolphin and indexed content showed up. Then magically ALL indexed content showed up! Apparently my pdfs were indexed but Dolphin wasn't tapping into the database. I'll see if it continues to work on reboot.
I'm seeing the exact same behaviour. I'm searching for 'receptor' in my research papers I downloaded and only those that have the word in their title come up. However it's a word that is in almost all research papers I have, believe me. So I just open one that did not come up, search for the word, find it, then search again in dolphin. However I have to search for the word in every pdf separately in order for dolphin to take it into account. Apparently, it should not work like that.
User avatar
bcooksley
Administrator
Posts
19765
Karma
87
OS
Can you try running the "nepomukindexer" program against each PDF file, and see if it is completing successfully? Also, is the directory enabled for indexing in System Settings > Desktop Search?


KDE Sysadmin
[img]content/bcooksley_sig.png[/img]
User avatar
einar
Administrator
Posts
3402
Karma
7
OS
The "issue" with indexing PDFs is that you need to put their text inside the Virtuoso index. However Virtuoso does not like too large elements of text pushed into it, and can go crazy on the CPU, therefore only a part of it is pushed inside (a predefined size starting from the beginning).


"Violence is the last refuge of the incompetent."
Image
Plasma FAQ maintainer - Plasma programming with Python


Bookmarks



Who is online

Registered users: bancha, Bing [Bot], Google [Bot], Sogou [Bot]