Registered Member
|
All,
I am running the latest Nepomuk 4.10.2 and am seeing strange return result behavior when searching for items. When I search for a keyword, such as Transformer, I can see the files that contain this result similar to: file:///home/ntruhan/Documents/Transformer.txt file:///home/ntruhan/Documents/transformer config 2.JPG file:///home/ntruhan/Documents/transformer config.JPG file:///home/ntruhan/Documents/transformer install 1.JPG file:///home/ntruhan/Documents/transformer install 2.JPG file:///home/ntruhan/Documents/transformer install 3.JPG file:///home/ntruhan/Documents/transformer install 4.JPG file:///home/ntruhan/Documents/Optimize the Operating Environment for Transformer (old).pdf file:///home/ntruhan/Documents/Optimize the Operating Environment for Transformer.docx file:///home/ntruhan/Documents/Optimize the Operating Environment for Transformer.pdf file:///home/ntruhan/Documents/Transformer in Production.doc .... BUT when I try to search Content I get a much smaller result set. I didn't think the new Nepomuk indexed DOC/DOCX, etc.. yet, so I wouldn't expect that, but I was under the impression it did PDF's? I say this as I get the following result: file:///home/ntruhan/Documents/Transformer cmplst.txt file:///home/ntruhan/Documents/Transformer Install Images.odt file:///home/ntruhan/Documents/transformer config 2.JPG file:///home/ntruhan/Documents/transformer config.JPG file:///home/ntruhan/Documents/transformer install 1.JPG file:///home/ntruhan/Documents/transformer install 2.JPG file:///home/ntruhan/Documents/transformer install 3.JPG file:///home/ntruhan/Documents/transformer install 4.JPG file:///home/ntruhan/Documents/Optimize the Operating Environment for Transformer (old).pdf file:///home/ntruhan/Documents/Transformer in Production.doc The one I am concerned about is the file:///home/ntruhan/Documents/Optimize the Operating Environment for Transformer.pdf Which is just a newer version of the (old) one and it has the word Transformer inside it I have 32,206 files indexed in Nepomuk, and I just deleted my index and re-created it after upgrading and let the index rebuild. Any idea why it would index the contents of one PDF and not another PDF that I know contains the same keyword? Thanks you. |
Registered Member
|
You could use nepomukshow to inspect stored data:
If pdf is not properly indexed you could force a reindex with next command:
If search still fails then the problem is in the search and not in the stored data.
Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
|
Registered Member
|
OK. I think I found the problem...
My libpoppler.so.35 exists in /opt/kde4/lib while the nepomukpopplerextractor.so doesn't seem to find it, although it did find it when compiling it. In cmake I see: -- Found PopplerQt4: /opt/kde4/include/poppler/qt4 My KDE4 is not installed in the standard location, and while other libraries have been automatically linked and can be seen with ldconfig -p, the latest poppler inside that location was not linked. I am going to add the lib path to my LD_LIBRARY_PATH and see how it works then. Thank you for the help. |
Registered Member
|
Great. Please don't forget to mark your entry as solved.
Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
|
Registered Member
|
Nepomuk doesn't search the contents of my pdfs at all, it seems. And for some reason it searches the contents of my text files, even though I have not told it to index those files. I'm running KDE 4.10.2, clean install of Kubuntu 13.04 (not upgraded from previous version with an old version of KDE).
|
Administrator
|
@molecule-eye: Can you provide the output of the nepomukindexer command applied against a PDF file you have to see if it mentions why the indexing it not succeeding? Your system might be missing the PDF indexer, or be unable to use it.
KDE Sysadmin
[img]content/bcooksley_sig.png[/img] |
Registered Member
|
How would I do this? nepomukshow isn't a recognized command. I also notice that I have always had audio files unchecked from indexing, and yet when I do a contents search I get mp3 files as results. Is this a Kubuntu/Ubuntu-specific issue? On a side, my other Kubuntu 13.04 system, upgraded from 12.10, searches contents of pdfs just fine and seems to be working normally. (The same folders and file types are set to be indexed on both machines.) |
Registered Member
|
nepomukindexer program is part of nepomuk-core and mandatory to file indexing so if you don't have this programs installed you must check your installation. nepomukshow is not required for indexing but it's useful for testing purposes.
Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
|
Registered Member
|
Sorry, I have no idea how to obtain nepomukshow. I've installed the dev and debugging tools for nepomuk but it's not in there. Is it available for download somewhere? Google isn't helping.
|
Manager
|
|
Registered Member
|
nepomukshow is now part of nepomuk-core repository.
Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
|
Manager
|
I can not find it in openSUSE 12.3, searched for it using the package manager fyi contents of nepomuk-core package
|
Registered Member
|
It's not part of nepomuk-core in Kubuntu either. I'll try downloading from git and compiling it myself.
|
Administrator
|
Please note I mentioned the nepomukindexer command not the nepomukshow command.
Running the nepomukindexer command might give valuable output explaining why the file(s) are not being indexed.
KDE Sysadmin
[img]content/bcooksley_sig.png[/img] |
Registered Member
|
Then for some reason packagers are not including this file but it's part of nepomuk-core.
To obtain output using nepomukindexer you must first enable debug output for nepomuk (strigi service), using kdebugdialog, and use --data parameter.
Ignacio Serantes, proud to be a member of KDE forums since 2008-Nov.
|
Registered users: bancha, Bing [Bot], Google [Bot], Sogou [Bot]