Totara Learn Open Discussions

Global Search + Solr > does not search within PDF files

Für dieses Forum ist eine Höchstzahl von Beiträgen innerhalb eines bestimmten Zeitraums festgelegt worden. Dies gilt nach 3 Beiträgen innerhalb von 2 Tage

We have had Global Search enabled for at least six months on our Totara LMS. Unfortunately, the search output does not produce the results we were hoping for.

Having read numerous articles online about Solr, we thought it would index the plain text content of our PDF files (since it can). However, every search result only outputs information from the 1) learning pages, 2) file names and 3) the descriptions that we used to describe our uploaded files or e-learning modules. It does not provide search results for key words mentioned within PDF files.

All data is indexed via Solr.
Search areas for pages, HTML, folders, labels, SCORM and files are enabled.
The 'Discard site default search field names' option is disabled; enabling it would produce no output.
As far as we can see, everything is configured correctly. All five steps show a green 'yes'.
We are enrolled in certain courses were pdf files are uploaded.

How can we ensure that the global search also returns results from indexed PDF documents? Does anyone have any best practices or experience of similar issues and solutions? Or does Totara simply not use the Solr option to search within files, even though 'files' are mentioned under 'search areas'?

Permalink | Antwort

Hi Frank

Using SOLR for global search does have some known limitations around completeness of results and performance

However searching content in pdf files isn't available with other Totara features yet . Possibly the AI features such as summarise might be able to be developed further.

I can't find any documents in Totara around setting this up but it sounds like you have it setup and have indexed the files OK it is the search results that aren't as accurate as expected. A partner using this may help provide tips for better results

Are there any messages in the SOLR log outlining any problems with pdf indexing?

Does the indexing work with text based . simple, small files - as the indexing doesn't handle image based, , complex formatting or large documents well

Regards

Permalink | Ursprungsbeitrag anzeigen | Antwort