Enable termsVectors for indexed OCR field

Project:RUcore SOLR Searching and Indexing
Version:7.7
Component:Code
Category:feature request
Priority:normal
Assigned:triggs
Status:closed
Description

In order to retrieve hit frequency count termVectors needs to be enabled on the OCR text layers Solr index field.

<a href="http://wiki.apache.org/solr/TermVectorComponent" title="http://wiki.apache.org/solr/TermVectorComponent">http://wiki.apache.org/solr/TermVectorComponent</a>

Comments

#1

Version:7-x» 7.4

#2

Version:7.4» 7.5

I think this is one of the experimental features that we've been waiting for the VM to try out.

#3

Version:7.5» 7.6

This is another waiting on the Solr VM.

#4

Version:7.6» 7.7

This can be worked on with the password I now have for the Solr user on rep-dev.

#5

What is the status of this item scheduled for R7.7?

#6

Assigned to:triggs» chadmills
Status:active» test

This is ready to test on rep-dev. I changed the definition of field name="text" from:
<field name="text" type="text_general" indexed="true" stored="false" multiValued="true"/>
to:
<field name="text" type="text_general" indexed="true" stored="true" termVectors="true" termPositions="true" termOffsets="true" />
restarted the Solr server on rep-dev, and ran the portalcron.

#7

Assigned to:chadmills» triggs
Status:test» active

This is so old, 3+ years, that while the issue says what its intended purpose was I don't know if that is needed anymore. For now I would say remove the configuration change and reindex. After doing so we should close this. We can also reopen it later if we want to revisit.

#8

Version:7.7» 8.x

#9

Version:8.x» 7.7
Status:active» closed

OK. I commented out the vector definition and uncommented the old version we've been using.

Back to top