Logo

Parametric Search Appliance

This is custom script. Do not install updates.

 

Thunderstone Search Appliance Manual

<<Previous: Using the Thesaurus Feature ↑Up: Procedures and Examples Next>>: Entity XML Elements

Entity Recognition

 

With entity recognition, the Parametric Search Appliance can identify and extract document information that is not tagged as metadata, and thus may normally be unavailable for use by Group By or other features beyond keyword search. For example, cities, states or other proper names may be embedded in document text, instead of specifically set as <meta> tags. An entity can be defined to match such data, populate a parametric field (here) and thus enable a Group By facet.

An entity is composed of dictionary terms and/or regular expression patterns to search for, plus optional settings. These values are uploaded in an XML or text file to the Parametric Search Appliance, defining an entity. Once defined, entities may then be used with Data from Field (here) to populate parametric fields during crawls.

The first step in creating an entity is creating an XML file with its terms, patterns and settings. (Alternatively, a plain text file may be created; see here.)



Copyright © Thunderstone Software     Last updated: Jul 28 2017

<<Previous: Using the Thesaurus Feature ↑Up: Procedures and Examples Next>>: Entity XML Elements
Page generated in 0.08 seconds.
2024-11-23 18:52:10 EST