Not getting exact match in Hibernate Search

Kumar_Shikhar · August 17, 2020, 10:12am

Hi Members,

I have a question about exact matching in Hibernate Search.
I have tried searching with two types of tokenizers:

Whitespace tokenizer
Standard tokenizer

My query is as below:
org.apache.lucene.search.Query exact_query = qb.phrase().withSlop(0).onField("fullRecord").sentence(input.toLowerCase()).createQuery();
My analyzers are as below:
@AnalyzerDef(name = "textanalyzer",
    tokenizer = @TokenizerDef(factory = StandardTokenizerFactory.class, params = {
        @Parameter(name = "maxTokenLength", value = "8000") }),
    filters = {
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = DoubleMetaphoneFilterFactory.class, params = {
            // @Parameter(name = "encoder", value = "DoubleMetaphone"),
            @Parameter(name = "maxCodeLength", value = "10"),
            @Parameter(name = "inject", value = "true") }),
    })
@AnalyzerDef(name = "WithWhitespaceTokenizerFactory",
    tokenizer = @TokenizerDef(factory = WhitespaceTokenizerFactory.class),
    filters = {
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = PhoneticFilterFactory.class, params = {
            @Parameter(name = "encoder", value = "Metaphone"),
            @Parameter(name = "maxCodeLength", value = "20"),
            @Parameter(name = "inject", value = "true")
        }),
    })

@Field(termVector = TermVector.WITH_POSITION_OFFSETS)
@Analyzer(definition = "textanalyzer")
@Field(name = "fullRecord_forWildcards", analyzer = @Analyzer(definition = "WithWhitespaceTokenizerFactory"))

@Column(name = "FullRecord")
private String fullRecord;

Search String : ROW HIGHER

Expected Result : Only rows that contain an exact match for ROW HIGHER should be returned,
i.e. ROW HIGHER, 2, ROW HIGHER, 2, STR, LONDON

Actual Output :
All three of the rows below are returned:
ROW HIGHER, 2, ROW HIGHER, 2, STR, LONDON
HIGHER ROW, HIGHER ROW, FORE STREET, KINGSAND, CORNWALL, PL10 1NL
HIGHER ROW, HIGHER ROW, FORE STREET, KINGSAND, CORNWALL
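
One possible reading of this output, sketched in plain Java without Hibernate Search or Lucene: a slop-0 phrase query looks for the query tokens at consecutive positions anywhere in the token stream, and a value such as HIGHER ROW, HIGHER ROW tokenizes to higher, row, higher, row, which contains row followed by higher. The class and method names below are hypothetical, the tokenizer is only a rough stand-in for a standard tokenizer plus lower-casing, and the phonetic filters from the analyzer definitions are ignored.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class PhraseMatchSketch {

    // Rough stand-in for tokenizer + lower-case filter: split on anything that
    // is not a letter or digit, then lower-case each piece.
    // Assumption: phonetic filters are not modelled here.
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[^\\p{L}\\p{N}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    // Slop-0 phrase match: the query tokens must occur at consecutive
    // positions somewhere in the field's token list.
    public static boolean phraseMatches(String fieldValue, String phrase) {
        List<String> field = tokenize(fieldValue);
        List<String> query = tokenize(phrase);
        if (query.isEmpty()) {
            return false;
        }
        for (int start = 0; start + query.size() <= field.size(); start++) {
            if (field.subList(start, start + query.size()).equals(query)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[] rows = {
            "ROW HIGHER, 2, ROW HIGHER, 2, STR, LONDON",
            "HIGHER ROW, HIGHER ROW, FORE STREET, KINGSAND, CORNWALL, PL10 1NL",
            "HIGHER ROW, HIGHER ROW, FORE STREET, KINGSAND, CORNWALL"
        };
        for (String row : rows) {
            System.out.println(phraseMatches(row, "ROW HIGHER") + "  " + row);
        }
    }
}
```

Under this simplified model all three rows report a match, because the repeated HIGHER ROW places row directly before higher in the token stream. A row containing HIGHER ROW only once would not match.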

Note : I have tried both the Standard and Whitespace tokenizers. In both cases I get the same results.
I have also tried using another field with no analyzer on it, like this:
@Field(name = "exact_fullRecord", analyze = Analyze.NO)

Please suggest a solution to achieve this.
Your help would be highly appreciated!

Swarnkar · September 15, 2020, 4:08am

For an exact match you do not need to tokenize the field (otherwise the value is split into multiple tokens on punctuation marks or whitespace):
@Field(name = "exact_fullRecord", analyze = Analyze.NO)
This should work for an exact match.
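
To make the suggestion above concrete: as far as I understand, a field mapped with Analyze.NO is indexed as one single term holding the whole value, unchanged. The plain-Java sketch below only models that idea and does not call Hibernate Search; the class and method names are hypothetical.

```java
import java.util.Objects;

public class UnanalyzedFieldSketch {

    // Assumption: without analysis the whole field value is a single term,
    // compared exactly as stored (case and punctuation included).
    public static boolean exactTermMatches(String fieldValue, String queryText) {
        return Objects.equals(fieldValue, queryText);
    }

    public static void main(String[] args) {
        String stored = "ROW HIGHER, 2, ROW HIGHER, 2, STR, LONDON";
        // The whole stored value versus just the search string.
        System.out.println(exactTermMatches(stored, "ROW HIGHER"));
        // Identical value, identical case.
        System.out.println(exactTermMatches("ROW HIGHER", "ROW HIGHER"));
        // Lower-cased input against an upper-case stored value.
        System.out.println(exactTermMatches("ROW HIGHER", "ROW HIGHER".toLowerCase()));
    }
}
```

Under this model a search for ROW HIGHER matches only when the entire stored value is exactly ROW HIGHER, and lower-casing the input as in the original query would stop it from matching upper-case stored values. Whether that is the intended meaning of "exact match" depends on the data.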
