we’re currently evaluating Hibernate-Search 6 as a replacement for our self-made elasticsearch-framework. As we have international customers we make use of the ICU-Plugin provided for Elasticsearch. This mentioned Plugin contains several nice features for Unicode-related operations.
The ICU-Plugin also adds new field types to the index - one of that is called icu_collation_keyword.
As far as I understood, your StringIndexFieldType maps to DataTypes.TEXT in case an analyzer is set. In case no analyzer is set, DataTypes.KEYWORD is used (which is resolved to “keyword”).
So what I wanted to ask:
Is there already some kind of ICU-support for Hibernate-Search?
Did I miss some package I need to add to the project in order to get these types (or even a @IcuCollationKeywordField-annotation? ).
I currently worked around that issue by creating a new implementation of StandardIndexFieldTypeOptionsStep for that ICU-type + created a custom ValueBinder which takes an instance of that implementation. Maybe you got some better ideas