Hi Yoann,
I think that catching indexing errors will not help, because the catching logic itself may stop or fail (e.g. during an application restart/redeployment) and those errors get lost -> cluster out of sync. Anything done after the DB transaction commits may fail and would lead to an out-of-sync situation.
The same applies to storing a queue of entities to index in Kafka - how would you keep that queue in sync with the database? The transaction commits, the push to Kafka fails -> cluster out of sync.
To make it safe, I see only one way: the knowledge of which entities need to be indexed must be stored together with those entities in the same DB, inside the same transaction used to update/insert/delete them (a separate table, aka event sourcing). In that case it is guaranteed that the knowledge of which entities to index cannot be lost. Storing that knowledge in any other system (Kafka or a file system) would require a 2-phase commit, which is quite a pain and usually not an option. When automatic indexing succeeds, it has to mark those entities/events as processed. If automatic indexing fails, those entities/events must be retried by an async process. Or am I missing something?
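
To make the idea concrete, here is a rough sketch with plain JPA and resource-local transactions; all names (IndexingEvent, Book, Indexer, processed, ...) are made up for illustration, not an existing API:

    import jakarta.persistence.*;
    import java.util.List;

    // Hypothetical "events" table: one row per entity change that still needs indexing.
    @Entity
    class IndexingEvent {
        @Id @GeneratedValue Long id;
        String entityType;   // e.g. "Book"
        String entityId;     // primary key of the changed entity
        boolean processed;   // flipped to true once indexing succeeded
    }

    @Entity
    class Book {
        @Id Long id;
        String title;
    }

    interface Indexer {
        void index(String entityType, String entityId); // may throw on failure
    }

    class IndexingEventSketch {

        // Business update: the entity change and the indexing event are written
        // in the same DB transaction, so the knowledge of what to index cannot be lost.
        void updateBook(EntityManager em, Long bookId, String newTitle) {
            em.getTransaction().begin();
            Book book = em.find(Book.class, bookId);
            book.title = newTitle;

            IndexingEvent event = new IndexingEvent();
            event.entityType = "Book";
            event.entityId = bookId.toString();
            event.processed = false;
            em.persist(event); // same DB, same transaction as the entity update
            em.getTransaction().commit();
        }

        // Async retry process: pick up unprocessed events, index them, mark them processed.
        // If indexing throws, the transaction rolls back, the events stay pending,
        // and they are retried on the next run.
        void processPendingEvents(EntityManager em, Indexer indexer) {
            em.getTransaction().begin();
            List<IndexingEvent> pending = em
                .createQuery("select e from IndexingEvent e where e.processed = false",
                             IndexingEvent.class)
                .setMaxResults(100)
                .getResultList();
            for (IndexingEvent event : pending) {
                indexer.index(event.entityType, event.entityId);
                event.processed = true;
            }
            em.getTransaction().commit();
        }
    }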
Best regards,
Sergiy