Skip to content

Make fast-field normalizers truncate large token instead of dropping#6309

Open
evance-br wants to merge 2 commits intoquickwit-oss:mainfrom
evance-br:make-fast-field-truncate-long-tokens
Open

Make fast-field normalizers truncate large token instead of dropping#6309
evance-br wants to merge 2 commits intoquickwit-oss:mainfrom
evance-br:make-fast-field-truncate-long-tokens

Conversation

@evance-br
Copy link
Copy Markdown

@evance-br evance-br commented Apr 15, 2026

Description

This changes the implementation of the Quickwit fast-fields normalizers raw and lowercase to truncate large token to DEFAULT_TRUNCATE_TOKEN_LENGTH = 225 instead of dropping the token as it's currently done. This will fix the field presence when the field is fast.

How was this PR tested?

  • Added a unit test
  • Ingested data that were failing index field presence query and verified it now work consistently after this change.

Related to: #6288

@fulmicoton
Copy link
Copy Markdown
Collaborator

Can you make it a different/new normalizer?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants