Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Dictionary decoder has some hot paths in decoding
Describe the solution you'd like
We can unroll the dictionary gather/scatter loop to vectorize / improve throughput in this loop.
Also we can improve the codegen for BitReader::get_value by not going through a slice
Additional context
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Dictionary decoder has some hot paths in decoding
Describe the solution you'd like
We can unroll the dictionary gather/scatter loop to vectorize / improve throughput in this loop.
Also we can improve the codegen for
BitReader::get_valueby not going through asliceAdditional context