Optimize ZROT_RVV for the non-unit-stride case #5677

Open
CheryDan wants to merge 1 commit into OpenMathLib:develop from CheryDan:riscv/zdrot
Conversation

@CheryDan
Optimize the RVV implementation of ZROT when inc_x and inc_y are non-unit strides (inc_x != 1, inc_y != 1).

Reorder several operations to reduce vector register pressure, so that GCC can keep vector values in registers instead of spilling them to the stack. This removes redundant spill/reload instructions and improves runtime performance.

No functional change.
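
For readers unfamiliar with the routine, here is a minimal scalar sketch of what the strided path computes. It is not the RVV kernel from this PR. It assumes real `c` and `s` (as in the BLAS ZDROT routine), interleaved real/imaginary storage, and strides counted in complex elements; the name `zrot_ref` is hypothetical.

```c
#include <stddef.h>

/* Hypothetical scalar reference, NOT the RVV kernel changed in this PR.
 * Assumptions: x and y hold interleaved (re, im) doubles, inc_x and inc_y
 * are strides counted in complex elements, and c and s are real. */
void zrot_ref(size_t n, double *x, ptrdiff_t inc_x,
              double *y, ptrdiff_t inc_y, double c, double s)
{
    for (size_t i = 0; i < n; i++) {
        /* Each complex element occupies two doubles, hence the factor 2. */
        double *xp = x + 2 * (ptrdiff_t)i * inc_x;
        double *yp = y + 2 * (ptrdiff_t)i * inc_y;

        /* Load all four inputs before storing, since x and y are both updated. */
        double xr = xp[0], xi = xp[1];
        double yr = yp[0], yi = yp[1];

        xp[0] = c * xr + s * yr;   /* x <- c*x + s*y */
        xp[1] = c * xi + s * yi;
        yp[0] = c * yr - s * xr;   /* y <- c*y - s*x */
        yp[1] = c * yi - s * xi;
    }
}
```

A vectorized version has to keep the same four loaded values and four results live at once across strided loads and stores, which is where the order of operations can affect register pressure.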

Signed-off-by: daichengrong <daichengrong@iscas.ac.cn>
martin-frbg added this to the 0.3.32 milestone on Mar 16, 2026