[PERF] Performance Regression on Mali GPU

[#2615](https://github.com/dmlc/tvm/pull/2615#issuecomment-481144069) introduced a performance regression to Mali GPU backend. Because some loops cannot be vectorized, the run time of resnet-18 increases to 750ms from original 130ms. Rolling back to its previous commit can fix the problem and reproduce our released benchmark.

This bug was reported by users in the forum.
https://discuss.tvm.ai/t/bad-performance-after-using-tvm-on-nanopc-t4/2006/3
https://discuss.tvm.ai/t/unable-to-reproduce-benchmark-results-on-rk3399-mali-t860/2340/7

@yzhliu  has a [temporary fix](https://github.com/dmlc/tvm/pull/2615#issuecomment-481932019). Please follow up.

cc @icemelon9 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PERF] Performance Regression on Mali GPU #3088

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[PERF] Performance Regression on Mali GPU #3088

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions