[ONNX] [Test] fix GRU modification and reduce tolerance for RNN tests by vvchernov · Pull Request #8923 · apache/tvm

vvchernov · 2021-09-03T11:03:56Z

During unification of GRU layer for frontends (#8781) the critical bag in GRU implementation on ONNX frontend side was observed. Inspite of unit test of GRU the bag was hidden by high tolerance (~1e-2) set for RNN tests. Moreover after bug fixing GRU unit test still requires high tolerance for correct pass. This fact was strange because local tests show very good coincidence with golden output from pytorch for any GRU modification (~1e-7).

I checked LSTM tests: tolerance 1e-6 is enough in most cases but sometimes it needs 1e-5 (it is tolerance for other operations tested for onnx frontend).
Bug in GRU was localized and fixed. it was related to implementation of specific GRU modification.
tolerance for RNN tests were reduced from 1e-2 to 1e-5
Random seed was added for GRU test. The test was passed 1000 times to avoid flaky test

AndrewZhaoLuo · 2021-09-03T16:34:08Z

Hey all, how did you check the required tolerance? Can you run it 1000 times to ensure these tolerances will pass if you have not already?

vvchernov · 2021-09-03T17:09:43Z

It is a good question. Unfortunately there is only one test for operation (or its modification) to reduce testing time. Moreover numpy random is used with repeatable randomization. It means that it is not checked fully all cases for operation from which we could define tolerance thresholds. But it checks that operation works for one but orbitrary input for reasonable tolerance. For numerical calculations the tolerance of order of 1e-6 - 1e-5 seems reasonable. In my case it was high enough to hide problem in operation.

AndrewZhaoLuo · 2021-09-03T17:37:01Z

I just want to make sure the tests are not super flaky.

You can do this by running pytest tests/python/frontend/onnx/test_forward.py::test_gru --repeat 1000 after pip installing pytest-repeat.

Lower than 1 / 1000 failures will seem sufficient.

vvchernov · 2021-09-06T19:19:35Z

@AndrewZhaoLuo I have added seed for test reproduction and checked it 1000 times.

AndrewZhaoLuo

LGTM, though you'll need one of the code owners on the right to merge. @mbrookhart

…apache#8923) * fix high tolerance for RNN tests * random seed was added to GRU test reproduction Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>

vvchernov requested review from Huyuwei, areusch, comaniac, jroesch, junrushao, jwfromm, kazum, mbrookhart, merrymercy, siju-samuel, srkreddy1238, tqchen and yzhliu as code owners September 3, 2021 11:03

fix high tolerance for RNN tests

c32cb7a

vvchernov force-pushed the vc/onnx_unit_test branch from 2c8fe0e to c32cb7a Compare September 3, 2021 11:06

random seed was added to GRU test reproduction

efaeafd

AndrewZhaoLuo approved these changes Sep 6, 2021

View reviewed changes

vvchernov changed the title ~~[ONNX] [Test] fix GRU modification (linear before reset = False) and reduce tolerance for RNN tests~~ [ONNX] [Test] fix GRU modification and reduce tolerance for RNN tests Sep 7, 2021

masahi approved these changes Sep 8, 2021

View reviewed changes

masahi merged commit 8027a7a into apache:main Sep 8, 2021

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ONNX] [Test] fix GRU modification and reduce tolerance for RNN tests#8923

[ONNX] [Test] fix GRU modification and reduce tolerance for RNN tests#8923
masahi merged 2 commits intoapache:mainfrom
Deelvin:vc/onnx_unit_test

vvchernov commented Sep 3, 2021 •

edited

Loading

Uh oh!

AndrewZhaoLuo commented Sep 3, 2021

Uh oh!

vvchernov commented Sep 3, 2021

Uh oh!

AndrewZhaoLuo commented Sep 3, 2021

Uh oh!

vvchernov commented Sep 6, 2021

Uh oh!

AndrewZhaoLuo left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

vvchernov commented Sep 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndrewZhaoLuo commented Sep 3, 2021

Uh oh!

vvchernov commented Sep 3, 2021

Uh oh!

AndrewZhaoLuo commented Sep 3, 2021

Uh oh!

vvchernov commented Sep 6, 2021

Uh oh!

AndrewZhaoLuo left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vvchernov commented Sep 3, 2021 •

edited

Loading