
[TIR] Tir constants integration into compilation pipeline#8509

Merged
manupak merged 26 commits intoapache:mainfrom
d-smirnov:tir_constant_2
Feb 22, 2022

Conversation

@d-smirnov
Contributor

This PR is a follow-up to integrate the TIR constant nodes introduced in #8472 into the compilation pipeline.

@d-smirnov
Contributor Author

d-smirnov requested review from anijain2305, areusch, comaniac, jroesch, junrushao1994, jwfromm, kparzysz-quic, MarisaKirisame, masahi, mbrookhart, merrymercy, slyubomirsky, tqchen, vinx13, wweic, yzhliu, zhiics and ZihengJiang as code owners 12 hours ago

Oh, not sure how I managed to do this. I am sorry for such a massive review request.

Contributor

@manupak manupak left a comment


Change-Id: Ia4adca9d3315b26fbdc006ef7c115900c081e303
Change-Id: Ice305f4fefd13fe95e97574e6d63ffeb664621df
Refactored ExtractPrimFuncConstants to IRModule pass.
deDup -> DeDup
Refactored logic of Applicator supplementary class

Change-Id: I6c120d175eb6790ba90f176c4f856bde8f0c7c94
Change-Id: Ie3ee6ea2479476a30f486baef74f20070f117942
Change-Id: I12c63731663b9c9ea574b9ed5cb17311ba3cf701
@manupak
Contributor

manupak commented Feb 22, 2022

Thanks @d-smirnov for working so long on this to enable the representation of non-scalar constants in TIR.

Thanks @giuseros for the initial work.
Thanks @junrushao1994 and @areusch for the reviews!

@kparzysz-quic
Contributor

We're seeing issues on Hexagon after this commit. They are directly caused by float16 not being supported for embedded constants, but there are other potential problems there (a difference between target and executor in fuse_ops). See https://discuss.tvm.apache.org/t/problem-with-fuseops-and-embedded-constants-in-tir/12165 for more info. I'm wondering what your thoughts are on this.

@tmoreau89
Contributor

@kparzysz-quic thanks for signaling the regression. Is there a way this problem can be triggered via a simple unit test? Perhaps we can get the authors to issue a patch, or we could even consider reverting this commit until the issue is resolved?

@manupak
Contributor

manupak commented Feb 25, 2022

We can look into this.

@kparzysz-quic, would you be able to submit a PR with an xfail test for what this PR broke?
(It is a bit harder to track tests that are not in-tree.)

They are directly caused by float16 not being supported for embedded constants

      case runtime::DataType::TypeCode::kFloat:
        switch (arr_type.bits()) {
          case 16:
            // NOTE: float16 is treated as uint16_t.
            element_type = llvm::Type::getIntNTy(*ctx, arr_type.bits());
            BuildLLVMVector<uint16_t>(element_type, arr->data, num_elements, &elements);
            break;
          case 32:
            element_type = llvm::Type::getFloatTy(*ctx);
            BuildLLVMVector<float>(element_type, arr->data, num_elements, &elements);
            break;
          case 64:
            element_type = llvm::Type::getDoubleTy(*ctx);
            BuildLLVMVector<double>(element_type, arr->data, num_elements, &elements);
            break;
          default:
            CHECK(false) << "CodegenParams: only support 32- or 64-bit floating point; saw "
                         << arr_type.bits() << "-bit array";
            break;
        }
        break;

      case runtime::DataType::TypeCode::kFloat: {
        os.fill(' ');
        os.setf(std::ios::left, std::ios::adjustfield);
        if (arr_type.bits() == 16) {
          // NOTE: print types not widely supported by C as uint16_t.
          PrintIntegralArray<uint16_t>(arr->data, num_elements, indent_chars, os);
        } else if (arr_type.bits() == 32) {
          PrintFloatingPointArray<float>(arr->data, num_elements, indent_chars, os);
        } else if (arr_type.bits() == 64) {
          PrintFloatingPointArray<double>(arr->data, num_elements, indent_chars, os);
        } else {
          CHECK(false) << "CodegenParams: only support 32- or 64-bit floating point; saw "
                       << arr_type.bits() << "-bit array";
        }
        break;
      }

The above are called, respectively, in the following locations:

auto array = NDArrayToLLVMArray(ctx_, data);

NDArrayDataToC(data, 4, decl_stream);

(This is the same code used to codegen the LinkedParam node.)
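As an aside on the float16-as-uint16 treatment in both snippets above: the codegen reinterprets the raw float16 bits as an unsigned 16-bit integer. A minimal stand-alone sketch of that bit reinterpretation, using only the Python standard library (not TVM code):

```python
import struct

def float16_bits(value: float) -> int:
    """Reinterpret a float16 value as its raw uint16 bit pattern,
    mirroring how the codegen above emits float16 data as uint16_t."""
    # '<e' packs an IEEE 754 binary16 value; '<H' unpacks the same
    # two bytes as an unsigned 16-bit integer.
    (bits,) = struct.unpack("<H", struct.pack("<e", value))
    return bits

print(hex(float16_bits(1.0)))   # 0x3c00
print(hex(float16_bits(-2.0)))  # 0xc000
```

The values round-trip losslessly because only the byte representation changes, not the bytes themselves.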

So, I am not following what is meant here. Can you elaborate further?

To understand a bit more, would you be able to share the relay.build(...) configuration (targets, executor, runtime) that you are using? Ideally, if the target does not support -link-params, it should be possible to disable it; if the target needs the constants linked (i.e. not handled via the executor), it needs to codegen the tir.AllocateConst node, for which support has been added for all backends of TVM (with tests). Naively looking at the Hexagon codegen, it seems to extend the LLVM codegen.

So what we need to understand is: what is the expectation of the Hexagon backend when linked-param is used?
(I suppose it requires LinkedParam nodes?)

I am sensing we might need to move link-params from being an executor attribute to a target attribute. cc: @Mousius @d-smirnov

I'm a bit concerned by the suggestion to revert based on out-of-tree tests. FYI: @u99127

@kparzysz-quic
Contributor

We have a temporary workaround: just reset the link_params variable to false in fuse_ops. So the urgency for us is lower.

There are two issues that need to be addressed:

  1. The link_params property is inconsistent with the CPU target in FuseOps when invoked through FoldConstants.
  2. Float16 is not supported in the AllocateConst visitors, e.g. https://github.com/apache/tvm/blob/main/src/relay/backend/te_compiler_cache.cc#L242

Hexagon does utilize the LinkedParams function, but we can change that as needed. So far, without any workarounds, we don't even reach the Hexagon codegen.
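Issue 2 above is the familiar failure mode of a dtype dispatch with a missing arm. Purely as an illustration (plain Python, not the actual te_compiler_cache.cc code), the shape of the problem is:

```python
def lower_constant(dtype: str) -> str:
    """Toy dtype dispatch with no float16 arm, mimicking the
    'float16 not handled' error mode (illustrative, not TVM code)."""
    handled = {
        "float32": "f32 const",
        "float64": "f64 const",
        "int32": "i32 const",
    }
    if dtype not in handled:
        # A model lowered with ToMixedPrecision("float16") hits this path.
        raise NotImplementedError(f"{dtype} not handled")
    return handled[dtype]

print(lower_constant("float32"))  # f32 const
try:
    lower_constant("float16")
except NotImplementedError as err:
    print(err)                    # float16 not handled
```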

@manupak
Contributor

manupak commented Feb 25, 2022

Hi @kparzysz-quic ,

This PR is about non-scalar constants; 2) links to scalar constants. Maybe that feature is yet to be supported anyway?

For 1), maybe let's create an issue. I'm trying to understand its relevance to this PR. @d-smirnov, thoughts?

@kparzysz-quic
Contributor

kparzysz-quic commented Feb 25, 2022

Right: the relevance is that I tried setting "relay.FuseOps.link_params" to False in the PassContext config, but it didn't have any effect, since FoldConstants creates its own empty context. It also uses a new CPU target for folding, which has link_params=False. However, FuseOps uses the Executor attribute from the IR module, which has link_params taken from the original target (i.e. True).
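The shadowing behaviour described here can be modeled with a toy stack of nested contexts. This is a self-contained sketch of the mechanism (an illustrative model, not TVM's actual PassContext implementation): each pass reads only the innermost config, so a pass that internally enters a fresh, empty context hides the user's outer settings.

```python
import contextlib

_config_stack = [{}]  # innermost config is the one passes see

@contextlib.contextmanager
def pass_context(config=None):
    """Toy model of nested pass contexts (not TVM code): entering a
    context pushes its config; passes only consult the top of stack."""
    _config_stack.append(config or {})
    try:
        yield
    finally:
        _config_stack.pop()

def current_config():
    return _config_stack[-1]

with pass_context({"relay.FuseOps.link_params": False}):
    # A pass like FoldConstants internally enters a fresh, empty context...
    with pass_context():
        # ...so the user's override is no longer visible here.
        print(current_config().get("relay.FuseOps.link_params", "unset"))
```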

@kparzysz-quic
Contributor

The change this PR made in fuse_ops (obtaining the value of link_params) altered the behavior even for scalar constants. I think the immediate issue is really with how the link_params variable is set in fuse_ops.

@manupak
Contributor

manupak commented Feb 25, 2022

However, FuseOps uses the Executor attribute from IR module, which has link_params taken from the original target (i.e. True).

I am still a bit puzzled.

https://github.com/apache/tvm/pull/8509/files#diff-8b95b87a2611e5e0c367ce17396b051fd868aa260c51ce0e50b94564fcb0e71fR1054-R1057

I read the above code as the PassContext overriding whatever is provided by the Executor.
cc: @d-smirnov

@manupak
Contributor

manupak commented Feb 25, 2022

Would you be able to describe what the "correct" behaviour should be?

@kparzysz-quic
Contributor

kparzysz-quic commented Feb 25, 2022

However, FuseOps uses the Executor attribute from IR module, which has link_params taken from the original target (i.e. True).

I am still a bit puzzled.

https://github.com/apache/tvm/pull/8509/files#diff-8b95b87a2611e5e0c367ce17396b051fd868aa260c51ce0e50b94564fcb0e71fR1054-R1057

I read the above code as PassContext overriding whatever provided by the Executor. cc : @d-smirnov

Line 1054: If an executor is present in the IRModule, set link_params to the value from the executor. This sets link_params to True, even though the current target is CPU (since the executor was created based on the original target "hexagon").

Line 1057: If the current config has the key "relay.FuseOps.link_params", set link_params to that value; otherwise, use the value from line 1054. Here the config is empty, since FoldConstants creates an empty one, so the default value is used (i.e. the one from line 1054).
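The precedence just described (the executor attribute provides the default, and the PassContext config key overrides it only when present) can be sketched as a small stand-alone function. This is an illustrative model of the logic, not the actual TVM code; the dict keys mirror the names used in the discussion:

```python
def resolve_link_params(executor_attrs: dict, pass_ctx_config: dict) -> bool:
    """Illustrative sketch of the precedence discussed above (not TVM code):
    start from the executor's link-params attribute, then let the PassContext
    config key override it only if that key is present. With FoldConstants
    creating an empty config, the executor's value wins."""
    link_params = executor_attrs.get("link-params", False)
    # An empty config (as created by FoldConstants) leaves the default intact.
    return pass_ctx_config.get("relay.FuseOps.link_params", link_params)

# Hexagon scenario: executor carries link-params=True, but the config is empty.
print(resolve_link_params({"link-params": True}, {}))  # True

# The user's override takes effect only when the config actually propagates.
print(resolve_link_params({"link-params": True},
                          {"relay.FuseOps.link_params": False}))  # False
```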

@kparzysz-quic
Contributor

I think you mentioned that link_params should be a property of the target. If fuse_ops got it from the target, everything would work fine, since it would get it from "CPU" instead of "hexagon". I think that would be the correct behavior.

@manupak
Contributor

manupak commented Feb 25, 2022

Hmmm, just to understand this a bit better, what is the target string being used in your case?

Secondly, does the Hexagon backend require the constants bound to the IR (eventually to be codegen'd), or does it require them to be provided at runtime?

@kparzysz-quic
Contributor

kparzysz-quic commented Feb 25, 2022

The first failing test case is resnet50.

This is the script that reproduces the crash:

import coremltools
import onnx
import onnxmltools
import tvm
import tvm.contrib.hexagon
from tvm import relay
from tvm.relay.transform import InferType, ToMixedPrecision

dtype_dict = {"data": "float32"}
shape_dict = {"data": [1,3,224,224]}

#input name and path for your caffe model
proto_file = './ResNet-50-deploy.prototxt'
input_caffe_path = './ResNet-50-model.caffemodel'

# Convert Caffe model to CoreML
coreml_model = coremltools.converters.caffe.convert((input_caffe_path, proto_file))
# Convert the Core ML model into ONNX
onnx_model = onnxmltools.convert_coreml(coreml_model)
onnxmltools.utils.save_model(onnx_model, 'resnet-50.onnx')
mod, params = relay.frontend.from_onnx(onnx_model, shape_dict)
mod = InferType()(mod)
mod = ToMixedPrecision("float16")(mod)

target = tvm.target.hexagon("v68", link_params=True)
config = {"relay.FuseOps.link_params":0}                    # <-- doesn't do anything
with tvm.transform.PassContext(opt_level=3, config=config):
    lib = relay.build(mod, target, target_host=target, params=params, mod_name="default")

At the bottom of the crash dump you should see

  0: tvm::relay::tec::ScheduleBuilder::VisitExpr_(tvm::relay::ConstantNode const*)::{lambda(tvm::runtime::Array<tvm::tir::Var, void> const&)#1}::operator()(tvm::runtime::Array<tvm::tir::Var, void> const&) const
  File "/w/src/dmlc/tvm/src/relay/backend/te_compiler_cache.cc", line 242
TVMError: float16 not handled

The target string is hexagon -keys=hexagon -link-params=1 -mattr=+hvxv68,+hvx-length128b -mcpu=hexagonv68 -mtriple=hexagon.

We don't yet have codegen for AllocateConst specific to Hexagon, but if the LLVM codegen handles it, it will probably work for us as well. Right now we use the link_params function, which contains all the constants in it, but is called at runtime to supply them to the model.

Edit: This will crash with the upstream code as well, so if you have the same resnet50 as we do, you should be able to reproduce this crash.

@manupak
Contributor

manupak commented Feb 25, 2022

config = {"relay.FuseOps.link_params":0}

def test_fuse_take(link_params):
    """Test fusion case involving concat and take"""

    def before():
        shape = (tvm.tir.const(10, "int64"), tvm.tir.const(1, "int64"))
        x = relay.var("x", shape=shape)
        concat = relay.concatenate([x, x], axis=-1)
        out = relay.op.take(concat, indices=relay.const([0], dtype="int64"))
        return relay.Function(relay.analysis.free_vars(out), out)

    def expected(link_params):
        shape1 = (tvm.tir.const(10, "int64"), tvm.tir.const(1, "int64"))
        shape2 = (tvm.tir.const(1, "int64"),)
        x = relay.var("x", shape=shape1)
        p0 = relay.var("p0", shape=shape1)
        p1 = relay.var("p1", shape=shape2, dtype="int64")
        c = relay.const([0], dtype="int64")
        concat = relay.concatenate([p0, p0], axis=-1)
        out = relay.op.take(concat, indices=c if link_params else p1)
        f0 = relay.Function([p0] if link_params else [p0, p1], out)
        f0 = f0.with_attr("Primitive", tvm.tir.IntImm("int32", 1))
        y = relay.Call(f0, [x] if link_params else [x, c])
        return relay.Function([x], y)

    after = run_opt_pass(expected(link_params), transform.InferType())
    with tvm.transform.PassContext(opt_level=2, config={"relay.FuseOps.link_params": link_params}):
        m = run_opt_pass(before(), transform.InferType())
        m = run_opt_pass(m, transform.FuseOps())
    assert tvm.ir.structural_equal(m, after)

I think this is respected. Maybe we need to verify the case when the Executor has link-params?

But after having read this all again -- the actual issue is that link-params will be set to True if the Executor has it. It seems like the Hexagon backend was relying on FoldConstant always having link-params set to False. Is that right?

If so, the solution might be to provide a link-params override for FoldConstants as well?

It feels like the link-params behaviour for the FuseOps pass in the main compilation pipeline needs to be True. Am I wrong here?
Or are you saying that if you hardcode FuseOps link_params to always be False, it all works for you?

@u99127

u99127 commented Feb 25, 2022

Can/should this discussion please be moved into a proper issue?

Ramana

@manupak
Contributor

manupak commented Feb 25, 2022

Yes, I think it makes sense -- @kparzysz-quic, let's create an issue and move the discussion there.

@driazati
Member

driazati commented Mar 3, 2022

@manupa-arm and @junrushao1994 can you tag me on changes to CI scripts in the future? The changes here in tests/scripts/ and tests/lint/python_format.sh broke linting CI on main in a roundabout way

@manupak
Contributor

manupak commented Mar 3, 2022

Oops, sorry about this! Will do.
Attn : @d-smirnov

@driazati
Member

driazati commented Mar 3, 2022

Thanks! BTW, it's fixed in #10469, so no action here is necessary.
