
[TFLite] TFLite FP16 Post Quantization Support #5823

@FrozenGene

Description

TensorFlow Lite now supports converting weights to 16-bit floating point values during model conversion from TensorFlow to TensorFlow Lite's flat buffer format. This results in a 2x reduction in model size.
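
For reference, this is how FP16 post-training quantization is enabled with the TFLite converter (the SavedModel path here is just a placeholder):

```python
import tensorflow as tf

# Convert a TF SavedModel to TFLite with FP16 post-training quantization.
converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
# Restrict quantized constants to float16: weights are stored as FP16,
# roughly halving the model size.
converter.target_spec.supported_types = [tf.float16]
tflite_fp16_model = converter.convert()

with open("model_fp16.tflite", "wb") as f:
    f.write(tflite_fp16_model)
```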

However, the conversion inserts a new Dequantize op in front of compute ops (like Conv2D) to dequantize the FP16 weights back to FP32, like this:
[screenshot: TFLite graph with a Dequantize node converting the FP16 weights to FP32 before Conv2D]
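
One way to confirm the pattern is to check which tensors in the converted model were stored as FP16; a quick check, assuming the `tflite_fp16_model` buffer from the snippet above:

```python
import numpy as np
import tensorflow as tf

# Load the converted model and list the tensors kept in FP16.
interpreter = tf.lite.Interpreter(model_content=tflite_fp16_model)
interpreter.allocate_tensors()

for detail in interpreter.get_tensor_details():
    if detail["dtype"] == np.float16:
        # Weight tensors stored as FP16; a Dequantize op feeds
        # their FP32 consumers (e.g. Conv2D).
        print(detail["index"], detail["name"], detail["shape"])
```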

TVM doesn't support this behavior yet. The main things we need to do:

  • Support the float16 type inside the TFLite parser
  • Extend dequantize to support FP16-to-FP32 conversion (see the sketch after this list)
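
A minimal sketch of how the two items could fit together in the Relay frontend, assuming the FP16 DEQUANTIZE op is lowered as a plain cast since no scale or zero point is involved (the helper name `convert_dequantize_fp16` and the shapes are illustrative, not the actual TVM implementation):

```python
from tvm import relay

def convert_dequantize_fp16(weight):
    # FP16 post-quantized weights carry no scale/zero point, so the
    # TFLite DEQUANTIZE op reduces to a plain cast to float32.
    return relay.cast(weight, dtype="float32")

# Illustrative use: an FP16 weight feeding a float32 conv2d,
# mirroring the Dequantize -> Conv2D pattern in the graph above.
data = relay.var("data", shape=(1, 3, 224, 224), dtype="float32")
w_fp16 = relay.var("weight", shape=(64, 3, 7, 7), dtype="float16")
out = relay.nn.conv2d(data, convert_dequantize_fp16(w_fp16),
                      kernel_size=(7, 7), channels=64)
```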

Related issue: #5774
