[XPU] add quantize_linear and dequantize_linear op #71375
Merged
PR Category
Custom Device
PR Types
New features
Description
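Adds the quantize_linear and dequantize_linear operators for XPU. Because of certain executor constraints, and following the GPU and CPU implementations, two auxiliary operators with no real functional meaning are also added.

The operators currently ignore parameters such as qmin and bit_length. There are several reasons for this: for example, the call entry point in python/paddle/nn/quant/format.py passes only a small number of parameters, and, as in the GPU and CPU code, the intermediate call layers progressively drop these parameters (that is, they are not passed further down).

In addition, after discussion with the business side, only_observer and round_type are supported with their default values only. The !is_test case is not supported either; it will be added later if a need arises.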
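For readers unfamiliar with the two operators, the following is a minimal reference sketch of symmetric per-tensor linear quantization in plain Python. It is not the XPU kernel from this PR: the formula, the clipping range, the round-half-to-even behaviour, and the function signatures are assumptions based on the conventional definition of linear quantization, and bit_length appears here only as an illustrative parameter (the PR states that the XPU operators ignore it).

```python
def quantize_linear(values, scale, bit_length=8):
    """Hypothetical reference: map floats to signed integers.

    Assumes symmetric quantization with q = round(x / scale * qmax),
    clipped to [-qmax - 1, qmax]. Not taken from the PR's kernel code.
    """
    qmax = (1 << (bit_length - 1)) - 1   # 127 for 8 bits
    qmin = -qmax - 1                     # -128 for 8 bits
    out = []
    for v in values:
        q = round(v / scale * qmax)      # Python's round() rounds half to even
        out.append(max(qmin, min(qmax, q)))
    return out


def dequantize_linear(qvalues, scale, bit_length=8):
    """Hypothetical reference: map the integers back to floats."""
    qmax = (1 << (bit_length - 1)) - 1
    return [q * scale / qmax for q in qvalues]


if __name__ == "__main__":
    x = [0.0, 0.3, 0.5, 1.0, -1.0, 2.0]
    q = quantize_linear(x, scale=1.0)
    print(q)
    print(dequantize_linear(q, scale=1.0))
```

Values whose magnitude exceeds scale saturate at the clipping bounds, so the round trip is lossy for them; for in-range values the reconstruction error under these assumptions is at most scale / (2 * qmax).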