oneAPI Deep Neural Network Library (oneDNN)
Performance library for Deep Learning
2.1.3
cpu_matmul_quantization_cpp_short

C++ API example demonstrating how one can perform reduced precision matrix-matrix multiplication using MatMul and the accuracy of the result compared to the floating point computations.

Concepts: