128 TOPS/sec
130 Watts (System)
256 TOPS/sec
160 Watts (System)
512 TOPS/sec
1120 Watts (System)
*System = Host CPU, power supply, DRAM, SRAM etc.
OPENAI Whisper (FP16) : Word Error Rate (the smaller, the better)
> 3.97 pJ / bit (HBM2)
< 1 pJ / bit (3DIC)
Compiler-Frontend
Model parsing and model graph generation
Compiler-Middle-end/Back-end
Compilation and optimization of NPU execution
Simulator
Accuracy and functionality analysis
Profiler
Detailed op-level reports of inference performance