Open
Description
It seems like DirectML supports the upcoming NPU-enabled chips for Windows machines:
https://devblogs.microsoft.com/directx/introducing-neural-processor-unit-npu-support-in-directml-developer-preview/
I don't think there is any other way to tap into this hardware, so we should explore if it possible to add this library as a backend in ggml
in order to run stuff on the NPUs. There has been some semi-related work in the past that combined ggml
and Direct3D: https://github.com/Const-me/Whisper. Not sure if it is relevant at all, maybe just as an inspiration