Add Metal (Apple Silicon) build variants
#1
by
robtaylor-chipflow - opened
Add pre-built Metal kernel variants for Apple Silicon Macs.
Build variants:
torch210-metal-aarch64-darwintorch29-metal-aarch64-darwin
These enable GPU-accelerated fused RMS normalization on MPS (Metal Performance Shaders) backend, tested on M1/M2/M3/M4 with macOS 14+.
74/74 tests passing across all dtypes and configurations.
Hey @robtaylor-chipflow , could you please open a pr here : https://github.com/huggingface/kernels-community instead ?