Skip to content
#

intel-arc

Here are 37 public repositories matching this topic...

Makes Intel Arc Pro B70 GPUs actually fast on Ubuntu Server. 11 llama.cpp cherry-picks that fix the big B70 bugs (MoE slot-init SEGV, Q8_0 reorder crash, OOM reorder, missing BF16 GET_ROWS, wrong Xe2 warptile, slow K-quant DMMV, etc.) + Mesa 26 + runtime env workarounds + SYCL/Vulkan backend-selection rules. 2-7x speedup on 4x B70, bench-verified.

  • Updated May 10, 2026
  • Python

Field-tested guide: multi-GPU vLLM tensor-parallel (TP=2/TP=4) on Intel Arc Pro B70 (Battlemage BMG-G31, Xe2) on Linux. Driver setup (xe force_probe=e223), bare-metal vLLM + oneAPI 2025.3, the compute-runtime multi-root USM + triton-xpu init_devices fixes, FP8/int4-AutoRound quant, root-cause error reports. AI-agent readable (AGENTS.md).

  • Updated Jun 13, 2026
  • Shell

Improve this page

Add a description, image, and links to the intel-arc topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the intel-arc topic, visit your repo's landing page and select "manage topics."

Learn more