auto-tuning

Here are 50 public repositories matching this topic...

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated Jun 24, 2026
Python

oracle / bpftune

Star

bpftune uses BPF to auto-tune Linux systems

linux ebpf bpf auto-tuning

Updated Jun 10, 2026
C

zwang4 / awesome-machine-learning-in-compilers

Star

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

machine-learning compiler parallel-computing parallelism artificial-intelligence operating-systems optimisation auto-tuning parallel-programming parallelisation multi-cores

Updated Jan 21, 2026

KernelTuner / kernel_tuner

Star

Kernel Tuner

python c testing machine-learning cplusplus gpu optimization opencl cuda autotuning software-development opencl-kernels kernel-tuner cuda-kernels gpu-computing auto-tuning

Updated Jun 18, 2026
Python

ROCm / Tensile

Star

[DEPRECATED] Moved to ROCm/rocm-libraries repo

python machine-learning amd gpu assembly opencl dnn matrix-multiplication neural-networks gpu-acceleration blas hip gpu-computing tensors tensor-contraction gemm radeon auto-tuning

Updated Jun 20, 2026
Python

sbu-fsl / kernel-ml

Star

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

machine-learning kernel-module linux-kernel operating-systems auto-tuning mlsys

Updated Dec 13, 2021
C

CNugteren / CLTune

Sponsor

Star

CLTune: An automatic OpenCL & CUDA kernel tuner

opencl cuda tuner auto-tuning

Updated Dec 12, 2022
C++

ederwander / PyAutoTune

Star

Autotune Module for Python "PyAutoTune"

audio python c pyaudio real-time dsp realtime voice autotuning fft pitch auto-tuning autotune auto-tune

Updated Oct 16, 2020
C

HAL-42 / AlchemyCat

Star

Alchemy Cat —— 🔥Config System for SOTA

config machine-learning computer-vision deep-learning auto-tuning parameter-tuning

Updated Dec 18, 2025
Python

SUSE / phoebe

Star

Phoebe

linux machine-learning artificial-intelligence systems self-healing auto-tuning

Updated May 24, 2021
C

tlc-pack / TLCBench

Star

Benchmark scripts for TVM

benchmark deep-learning auto-tuning tvm tuning-logs

Updated Mar 15, 2022
Python

weixingsun / jBProF

Star

ebpf profiler for jvm

profiler jvm latency breakpoint perf flamegraph jni ebpf bpf jvmti auto-tuning

Updated Jul 8, 2025
C++

Katagiri-Hoshino-Lab / VibeCodeHPC

Star

CLI-based multi-agents for Auto-Tuning (e.g. HPC code optimazation loops) supporting Local LLMs

tmux hpc opencode multiagent ssh-agent auto-tuning local-llm gemini-cli claude-code codex-cli kimi-cli qwen-code cline-cli vibe-local

Updated Mar 25, 2026
Python

Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learning across diverse platforms with Linux, Windows, MacOS and Android provided by volunteers. Demo of DNN crowd-benchmarking and crowd-tuning: