gfx1201

Star

Here are 7 public repositories matching this topic...

maeddesg / vulkanforge

Star

LLM inference engine for AMD RDNA4 — Rust + Vulkan compute shaders, gguf & native FP8.

rust machine-learning amd vulkan inference mesa llm fp8 gguf rdna4 gfx1201 gemma4

Updated Jun 20, 2026
Rust

tlee933 / llama.cpp-rdna4-gfx1201

Star

llama.cpp with native AMD RDNA4 (gfx1201) ROCm 7.11 support - 98.97 tok/s AI inference, competitive with RTX 4070 Ti, 32GB VRAM

machine-learning amd gpu hip rocm ai-inference llm llama-cpp rdna4 gfx1201

Updated Jan 3, 2026
C++

KaiFelixBennett / gemma4-turboquant-rdna4

Star

Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.

amd hip quantization gemma rocm kv-cache long-context llama-cpp local-llm llm-inference flash-attention rdna4 gfx1201 turboquant

Updated Jun 22, 2026
Python

Fine-tune your own LLM on an AMD Radeon GPU — the easy, tested way. QLoRA via ROCm on Windows/WSL2 & Linux, a worked Gemma-4 example, a reusable live training dashboard, and a smoke test that proves the loss falls.

amd pytorch lora gemma rocm radeon fine-tuning peft wsl2 llm llama-cpp qlora rdna4 gfx1201

Updated Jun 20, 2026
HTML

xnyzer / ollama-rocm

Star

Ollama with ROCm 7 GPU acceleration for AMD RDNA 4 (RX 9070, RX 9070 XT, RX 9060 XT - gfx1201) on Windows 11

windows amd rocm llm local-llm ollama rdna4 gfx1201 rx-9070-xt hip-sdk

Updated Jun 14, 2026
PowerShell

cantascendia / rocm-rdna4-windows

Star

Run PyTorch natively on Windows 11 with an AMD RX 9070 XT (RDNA4 / gfx1201) on stable ROCm 7.2.1 — no WSL2, no Linux, no ZLUDA. Exact pinned wheel URLs, runtime env vars, documented RDNA4 pitfalls (broken nightlies, no xformers/flash-attn/bitsandbytes), and real benchmarks for ComfyUI, SD, LLMs, RVC/TTS. Verified on one 9070 XT.

Updated Jun 16, 2026
Batchfile

Dixon-Cider / gemma4-mtp-rocm-windows-r9700

Star

Run Google's Gemma 4 31B with MTP speculative decoding natively on Windows on an AMD Radeon AI PRO R9700 (RDNA4/gfx1201) via llama.cpp + ROCm 7.1.1 — the build recipe, the ROCm cmath compile fix, and a KV-precision / recall / reasoning study (q4_0 runs the full 256K context with no measurable quality loss).

windows gemma rocm amd-gpu llama-cpp local-llm speculative-decoding rdna4 gfx1201

Updated Jun 22, 2026
PowerShell

Improve this page

Add a description, image, and links to the gfx1201 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gfx1201 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gfx1201

Here are 7 public repositories matching this topic...

maeddesg / vulkanforge

tlee933 / llama.cpp-rdna4-gfx1201

KaiFelixBennett / gemma4-turboquant-rdna4

KaiFelixBennett / RadeonForge

xnyzer / ollama-rocm

cantascendia / rocm-rdna4-windows

Dixon-Cider / gemma4-mtp-rocm-windows-r9700

Improve this page

Add this topic to your repo