nf4
Here are 10 public repositories matching this topic...
Nodes to run Hunyuan Image 3 locally with BF16 and NF4 quantized options in ComfyUI
Updated Apr 30, 2026 - Python
This repository wraps the flux fill model as ComfyUI nodes. Compared to the flux fill dev model, these nodes can perform inpainting and outpainting with less VRAM.
Updated May 14, 2025 - Python
This project implements a classic Retrieval-Augmented Generation (RAG) system using HuggingFace models with quantization techniques. The system processes PDF documents, extracts their content, and enables interactive question-answering through a Streamlit web application.
Updated Jul 18, 2025 - Python
This project presents a medical question–answering language model built by fine-tuning Google Gemma-2-2B-IT using LoRA (Low-Rank Adaptation) 🧠⚕️. The primary objective is to adapt a general-purpose large language model to the healthcare domain in a parameter-efficient, reproducible, and resource-aware manner.
Updated Dec 28, 2025 - Jupyter Notebook
4-bit NormalFloat (NF4) quantized LoRA for Rust with GGUF export
Updated Mar 6, 2026 - Rust
Pure Gleam tensor library with quantization (INT8, NF4, AWQ), Flash Attention, and 2:4 Sparsity - 7.5x memory multiplication
Updated Jun 11, 2026 - Gleam
A hands-on, runnable tour of LLM quantization: compress GPT-2 to 4-bit while tracking memory and quality, ending with QLoRA's NF4 and double quantization.
Updated Jun 17, 2026 - Python
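For readers new to the topic, the following is a minimal sketch of blockwise NF4 quantization in plain Python. It is not taken from any repository listed here. The function names and the block size of 64 are illustrative assumptions, the code values are the commonly published NF4 levels rounded to four decimals, and QLoRA's double quantization (quantizing the per-block scales themselves) is left out.

```python
# Illustrative sketch only: names and block size are assumptions,
# not the API of any listed project.

# The 16 NF4 code values, rounded to four decimals.
NF4_LEVELS = [
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
    0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
]


def quantize_block(block):
    """Scale one block by its absolute maximum, then map each value
    to the index of the nearest NF4 level (a 4-bit code, 0..15)."""
    absmax = max(abs(x) for x in block) or 1.0  # avoid dividing by zero
    codes = [
        min(range(16), key=lambda i: abs(NF4_LEVELS[i] - x / absmax))
        for x in block
    ]
    return codes, absmax


def dequantize_block(codes, absmax):
    """Look up each code and undo the absmax scaling."""
    return [NF4_LEVELS[c] * absmax for c in codes]


def quantize(weights, block_size=64):
    """Split a flat list of weights into blocks and quantize each one."""
    return [
        quantize_block(weights[i:i + block_size])
        for i in range(0, len(weights), block_size)
    ]


def dequantize(blocks):
    """Rebuild an approximate flat weight list from (codes, absmax) pairs."""
    out = []
    for codes, absmax in blocks:
        out.extend(dequantize_block(codes, absmax))
    return out
```

Each block stores one floating-point scale plus one 4-bit code per weight, which is where the memory saving comes from. Smaller blocks track outliers more closely at the cost of storing more scales.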
AI model compression benchmarks — NF4 beats INT8 in every metric
Updated Feb 21, 2026 - Python
Ideogram 4 local CUDA image generator: built-in NF4 weights, official Magic Prompt prompt optimization, and Windows/Linux portable packages. Local open-weight text-to-image generator.
Updated Jun 18, 2026 - Python