Neural Magic
Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM
Pinned Loading
Repositories
Showing 10 of 98 repositories
- gorilla Public Forked from ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
neuralmagic/gorilla’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic/vllm’s past year of commit activity - nyann-bench Public
neuralmagic/nyann-bench’s past year of commit activity - speculators Public Forked from vllm-project/speculators
A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
neuralmagic/speculators’s past year of commit activity - lighteval Public Forked from huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
neuralmagic/lighteval’s past year of commit activity - vllm-openshift-recipes Public
neuralmagic/vllm-openshift-recipes’s past year of commit activity - nm-vllm-omni-ent Public Forked from vllm-project/vllm-omni
A framework for efficient model inference with omni-modality models
neuralmagic/nm-vllm-omni-ent’s past year of commit activity - model-validation-configs Public
neuralmagic/model-validation-configs’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…