Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
GGUF
GGUF
formats
A file format for quantized models designed for efficient CPU and GPU inference with llama.cpp.
Learn more on Wikipedia
GGUF — Glossary — ThinkLLM