Naija-Petro (32B) โ€” GGUF

GGUF quantizations of Shinzmann/naija-petro (the 32B variant) for inference with llama.cpp, Ollama, LM Studio, and compatible runtimes.

See the full model card for training details, intended use, and limitations. For a lighter footprint, consider the 8B GGUF. For Nigeria-specific accuracy, use these weights with the Naija-Petro RAG system.

Available quantizations

File suffix Method Notes
Q4_K_M 4-bit (k-quant, medium) Smallest; recommended default for 32B on limited RAM/VRAM
Q5_K_M 5-bit (k-quant, medium) Higher quality, larger
Q8_0 8-bit Near-lossless; largest and slowest

32B GGUF files are large. Q4_K_M is the practical choice for most machines; ensure you have enough RAM/VRAM + disk for the chosen quant.

Usage

Ollama

ollama run hf.co/Shinzmann/naija-petro-GGUF:Q4_K_M

llama.cpp

./llama-cli -hf Shinzmann/naija-petro-GGUF:Q4_K_M \
  -p "What are the screening criteria for CO2 EOR?" -c 4096

License

Apache-2.0 (inherited from Qwen3-32B). Validate outputs with qualified engineers before operational use.

Downloads last month
154
GGUF
Model size
33B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Shinzmann/naija-petro-GGUF

Base model

Qwen/Qwen3-32B
Quantized
(1)
this model