This project builds a minimal LLaMA-style model using PyTorch and converts it to the GGUF format.
It is useful for testing GGUF loaders and tooling without having to download large model files.
To build everything, run: make all
The tokenizer.model file comes from https://huggingface.co/openlm-research/open_llama_3b
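
As a quick sanity check on a generated file, the fixed-size GGUF header can be read with nothing but the Python standard library. The sketch below is not part of this project. It assumes GGUF version 2 or later (little-endian, 64-bit counts), and the function name read_gguf_header is hypothetical. The demo parses a synthetic in-memory header so it runs without any file on disk.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_FORMAT = "<4sIQQ"  # magic, version (uint32), tensor count (uint64), metadata KV count (uint64)
HEADER_SIZE = struct.calcsize(HEADER_FORMAT)  # 24 bytes


def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size GGUF header (assumes GGUF v2+ layout, little-endian)."""
    if len(data) < HEADER_SIZE:
        raise ValueError(f"need at least {HEADER_SIZE} bytes, got {len(data)}")
    magic, version, tensor_count, kv_count = struct.unpack_from(HEADER_FORMAT, data, 0)
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file: magic is {magic!r}")
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }


# Demo on a synthetic header (illustrative values, not taken from a real model file).
example_header = struct.pack(HEADER_FORMAT, GGUF_MAGIC, 3, 2, 5)
print(read_gguf_header(example_header))
```

To check a real file, open it in binary mode and pass the first 24 bytes to the function, for example read_gguf_header(open(path, "rb").read(24)), where path is wherever the build placed the GGUF output.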