Error message #
llama runner process has terminated: error loading model: unable to locate cpu buffer
What it means #
This usually happens when Ollama can’t allocate enough memory to load the model. If you’re running on CPU with limited RAM, it may not be able to create the necessary buffer for the model.
How to fix it #
Try using a smaller model, such as:
- TinyLLaMA (638โฏMB)
- Gemma 3B:1b (815โฏMB)
These use less memory and typically work fine even on systems without a dedicated GPU.
Tip: Always save your work before trying a larger model. If it doesnโt load, no harm done โ just try a lighter one.
