Ollama (Local LLMs) vs Cloud LLMs

Running LLMs locally with Ollama versus using cloud APIs represents a fundamental trade-off in agent architecture. Local models give you privacy, zero per-token cost, and no rate limits. Cloud models give you frontier capabilities, no hardware requirements, and instant access to the latest models.

The best production systems use both. Local models handle high-volume, simpler tasks where latency and cost matter. Cloud models handle complex reasoning and tool use where capability matters most.
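One way to implement this split is a small router that keeps privacy-sensitive and simple, high-volume prompts local and escalates everything else. A minimal sketch — the heuristic thresholds and model names below are illustrative assumptions, not recommendations:

```python
from dataclasses import dataclass

# Illustrative model identifiers -- swap in whatever you actually run.
LOCAL_MODEL = "llama3.1:8b"
CLOUD_MODEL = "anthropic/claude-sonnet-4"

@dataclass
class Route:
    model: str
    reason: str

def route(prompt: str, needs_tools: bool = False, sensitive: bool = False) -> Route:
    """Pick a model tier. Thresholds are assumptions; tune them on your own traffic."""
    if sensitive:
        # Privacy-sensitive data never leaves the machine.
        return Route(LOCAL_MODEL, "privacy")
    if needs_tools:
        # Cloud models are still more reliable at function calling.
        return Route(CLOUD_MODEL, "tool use")
    if len(prompt) > 2000:
        # Long, complex prompts escalate to the frontier model.
        return Route(CLOUD_MODEL, "complexity")
    return Route(LOCAL_MODEL, "default: cheap and fast")
```

In practice you would replace the length check with whatever signal predicts difficulty for your workload (task type, retrieval context size, a classifier), but the shape stays the same: local by default, cloud on demand.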

Ollama (Local)

  • Complete data privacy — nothing leaves your machine
  • Zero per-token cost after hardware investment
  • No rate limits or API quotas
  • Works offline — no internet dependency
  • Full control over model selection and quantization

Cloud LLMs

  • Frontier capabilities (GPT-4, Claude, Gemini)
  • No hardware requirements — scales instantly
  • Always up to date with latest model versions
  • Reliable tool use and function calling
  • Enterprise features (fine-tuning, batch API, evaluation)

Verdict

Use both. Local Ollama models for high-volume simple tasks and privacy-sensitive workloads. Cloud models for complex reasoning and tool use. Model fallback chains that start local and escalate to cloud give you the best of both.

Frequently Asked Questions

What hardware do I need for Ollama?

It depends on the model and quantization. A 4-bit-quantized 7B model runs on 8GB of RAM, a 13B model needs 16GB, and a 70B model needs 48GB+ of RAM or VRAM. GPU acceleration (NVIDIA CUDA, Apple Silicon) dramatically improves throughput.
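Those figures follow from a rough rule of thumb: at 4-bit quantization the weights take about half a gigabyte per billion parameters, plus overhead for the KV cache and runtime. A back-of-the-envelope estimator — the overhead factor here is an assumption, not a measurement:

```python
def min_memory_gb(params_billion: float, bits: int = 4, overhead: float = 1.5) -> float:
    """Rough minimum memory for local inference.

    Weights take params * bits/8 bytes; `overhead` (an assumed multiplier)
    covers the KV cache, activations, and runtime. Treat the result as a floor,
    not a guarantee.
    """
    weights_gb = params_billion * bits / 8
    return round(weights_gb * overhead, 1)
```

Running it for 7B, 13B, and 70B models lands in the same ballpark as the numbers above; long context windows push the KV cache, and therefore the real requirement, higher.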

Can local models do tool use?

Some can, but reliability varies. Models like Qwen and Llama support function calling, but frontier cloud models are still more reliable for complex tool use scenarios.
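Ollama accepts an OpenAI-style `tools` field on its `/api/chat` endpoint, so you can try function calling locally with a model that supports it. A sketch that builds the request payload — the model name and the weather tool are illustrative assumptions:

```python
import json

def build_tool_request(prompt: str) -> dict:
    """Request body for POST http://localhost:11434/api/chat.

    Uses the OpenAI-style function schema that Ollama accepts.
    The get_weather tool is a made-up example.
    """
    return {
        "model": "qwen2.5:7b",  # assumed: any local model with tool support
        "stream": False,
        "messages": [{"role": "user", "content": prompt}],
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }],
    }

# The model's tool call, if any, comes back under
# response["message"]["tool_calls"]. Check for its absence too:
# smaller models sometimes answer in plain text instead of calling the tool,
# which is exactly the reliability gap described above.
```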

How do I combine local and cloud models?

Use a fallback chain: try the local Ollama model first, fall back to a cloud model via OpenRouter if the local model fails or returns inadequate results.
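A minimal sketch of that chain, with the two clients injected as callables so the escalation logic stays testable. The "inadequate result" check here is a naive length floor — an assumption; real code would score the answer, and the callables would wrap Ollama's and OpenRouter's chat endpoints:

```python
from typing import Callable, Optional

def with_fallback(
    local: Callable[[str], Optional[str]],
    cloud: Callable[[str], str],
    min_length: int = 20,  # assumed floor for an "adequate" answer
) -> Callable[[str], str]:
    """Try the local model first; escalate to the cloud on failure or thin output."""
    def run(prompt: str) -> str:
        try:
            answer = local(prompt)
        except Exception:
            answer = None  # local server down, model not pulled, etc.
        if answer is None or len(answer.strip()) < min_length:
            return cloud(prompt)
        return answer
    return run

# In production, `local` would POST to http://localhost:11434/api/chat and
# `cloud` to https://openrouter.ai/api/v1/chat/completions.
```

Because the cloud call only fires on failure or a weak local answer, the expensive path is the exception rather than the rule, which is the whole point of the hybrid setup.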
