Question 1

How can I run Claude Code for free?

Accepted Answer

I cover two methods: run an open-source model locally with Ollama, or route Claude Code to free cloud models through OpenRouter. Claude Code is the harness, so you can “swap the engine” to a free model. You’ll still need a small setup step, but ongoing usage can be free depending on the model and provider limits.

Question 2

Is using Ollama or OpenRouter with Claude Code against Anthropic’s terms of service?

Accepted Answer

No—this isn’t some loophole where you’re stealing Opus. You’re using the Claude Code agent harness, but you’re pointing it at a different model. The key idea is you’re not running Opus locally; you’re using open-weight models that are actually downloadable or freely accessible.

Question 3

What’s the difference between open-source and closed-source models for Claude Code?

Accepted Answer

Open-source (open-weight) models can be downloaded, run, and modified, which is why you can host them locally for free. Closed-source models like Opus/Sonnet/GPT/Gemini are locked behind a company API, so you pay per token. Closed-source is usually better right now, but the gap is shrinking fast.

Question 4

How do I set up Ollama to use a local model with Claude Code?

Accepted Answer

Download Ollama from ollama.com, then pull a model with `ollama pull `. You can test it with `ollama run `, and then launch Claude Code via the Ollama “launch Claude” command. I demo this directly inside VS Code so you can keep everything in one workflow.

Question 5

Why do some open-source models misbehave inside Claude Code?

Accepted Answer

Claude Code expects certain tool-calling behavior and JSON protocol, and many open models weren’t trained specifically for that harness. Another huge issue is context window size—Claude’s system prompt and your repo can blow past smaller contexts. In the video, I fix this by increasing the model context in Ollama and then tool calls start showing properly.

Question 6

How do I configure Claude Code to use OpenRouter free models?

Accepted Answer

You change your project’s `settings.local.json` environment variables to point `ANTHROPIC_BASE_URL` to `https://openrouter.ai/api` and put your OpenRouter key into `ANTHROPIC_AUTH_TOKEN`. The important part: I override every default model variable (Sonnet/Opus/Haiku/small-fast/subagent), otherwise Claude Code can silently fall back to paid Anthropic models and charge you.

Question 7

Do I really need to add money to Anthropic or OpenRouter to use the free setup?

Accepted Answer

Yeah, there’s a small “annoying but real” setup detail. For Anthropic onboarding, you may need to buy the initial $5 credits even if you won’t consume them once you switch to free models. For OpenRouter, loading $5–$10 increases your free-model rate limits (like going from ~50 requests/day to ~1,000 requests/day).

Question 8

When should I use open-source models instead of Opus or Sonnet?

Accepted Answer

I use them for low-stakes or high-volume work: summarizing, grepping a codebase, scaffolding repetitive code, classification/triage, and prep work before handing off to a stronger model. They’re also great when Claude is down or you hit session limits. For high-stakes coding where you can’t mess up, I still prefer top closed-source models.

Ollama + Claude Code = 99% CHEAPER

🛍️ Products Mentioned (7)

Full courses + unlimited support

All my FREE resources

Openrouter Product

Apply for my YT podcast

Work with me

FREE MONTH voice to text

Code NATEHERK for 10% off VPS (annual plan)

About This Video

Frequently Asked Questions

🎬 More from Nate Herk | AI Automation