By provider: gemini 0.1601 EUR, ollama 0.0000 EUR

| Model | Provider | Calls | Input tok | Output tok | Avg latency | Cost |
|---|---|---|---|---|---|---|
| gemini-2.0-flash | gemini | 116 | 1,011,983 | 157,981 | 0 ms | 0.1414 EUR |
| gpt-oss:120b-cloud | ollama | 12 | 273,861 | 42,932 | 44522 ms | 0.0000 EUR |
| kimi-k2:1t-cloud | ollama | 3 | 25,304 | 4,591 | 114610 ms | 0.0000 EUR |
| gpt-oss:20b-cloud | ollama | 2 | 12,186 | 4,107 | 8923 ms | 0.0000 EUR |
| gemini-2.5-pro | gemini | 1 | 6,070 | 1,417 | 33149 ms | 0.0187 EUR |
The model fomo2 picks for each kind of LLM call. Override per task via llm.task_routing in config/default.yaml, or per feed via llm_task.

| Task | Provider | Model | Overridden |
|---|---|---|---|
| dedup | ollama | gpt-oss:20b-cloud | no |
| default | ollama | gpt-oss:120b-cloud | no |
| embedding | gemini | gemini-embedding-001 | no |
| enrich | ollama | gpt-oss:20b-cloud | no |
| goal_default | ollama | gpt-oss:120b-cloud | no |
| goal_pro | ollama | kimi-k2:1t-cloud | no |
| narrative | ollama | gpt-oss:20b-cloud | no |
| prediction_scoring | ollama | qwen3-coder:480b-cloud | no |
| validation | ollama | gpt-oss:120b-cloud | no |
| watcher | ollama | qwen3-coder:480b-cloud | no |
| wizard | ollama | gpt-oss:120b-cloud | no |

See Smart LLM Picker for OpenRouter rankings and recommended Ollama matches.
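
As a rough illustration of how a per-task routing table with overrides might be resolved, here is a minimal Python sketch. It is not fomo2's actual implementation: the `resolve_model` helper, the shape of the `overrides` mapping, and the fallback to the `default` row for unknown tasks are all assumptions made for the example. Only the task, provider, and model names come from the table above.

```python
# Subset of the routing table above: task -> (provider, model).
TASK_ROUTING = {
    "default": ("ollama", "gpt-oss:120b-cloud"),
    "dedup": ("ollama", "gpt-oss:20b-cloud"),
    "embedding": ("gemini", "gemini-embedding-001"),
    "goal_pro": ("ollama", "kimi-k2:1t-cloud"),
    "watcher": ("ollama", "qwen3-coder:480b-cloud"),
}


def resolve_model(task, overrides=None):
    """Hypothetical helper: return ((provider, model), overridden).

    `overrides` stands in for whatever llm.task_routing supplies;
    its real schema is not shown in this document.
    """
    overrides = overrides or {}
    if task in overrides:
        return overrides[task], True
    # Assumption: tasks without their own row fall back to "default".
    return TASK_ROUTING.get(task, TASK_ROUTING["default"]), False


print(resolve_model("dedup"))
print(resolve_model("dedup", {"dedup": ("gemini", "gemini-2.0-flash")}))
print(resolve_model("some_new_task"))
```

The boolean in the result corresponds to the Overridden column: it is true only when the override mapping supplied the entry.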
Exchange rate: 1 USD = 0.86 EUR
| Model | Input (USD per 1M tokens) | Output (USD per 1M tokens) | Embedding (USD per 1M tokens) |
|---|---|---|---|
| claude-haiku-4-5-20251001 | $0.800 | $4.000 | n/a |
| claude-sonnet-4-20250514 | $3.000 | $15.000 | n/a |
| deepseek-v3.1:671b-cloud | free | free | n/a |
| embeddinggemma:300m | free | free | n/a |
| gemini-2.0-flash | $0.100 | $0.400 | n/a |
| gemini-2.0-flash-lite | $0.075 | $0.300 | n/a |
| gemini-2.5-flash | $0.150 | $0.600 | n/a |
| gemini-2.5-pro | $1.250 | $10.000 | n/a |
| gemini-embedding-001 | free | free | $0.150 |
| google/gemini-2.0-flash-exp:free | free | free | n/a |
| gpt-4o | $2.500 | $10.000 | n/a |
| gpt-4o-mini | $0.150 | $0.600 | n/a |
| gpt-oss:120b-cloud | free | free | n/a |
| gpt-oss:20b-cloud | free | free | n/a |
| kimi-k2:1t-cloud | free | free | n/a |
| qwen3-coder:480b-cloud | free | free | n/a |
| text-embedding-004 | free | free | $0.100 |
| text-embedding-3-small | free | free | $0.020 |
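
The Cost column in the usage table can be reproduced from the token counts, the prices, and the exchange rate. The sketch below assumes the prices are quoted per 1M tokens, which is the reading that matches the reported figures. The `cost_eur` helper is hypothetical, as is the choice to treat models missing from the price map as free.

```python
# Prices in USD per 1M tokens (assumed unit), copied from the pricing table.
PRICES_USD_PER_MTOK = {
    "gemini-2.0-flash": (0.100, 0.400),
    "gemini-2.5-pro": (1.250, 10.000),
}
USD_TO_EUR = 0.86  # exchange rate stated above


def cost_eur(model, input_tokens, output_tokens):
    """Hypothetical helper: cost of a model's usage in EUR."""
    # Assumption: models without a price entry are treated as free.
    input_price, output_price = PRICES_USD_PER_MTOK.get(model, (0.0, 0.0))
    usd = (input_tokens * input_price + output_tokens * output_price) / 1_000_000
    return usd * USD_TO_EUR


flash = cost_eur("gemini-2.0-flash", 1_011_983, 157_981)
pro = cost_eur("gemini-2.5-pro", 6_070, 1_417)
print(f"{flash:.4f} EUR")        # 0.1414 EUR
print(f"{pro:.4f} EUR")          # 0.0187 EUR
print(f"{flash + pro:.4f} EUR")  # 0.1601 EUR
```

The two per-model values match the Cost column, and their sum matches the gemini total in the By provider line.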