Apr 05, 2025

Maverick and Goose!

Llama 4 Maverick:
- 17B active parameters, 128 experts, 400B total parameters.
- 1M token context window.
- Does not fit on a single GPU; runs on one H100 DGX host, or can be distributed across hosts for greater efficiency.
- Outperforms GPT-4o and Gemini 2.0 Flash on coding, reasoning, and multilingual tests at a competitive cost.
- Maintains strong image understanding and grounded reasoning ability.
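
For a rough feel of why 400B total parameters means "one DGX host, not one GPU", here is a back-of-the-envelope sketch. It counts weights only and ignores KV cache and activations. The 80 GB per H100 and 8 GPUs per host figures, and the precisions shown, are assumptions for illustration, not numbers from the post.

```python
# Rough weight-memory estimate for a 400B-parameter mixture-of-experts model.
# Weights only: KV cache, activations, and runtime overhead are ignored.

TOTAL_PARAMS = 400e9    # from the post: 400B total
ACTIVE_PARAMS = 17e9    # from the post: 17B active per token
GPU_MEMORY_GB = 80      # assumption: 80 GB per H100
GPUS_PER_HOST = 8       # assumption: 8 GPUs in one DGX host


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


host_gb = GPU_MEMORY_GB * GPUS_PER_HOST
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

print(f"active share of parameters: {active_fraction:.2%}")
print(f"host memory: {host_gb} GB, single GPU: {GPU_MEMORY_GB} GB")

# Assumed precisions: 2 bytes per parameter (bf16) and 1 byte (fp8).
for name, nbytes in [("bf16", 2), ("fp8", 1)]:
    need = weight_memory_gb(TOTAL_PARAMS, nbytes)
    print(
        f"{name}: {need:.0f} GB of weights | "
        f"fits one GPU: {need <= GPU_MEMORY_GB} | "
        f"fits one host: {need <= host_gb}"
    )
```

Under these assumptions only the 1-byte-per-parameter case fits in a single host's GPU memory, and neither fits on a single GPU. All 400B parameters have to be resident even though only 17B are active per token, which is why the total count drives the hardware requirement.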

This post and comments are published on Nostr.
