Meta has unveiled its latest addition to the Llama family of generative AI models: Llama 3.3 70B.
This new model marks another significant advance in open-source LLMs, delivering impressive performance at a lower computational cost.
Llama 3.3 70B running on a Mac Studio M2 Max (quantized) at a reasonable 5.5 tokens per second:
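To put that throughput in perspective, here is a quick back-of-the-envelope calculation (the 0.75 words-per-token ratio is a common rule of thumb, not a measured value):

```python
# Rough throughput math for local generation at 5.5 tokens/s.
# The 0.75 words-per-token ratio is an approximation, not a measured value.
tokens_per_second = 5.5
words_per_token = 0.75

response_tokens = 500  # a medium-length answer
seconds = response_tokens / tokens_per_second
print(f"~{response_tokens} tokens ≈ {response_tokens * words_per_token:.0f} words "
      f"in about {seconds:.0f} seconds ({seconds / 60:.1f} min)")
```

So a 500-token answer takes roughly a minute and a half: slow compared to a hosted API, but entirely usable for local, private workloads.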
Key Features and Availability
- 70 billion parameter model, optimized for efficiency
- Matches the performance of the larger Llama 3.1 405B model on many tasks
- Available for download from Hugging Face and Meta’s official Llama website
- Supports multilingual text inputs and outputs
- 128K token context window
- Employs Grouped-Query Attention (GQA) for enhanced scalability
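The last two features are related: GQA is a big part of what makes a 128K-token context window practical, because sharing key/value heads across groups of query heads shrinks the KV cache. A minimal sketch of the arithmetic, using the published Llama 3 70B architecture figures (80 layers, 64 query heads, 8 KV heads, head dimension 128) as illustrative numbers:

```python
# Sketch of why GQA matters for the 128K context window: it shrinks the KV cache.
# Architecture numbers (80 layers, 64 query heads, 8 KV heads, head dim 128)
# follow the published Llama 3 70B config; treat them as illustrative here.
LAYERS, Q_HEADS, KV_HEADS, HEAD_DIM = 80, 64, 8, 128
BYTES = 2  # fp16/bf16 per cached value
CONTEXT = 128 * 1024  # 128K tokens

def kv_cache_gib(kv_heads: int) -> float:
    """KV cache size in GiB for a full context window."""
    # 2x for keys and values, per layer, per KV head, per head-dim element.
    return 2 * LAYERS * kv_heads * HEAD_DIM * BYTES * CONTEXT / 2**30

print(f"Full MHA (64 KV heads): {kv_cache_gib(Q_HEADS):.0f} GiB")
print(f"GQA       (8 KV heads): {kv_cache_gib(KV_HEADS):.0f} GiB")
```

With full multi-head attention the cache for a maxed-out context would be 8x larger; GQA brings it down to a size that fits alongside the quantized weights on high-memory consumer hardware.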
Takeaways and Insights
- Efficiency Breakthrough: Llama 3.3 70B delivers performance comparable to the 405B model at a significantly lower cost, marking a major advancement in AI efficiency.
- Competitive Performance: The model outperforms several industry competitors, including Google’s Gemini 1.5 Pro and OpenAI’s GPT-4o, on various benchmarks.
- Open Ecosystem: Meta continues to push for an open AI ecosystem, making Llama models widely available for research and commercial use.
- Improved Capabilities: The model shows enhancements in reasoning, coding, and instruction following, making it versatile for various applications.
- Ongoing Development: This release is part of Meta’s broader strategy, with plans for even larger models and new capabilities in the future.
- Ethical Considerations: The model incorporates built-in safeguards like Llama Guard 3 and Prompt Guard to prevent misuse.
- Resource Requirements: Despite the efficiency gains, running the model locally still demands substantial hardware, including at least 64GB of RAM.
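The 64GB figure follows from simple arithmetic on the parameter count. A hedged sketch (real quantization formats such as GGUF add some per-block overhead on top of these idealized numbers):

```python
# Back-of-the-envelope RAM estimate for the model weights at different
# quantization levels. Real formats (e.g. GGUF) add per-block overhead,
# and inference also needs memory for the KV cache and runtime buffers.
params = 70e9  # 70 billion parameters

def weights_gb(bits_per_param: float) -> float:
    """Approximate weight memory in GB at a given quantization level."""
    return params * bits_per_param / 8 / 1e9

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit: ~{weights_gb(bits):.0f} GB of weights")
```

At full 16-bit precision the weights alone need ~140GB, which is why local setups rely on 4-bit quantization (~35GB of weights) to fit within a 64GB machine like the Mac Studio shown above.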