The following table summarizes the models I’ve been tracking, listed roughly in reverse chronological order of release date.

| Model Name | Company | Release Date | Open Source | Parameters | Model Repository | Technical Report/Paper | Website |
|---|---|---|---|---|---|---|---|
| DeepSeek-R1 | DeepSeek AI | January 2025 | Yes | 671B | GitHub | Paper | DeepSeek |
| o3 | OpenAI | December 2024 | No | - | - | - | ChatGPT |
| Moondream | Vikhyat | January 2025 | Yes | 2B, 0.5B | GitHub | Blog post | Moondream |
| DeepSeek-V3 | DeepSeek AI | December 2024 | Yes | 671B (37B active) | GitHub | Paper | - |
| Gemini 2.0 Flash | Google | December 2024 | No | - | - | - | Gemini |
| Llama 3.3 | Meta | December 2024 | Yes | 70B | GitHub | Model Card | - |
| Qwen 2.5 | Alibaba | September 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Llama 3.2 | Meta | September 2024 | Yes | 1B, 3B, 11B, 90B | GitHub | Model Card | - |
| Llama 3.1 | Meta | July 2024 | Yes | 8B, 70B, 405B | GitHub | Model Card | - |
| Llama 3 | Meta | April 2024 | Yes | 8B, 70B | GitHub | Model Card | - |
| Mixtral 8x22B | Mistral AI | April 2024 | Yes | 141B (39B active) | Model Card | Blog post | - |
| Qwen 2 | Alibaba | June 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Gemma 2 | Google | June 2024 | Yes | 2B, 9B, 27B | GitHub | Paper | - |
| Claude 3.5 Sonnet | Anthropic | June 2024 | No | - | - | - | Claude |
| Aya 23 | Cohere | May 2024 | Yes | 8B, 35B | Aya-23-8B Aya-23-35B | Paper | Cohere |
| ChatGPT (GPT-4o) | OpenAI | May 2024 | No | 1.76T (rumored) | - | - | ChatGPT |
| Qwen 1.5 | Alibaba | March 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Aya 101 | Cohere | February 2024 | Yes | 13B | Aya-101 | Paper | - |
| Llama 2 | Meta | July 2023 | Yes | 7B, 13B, 70B | GitHub | Model Card | - |
| Mistral 7B | Mistral AI | September 2023 | Yes | 7.3B | GitHub | Paper | - |
| Llama 1 | Meta | February 2023 | Yes | 7B, 13B, 33B, 65B | GitHub | Model Card | - |
| GPT-3 | OpenAI | May 2020 | No | 175B | GitHub | Paper | - |