coffee-gen-ai

Models

The table below summarizes the models I’ve been tracking, listed roughly by release date, newest first.

| Model Name | Company | Release Date | Open Source | Parameters | Model Repository | Technical Report/Paper | Website |
| --- | --- | --- | --- | --- | --- | --- | --- |
| DeepSeek-R1 | DeepSeek AI | January 2025 | Yes | 671B | GitHub | Paper | DeepSeek |
| o3 | OpenAI | December 2024 | No | - | - | - | ChatGPT |
| Moondream | Vikhyat | January 2025 | Yes | 2B, 0.5B | GitHub | Blog post | Moondream |
| DeepSeek-V3 | DeepSeek AI | December 2024 | Yes | 671B (37B active) | GitHub | Paper | - |
| Gemini 2.0 Flash | Google | December 2024 | No | - | - | - | Gemini |
| Llama 3.3 | Meta | December 2024 | Yes | 70B | GitHub | Model Card | - |
| Qwen 2.5 | Alibaba | September 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Llama 3.2 | Meta | September 2024 | Yes | 1B, 3B, 11B, 90B | GitHub | Model Card | - |
| Llama 3.1 | Meta | July 2024 | Yes | 8B, 70B, 405B | GitHub | Model Card | - |
| Llama 3 | Meta | April 2024 | Yes | 8B, 70B | GitHub | Model Card | - |
| Mixtral 8x22B | Mistral AI | April 2024 | Yes | 141B (39B active) | Model Card | Blog post | - |
| Qwen 2 | Alibaba | June 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Gemma 2 | Google | June 2024 | Yes | 2B, 9B, 27B | GitHub | Paper | - |
| Claude 3.5 Sonnet | Anthropic | June 2024 | No | - | - | - | Claude |
| Aya 23 | Cohere | May 2024 | Yes | 8B, 35B | Aya-23-8B, Aya-23-35B | Paper | Cohere |
| ChatGPT (GPT-4o) | OpenAI | May 2024 | No | 1.76T (rumored) | - | - | ChatGPT |
| Qwen 1.5 | Alibaba | March 2024 | Yes | 0.5B to 72B | GitHub | Paper | - |
| Aya 101 | Cohere | February 2024 | Yes | 13B | Aya-101 | Paper | - |
| Llama 2 | Meta | July 2023 | Yes | 7B, 13B, 70B | GitHub | Model Card | - |
| Mistral 7B | Mistral AI | September 2023 | Yes | 7.3B | GitHub | Paper | - |
| Llama 1 | Meta | February 2023 | Yes | 7B, 13B, 33B, 65B | GitHub | Model Card | - |
| GPT-3 | OpenAI | May 2020 | No | 175B | GitHub | Paper | - |