coffee-gen-ai

Research papers

structure

Research Updates/

Reinforcement Learning from Human Feedback

Agent Frameworks:

Chain of Thought / Thinking

Human and Agent interaction

Agent Computer Interfaces (ACI)

Multi-Agent

Debate

Evaluation

LLM evaluators (LLM-as-a-Judge)

LLM-as-a-Judge, whether used as a self-evaluator or as an evaluator of another LLM’s generations, is a topic that has proven useful in the following scenarios:

Here are some interesting papers on this topic:

Evaluation of LLMs

AI CUDA Engineer

models

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models by DeepSeek, Jan 2024

Bite: How Deepseek R1 was trained by Philipp Schmid, Jan 2025

The Illustrated DeepSeek-R1 by Jay Alammar, Jan 2025

Based on “The Rise and Evolution of RAG in 2024: A Year in Review” by RAGFlow (Dec 2024) and other observations:

Based on “Recent advancements in large language models (LLMs) and their applications” (LinkedIn, Apr 2025 - likely referencing late 2024/early 2025 developments):