OpenAI o1 and o3

The Best Way to Understand PPO, GRPO, and DPO: 3 Simple Analogies

Recently, with the rise of various reasoning LLMs, terms like GRPO, DPO, and PPO have been popping up more frequently.

Apr 6

O1 Replication Journey Part 3: Procrastination Problem of LLM

Can Inference-Time Scaling Make It Smarter?

Mar 20

O1 Replication Journey Part 2: Let a Great Teacher Guide Students

In my view, any kind of learning boils down to two key elements: training data and training methods. For enhancing LLM reasoning or replicating OpenAI…

Mar 6

O1 Replication Journey Part 1: From Shortcut Hunters to True Explorers

Mastering the LLM Reasoning Exploration Journey, Not Just the Answer

Mar 3

AI Innovations and Insights 30: Agentic Reasoning and Key Features of Claude 3.7 Sonnet

This article is the 30th in this deeply interesting series. In this post, we will explore two mind-opening topics:

Feb 28

AI Innovations and Insights 26: Sky-T1 and MiniRAG

This article is the 26th in this stimulating series. Today, we will explore two fascinating topics in AI, which are:

Feb 15

s1 Explained: Can a $50 LLM Rival DeepSeek-R1?

Since January 2025, the DeepSeek-R1 model has been in the spotlight.

Feb 7

Understanding DeepSeek-R1: Insights and Perspectives

DeepSeek-R1, a recently released LLM with deep reasoning capabilities, is making waves, reminding me of the early days of ChatGPT.

Feb 5

AI Innovations and Insights 24: rStar, SimRAG, and mR2AG

This article is the 24th in this compelling series. Today, we will explore three intriguing topics in AI, which are:

Feb 2

The Roadmap to Reproduce OpenAI o1

An Intelligent Expedition Team

Jan 30

AI Innovations and Insights 23: KAG, AlphaMath, and Offloading

This article is the 23rd in this compelling series. Today, we will explore three intriguing topics in AI, which are:

Jan 28

AI Innovations and Insights 21: Marco-o1, Plan×RAG, and PPTX to MarkDown

This article is the 21st in this compelling series. Today, we will explore three intriguing topics in AI, which are:

Jan 21
