AI Exploration Journey
Subscribe
Sign in
Home
RAG
Document Parsing
AI Innovations and Insights
LLM Reasoning
LLMs
Data Structures
Archive
Leaderboard
About
OpenAI o1 and o3
Latest
Top
Discussions
The Best Way to Understand PPO, GRPO, and DPO: 3 Simple Analogies
Recently, with the rise of various reasoning LLMs, terms like GRPO, DPO, and PPO have been popping up more frequently.
Apr 6
•
Florian
10
Share this post
AI Exploration Journey
The Best Way to Understand PPO, GRPO, and DPO: 3 Simple Analogies
Copy link
Facebook
Email
Notes
More
O1 Replication Journey Part 3: Procrastination Problem of LLM
Can Inference-Time Scaling Make It Smarter?
Mar 20
•
Florian
2
Share this post
AI Exploration Journey
O1 Replication Journey Part 3: Procrastination Problem of LLM
Copy link
Facebook
Email
Notes
More
O1 Replication Journey Part 2: Let a Great Teacher Guide Students
In my view, any kind of learning boils down to two key elements: training data and training methods. For enhancing LLM reasoning or replicating OpenAI…
Mar 6
•
Florian
7
Share this post
AI Exploration Journey
O1 Replication Journey Part 2: Let a Great Teacher Guide Students
Copy link
Facebook
Email
Notes
More
O1 Replication Journey Part 1: From Shortcut Hunters to True Explorers
Mastering the LLM Reasoning Exploration Journey, Not Just the Answer
Mar 3
7
Share this post
AI Exploration Journey
O1 Replication Journey Part 1: From Shortcut Hunters to True Explorers
Copy link
Facebook
Email
Notes
More
AI Innovations and Insights 30: Agentic Reasoning and Key Features of Claude 3.7 Sonnet
This article is the 30th in this deeply interesting series. In this post, we will explore two mind-opening topics:
Feb 28
7
Share this post
AI Exploration Journey
AI Innovations and Insights 30: Agentic Reasoning and Key Features of Claude 3.7 Sonnet
Copy link
Facebook
Email
Notes
More
AI Innovations and Insights 26: Sky-T1 and MiniRAG
This article is the 26th in this stimulating series. Today, we will explore two fascinating topics in AI, which are:
Feb 15
5
Share this post
AI Exploration Journey
AI Innovations and Insights 26: Sky-T1 and MiniRAG
Copy link
Facebook
Email
Notes
More
s1 Explained: Can a $50 LLM Rival DeepSeek-R1?
Since January 2025, the DeepSeek-R1 model has been in the spotlight.
Feb 7
•
Florian
5
Share this post
AI Exploration Journey
s1 Explained: Can a $50 LLM Rival DeepSeek-R1?
Copy link
Facebook
Email
Notes
More
1
Understanding DeepSeek-R1: Insights and Perspectives
DeepSeek-R1, a recently released LLM with deep reasoning capabilities, is making waves—reminding me of the early days of ChatGPT.
Feb 5
•
Florian
9
Share this post
AI Exploration Journey
Understanding DeepSeek-R1: Insights and Perspectives
Copy link
Facebook
Email
Notes
More
AI Innovations and Insights 24: rStar, SimRAG, and mR2AG
This article is the 24th in this compelling series. Today, we will explore three intriguing topics in AI, which are:
Feb 2
4
Share this post
AI Exploration Journey
AI Innovations and Insights 24: rStar, SimRAG, and mR2AG
Copy link
Facebook
Email
Notes
More
The Roadmap to Reproduce OpenAI o1
An Intelligent Expedition Team
Jan 30
•
Florian
3
Share this post
AI Exploration Journey
The Roadmap to Reproduce OpenAI o1
Copy link
Facebook
Email
Notes
More
AI Innovations and Insights 23: KAG, AlphaMath, and Offloading
This article is the 23rd in this compelling series. Today, we will explore three intriguing topics in AI, which are:
Jan 28
2
Share this post
AI Exploration Journey
AI Innovations and Insights 23: KAG, AlphaMath, and Offloading
Copy link
Facebook
Email
Notes
More
AI Innovations and Insights 21: Marco-o1, Plan×RAG, and PPTX to MarkDown
This article is the 21st in this compelling series. Today, we will explore three intriguing topics in AI, which are:
Jan 21
4
Share this post
AI Exploration Journey
AI Innovations and Insights 21: Marco-o1, Plan×RAG, and PPTX to MarkDown
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts