Blog posts

2025

LLaMAFactory Code Analysis

14 minute read

Published:

This blog post documents what I learned while reading through the LLaMAFactory code repository.

From RL to RLHF

9 minute read

Published:

Given the widespread adoption of RLHF in large language models (LLMs), this post documents my study of RLHF, starting from reinforcement learning.

Diffusion Model for Sequential Recommendation

5 minute read

Published:

This post collects my reading notes on a series of papers that apply diffusion models to recommendation systems, especially sequential recommendation.