Posts by Tags

cool posts

LLaMAFactory Code Analysis

14 minute read

Published: July 30, 2025

This blog post documents what I learned while reading through the LLaMAFactory code repository.

From RL to RLHF

9 minute read

Published: July 09, 2025

Given the widespread adoption of RLHF in large language models (LLMs), this post documents my study of RLHF, starting from reinforcement learning.

Diffusion Model for Sequential Recommendation

5 minute read

Published: March 25, 2025

This post collects my reading notes on a series of papers that apply diffusion models to recommendation systems, especially sequential recommendation.