Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

LLaMAFactory Code Analysis

14 minute read

Published:

This blog post documents what I learned while reading through the LLaMAFactory code repository.

From RL to RLHF

9 minute read

Published:

Given the widespread adoption of RLHF in large language models (LLMs), this post documents my study of RLHF, starting from reinforcement learning.

Diffusion Model for Sequential Recommendation

5 minute read

Published:

This post collects my reading notes on a series of papers that apply diffusion models to recommender systems, especially sequential recommendation.

Portfolio

Publications

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.