Personal Website

Hi, I’m Jiayi Fu.

I am a researcher working on large language models, reinforcement learning, and diffusion models. My biggest dream is to build a general-purpose agent that can learn to do anything.


3 Papers
3 Years of Research

Jiayi Fu

Researcher · Engineer · Pianist

Profile

Who I Am

I am a first-year PhD student at INSAIT, supervised by Prof. Yuxia Wang. I am also an amateur pianist, and I enjoy watching football matches.

Research Interests

Large language models, reinforcement learning, and diffusion models.

Highlights

Empty for now

Education

Academic Background

2026 — Present

Ph.D. in Computer Science

INSAIT

Focus: Large language models, reinforcement learning, and diffusion models.

2022 — 2025

M.S. in Computer Science

Fudan University

Focus: Natural language processing, reinforcement learning, and watermarking.

2018 — 2022

B.Eng. in Computer Science and Technology

Harbin Institute of Technology

Coursework: Operating systems, database systems, compilers, computer networks, etc.

Papers

Selected Publications

ACL 2024

GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick

Jiayi Fu, Xuandong Zhao, et al.

We propose GumbelSoft, a novel language model watermarking method that leverages the GumbelMax-trick.

Paper Code

ICML 2025 R2-FM Workshop

Reward Shaping to Mitigate Reward Hacking in RLHF

Jiayi Fu, Xuandong Zhao, et al.

We propose PAR, a reward shaping method that mitigates reward hacking in RLHF.

Paper Code

Experience

Research & Industry Experience

2024.6 — 2025.2

Research Intern

StepFun

Worked on LLM post-training.

2022.9 — 2023.6

Teaching Assistant

Fudan University

Worked as a teaching assistant for the course "Operating Systems".

Media

Images, Video, and Audio

Sofia Photography

National Gallery.

Ordinary Road

The road of an ordinary person.

Sunset Road

A comfortable song for driving.

Visitor Map

Where Visitors Come From

Contact

Let’s Connect

Email Me