Reinforcement Learning via Value Gradient Flow
Haoran Xu*¹, Kaiwen Hu*², Somayeh Sojoudi², Amy Zhang¹
¹UT Austin, ²UC Berkeley
* Equal contribution
TLDR: Scalable and sample-efficient RL finetuning with generative models using Value Gradient Flow.