Towards Efficient and Effective Deep Model-based Reinforcement Learning
Yuping Luo
PhD thesis
[link]
Towards Learning to Play Piano with Dexterous Hands and Touch
Huazhe Xu, Yuping Luo, Shaoxiong Wang, Trevor Darrell, Roberto Calandra
IROS 2022
[paper]
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo, Tengyu Ma
NeurIPS 2021, also ICML 2021 RL4RealLife Workshop
[paper] [code]
Safe Reinforcement Learning by Imagining the Near Future
Garrett Thomas, Yuping Luo, Tengyu Ma
NeurIPS 2021
[paper] [code]
Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
($\alpha$-$\beta$) Zhiyuan Li, Yuping Luo, Kaifeng Lyu
ICLR 2021
[paper]
Bootstrapping the Expressivity with Model-based Planning
Kefan Dong*, Yuping Luo*, Tengyu Ma
ICML 2020
[paper] [code]
Provable Representation Learning for Imitation Learning via Bi-level Optimization
($\alpha$-$\beta$) Sanjeev Arora, Simon S. Du, Sham Kakade, Yuping Luo, Nikunj Saunshi
ICML 2020
[paper]
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Yuping Luo, Huazhe Xu, Tengyu Ma
ICLR 2020
[paper] [slides]
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
($\alpha$-$\beta$) Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang
NeurIPS 2019
[paper]
Implicit Regularization in Deep Matrix Factorization
($\alpha$-$\beta$) Sanjeev Arora, Nadav Cohen, Wei Hu, Yuping Luo
NeurIPS 2019 (spotlight)
[paper] [code]
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Yuping Luo*, Huazhe Xu*, Yuanzhi Li, Yuandong Tian, Trevor Darrell, Tengyu Ma
ICLR 2019
[paper] [code]
Learning Online Alignments with Continuous Rewards Policy Gradient
Yuping Luo, Chung-Cheng Chiu, Navdeep Jaitly, Ilya Sutskever
ICASSP 2017
[paper]