UbeCc

Follow

Haoran Wang UbeCc

Follow

I am not a beast of burden. I am a LLaMA! 不是牛马是拉马（我不是奶龙） (Junior@Tsinghua University)

51 followers · 89 following

Tsinghua University
Beijing, China
17:33 (UTC +08:00)
ubecwang@gmail.com
@UbecWang

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6.5k 634
THUDM/SWE-Dev THUDM/SWE-Dev Public

SWE-Dev is an open-source SWE agent with a scalable test case construction pipeline. This pipeline synthesizes test cases through a two-step process: generating Gherkin descriptions and correspondi…

Python 18
Generalization-of-Transformers Generalization-of-Transformers Public

[ICLR'25] Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Python 3
Shape-Control-of-DLO Shape-Control-of-DLO Public

Deep Reinforcement Learning spring 24, Tsinghua Univ.

Python 4