-
Sea AI Lab @sail-sg
- Singapore
- https://lkevinzc.github.io/
- @zzlccc
Pinned Loading
-
mosecorg/mosec
mosecorg/mosec PublicA high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
-
sail-sg/oat
sail-sg/oat Public🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
-
sail-sg/understand-r1-zero
sail-sg/understand-r1-zero PublicUnderstanding R1-Zero-Like Training: A Critical Perspective
-
sail-sg/oat-zero
sail-sg/oat-zero PublicA lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
-
sail-sg/dice
sail-sg/dice PublicOfficial implementation of Bootstrapping Language Models via DPO Implicit Rewards
-
sail-sg/rosmo
sail-sg/rosmo PublicCodes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Python 28
If the problem persists, check the GitHub status page or contact support.