Busy...
Software Development, LLMs
Pinned Loading
-
tezos-reward-distributor-organization/tezos-reward-distributor
tezos-reward-distributor-organization/tezos-reward-distributor PublicTezos Reward Distributor (TRD): A reward distribution software for tezos bakers.
-
reinforcement-learning-an-introduction
reinforcement-learning-an-introduction PublicSolutions to Sutton and Barto book exercises
-
-
qlora_templates
qlora_templates PublicForked from jondurbin/qlora
QLoRA: Efficient Finetuning of Quantized LLMs (Uses Huggingface Chat Templates)
Python
-
cs330-2021-stanford-meta-learning-hw-answers
cs330-2021-stanford-meta-learning-hw-answers Publichttp://cs330.stanford.edu/fall2021 coding hw answers.
Python 3
-
berkeley_rl_hw_answers
berkeley_rl_hw_answers PublicForked from berkeleydeeprlcourse/homework_fall2022
My answers to Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
Jupyter Notebook 2
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.