coderonion

214 followers · 1.8k following

awesome-cuda-triton-hpc Public

🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR and High Performance Computing (HPC) projects.

awesome hpc gpu cuda pytorch cublas triton

212 24 Updated Feb 27, 2025
awesome-llm-and-aigc Public

🚀🚀🚀 A collection of some awesome public projects about Large Language Model (LLM), Vision Language Model (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), the related Datasets and Applica…

computer-vision cuda openai yolo triton awesome-list llama

609 57 Updated Feb 27, 2025
DeepGEMM Public
Forked from deepseek-ai/DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 1 MIT License Updated Feb 27, 2025
FlashMLA Public
Forked from deepseek-ai/FlashMLA

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 1 MIT License Updated Feb 27, 2025
awesome-deepseek-integration Public
Forked from deepseek-ai/awesome-deepseek-integration

1 Creative Commons Zero v1.0 Universal Updated Feb 21, 2025
coderonion Public

1 Updated Feb 20, 2025
awesome-yolo-object-detection Public

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

gui cuda yolo awesome-list llama object-detection datasets

1,395 197 Updated Feb 20, 2025
VLM-R1 Public
Forked from om-ai-lab/VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 1 Updated Feb 20, 2025
X-AnyLabeling Public
Forked from CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 1 GNU General Public License v3.0 Updated Feb 19, 2025
edgeyolo Public
Forked from LSH9832/edgeyolo

an edge-real-time anchor-free object detector with decent performance

Python 1 Apache License 2.0 Updated Feb 19, 2025
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 1 Other Updated Feb 19, 2025
NuMojo Public
Forked from Mojo-Numerics-and-Algorithms-group/NuMojo

NuMojo is a library for numerical computing in Mojo 🔥 similar to numpy in Python.

Mojo 2 Apache License 2.0 Updated Feb 15, 2025
OpenSeek Public
Forked from FlagAI-Open/OpenSeek

OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.

1 Updated Feb 14, 2025
ktransformers Public
Forked from kvcache-ai/ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 1 Apache License 2.0 Updated Feb 13, 2025
minimind Public
Forked from jingyaogong/minimind

🚀🚀 "Large model": train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 1 Apache License 2.0 Updated Feb 12, 2025
deepscaler Public
Forked from agentica-project/deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1 MIT License Updated Feb 11, 2025
minimind-v Public
Forked from jingyaogong/minimind-v

🚀 "Large model": train a 27M-parameter multimodal VLM from scratch in just 3 hours! 🌏

Python 1 Apache License 2.0 Updated Feb 10, 2025
unsloth Public
Forked from unslothai/unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory

Python 1 Apache License 2.0 Updated Feb 9, 2025
TensorRT-YOLO Public
Forked from laugh12321/TensorRT-YOLO

🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️

C++ 1 GNU General Public License v3.0 Updated Feb 7, 2025
CUDA-Learn-Notes Public
Forked from DefTruth/CUDA-Learn-Notes

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1 GNU General Public License v3.0 Updated Feb 7, 2025
maestro Public
Forked from roboflow/maestro

streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL

Python 1 Apache License 2.0 Updated Feb 6, 2025
ultralyticsPro Public
Forked from iscyy/ultralyticsPro

🔥🔥🔥 Focused on improved models for YOLO11, YOLOv8, YOLOv10, RT-DETR, YOLOv7, and YOLOv5. Supports improving the backbone, neck, head, loss, IoU, NMS, and other modules 🚀

Python 1 GNU Affero General Public License v3.0 Updated Feb 6, 2025
Logic-RL Public
Forked from Unakar/Logic-RL

Python 1 Apache License 2.0 Updated Feb 5, 2025
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 1 Apache License 2.0 Updated Feb 4, 2025
R1-V Public
Forked from Deep-Agent/R1-V

Witness the aha moment of VLM with less than $3.

Python 1 Updated Feb 4, 2025
a-hamdi-cuda Public
Forked from a-hamdi/GPU

100 days of building Cuda kernels!

Cuda 1 MIT License Updated Feb 3, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero

Clean, accessible reproduction of DeepSeek R1-Zero

Python 1 Apache License 2.0 Updated Jan 30, 2025
llama-cpp-python Public
Forked from abetlen/llama-cpp-python

Python bindings for llama.cpp

Python 1 MIT License Updated Jan 29, 2025
open-r1 Public
Forked from huggingface/open-r1

Fully open reproduction of DeepSeek-R1

Python 1 Apache License 2.0 Updated Jan 27, 2025
tilelang Public
Forked from tile-ai/tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU kernels

C 1 MIT License Updated Jan 21, 2025
