Pinned Loading
Repositories
- grps_trtllm Public
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.
- HSTU-Tensorflow Public
- grps Public
Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming modes. It is dual-language compatible with Python and C++, offering scalability, extensibility, and high performance. It helps users quickly deploy models and provide services through HTTP/RPC interfaces.
- ControlTalk Public
Official code for "Controllable Talking Face Generation by Implicit Facial Keypoints Editing"
People
This organization has no public members. You must be a member to see who’s a part of this organization.