Skip to content

Popular repositories Loading

  1. Tune-A-Video Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.2k 384

  2. Awesome-Video-Diffusion Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.2k 191

  3. Show-1 Show-1 Public

    Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 1.1k 62

  4. Show-o Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 826 37

  5. MotionDirector MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    Python 797 47

  6. Image2Paragraph Image2Paragraph Public

    [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

    Python 785 53

Repositories

Showing 10 of 65 repositories
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    showlab/Awesome-Video-Diffusion’s past year of commit activity
    3,169 191 0 0 Updated Sep 20, 2024
  • Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    showlab/Show-o’s past year of commit activity
    Python 826 Apache-2.0 37 15 0 Updated Sep 19, 2024
  • Awesome-GUI-Agent Public

    đź’» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

    showlab/Awesome-GUI-Agent’s past year of commit activity
    126 5 1 0 Updated Sep 17, 2024
  • Awesome-MLLM-Hallucination Public

    đź“– A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    showlab/Awesome-MLLM-Hallucination’s past year of commit activity
    367 10 1 0 Updated Sep 15, 2024
  • Awesome-Unified-Multimodal-Models Public

    đź“– This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    showlab/Awesome-Unified-Multimodal-Models’s past year of commit activity
    134 1 0 0 Updated Sep 9, 2024
  • RingID Public
    showlab/RingID’s past year of commit activity
    Python 12 0 1 0 Updated Aug 30, 2024
  • MovieSeq Public

    [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences

    showlab/MovieSeq’s past year of commit activity
    17 0 1 0 Updated Aug 26, 2024
  • MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    showlab/MotionDirector’s past year of commit activity
    Python 797 Apache-2.0 47 20 0 Updated Aug 21, 2024
  • GUI-Narrator Public

    Repository of GUI Action Narrator

    showlab/GUI-Narrator’s past year of commit activity
    JavaScript 2 0 0 0 Updated Aug 16, 2024
  • videollm-online Public

    VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

    showlab/videollm-online’s past year of commit activity
    Python 178 Apache-2.0 24 10 0 Updated Aug 15, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.