This repository contains the code for training and running inference with ChartCoder.
[2025.1.16] We have released our data generation code `data_generator`, built on Multi-modal-Self-instruct. Please follow their instructions and our code to generate the <chart, code> data pairs.
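For orientation, a <chart, code> pair couples a rendered chart image with the plotting script that produced it. The sketch below is illustrative only; the file layout and naming are hypothetical, and the actual format is defined by `data_generator`:

```python
# Minimal sketch of what a <chart, code> pair looks like; the directory
# layout and file names here are placeholders, not data_generator's format.
import os

chart_code = """
import matplotlib.pyplot as plt

fig, ax = plt.subplots(figsize=(6, 4))
ax.bar(["A", "B", "C"], [3, 7, 5], color="steelblue")
ax.set_title("Example Bar Chart")
ax.set_ylabel("Value")
fig.savefig("pair_0/chart.png", dpi=150)
"""

os.makedirs("pair_0", exist_ok=True)
with open("pair_0/chart.py", "w") as f:
    f.write(chart_code)          # save the code half of the pair
exec(chart_code)                 # render the image half of the pair
```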
- Clone this repo
```bash
git clone https://github.com/thunlp/ChartCoder.git
```
- Create environment
```bash
cd ChartCoder
conda create -n chartcoder python=3.10 -y
conda activate chartcoder
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
```
- Additional packages required for training
```bash
pip install -e ".[train]"
pip install flash-attn --no-build-isolation
```
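To confirm the environment is set up correctly, a quick sanity check (assuming the training extras above were installed; both imports are the standard package names):

```bash
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
python -c "import flash_attn; print(flash_attn.__version__)"
```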
| Model | Download Link |
|---|---|
| MLP Connector | projector |
| ChartCoder | ChartCoder |
The MLP Connector provides our pre-trained MLP projector weights, which you can use directly for SFT.
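Since the code builds on LLaVA-NeXT, the projector weights are typically passed to the SFT launch command via a flag like `--pretrain_mm_mlp_adapter`. The excerpt below is a hypothetical sketch with placeholder paths; the actual flag and script arguments are in `scripts/train/finetune_siglip_a4.sh`:

```bash
# Hypothetical excerpt; see scripts/train/finetune_siglip_a4.sh for the real arguments
deepspeed llava/train/train_mem.py \
    --pretrain_mm_mlp_adapter /path/to/projector/mm_projector.bin \
    ...
```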
| Data | Download Link |
|---|---|
| Chart2Code-160k | Chart2Code-160k TBD |
The whole training process consists of two stages. To train ChartCoder, first download `siglip-so400m-patch14-384` and `deepseek-coder-6.7b-instruct`.
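For example, both base checkpoints can be fetched with `huggingface-cli` (the repo IDs below are the standard Hugging Face ones; adjust the local directories to your setup):

```bash
huggingface-cli download google/siglip-so400m-patch14-384 --local-dir ./siglip-so400m-patch14-384
huggingface-cli download deepseek-ai/deepseek-coder-6.7b-instruct --local-dir ./deepseek-coder-6.7b-instruct
```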
For Pre-training, run
```bash
bash scripts/train/pretrain_siglip.sh
```
For SFT, run
```bash
bash scripts/train/finetune_siglip_a4.sh
```
Please change the model paths to your local paths. See the corresponding `.sh` file for details.
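The variables you will typically need to point at your local copies look like the following; the names here are illustrative, and the actual ones are defined in the scripts under `scripts/train`:

```bash
# Hypothetical excerpt of the kind of paths to adjust before launching training
LLM_PATH=/path/to/deepseek-coder-6.7b-instruct
VISION_TOWER=/path/to/siglip-so400m-patch14-384
DATA_PATH=/path/to/chart2code_160k.json
OUTPUT_DIR=./checkpoints/chartcoder
```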
We also provide other training scripts, such as variants using CLIP (suffixed `_clip`) and multi-machine training (suffixed `_m`). See `scripts/train` for further information.
Please see `inference.py` for details.
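As a rough sketch only: since the code builds on LLaVA-NeXT, inference typically follows the LLaVA-style loading pattern below. The module paths, function signatures, and prompt format are assumptions carried over from LLaVA-NeXT, not the exact ChartCoder API; `inference.py` is the authoritative reference.

```python
# Hypothetical LLaVA-NeXT-style inference sketch; see inference.py for the
# actual entry point and prompt template used by ChartCoder.
import torch
from PIL import Image
from llava.model.builder import load_pretrained_model
from llava.mm_utils import process_images, tokenizer_image_token

model_path = "/path/to/ChartCoder"  # placeholder local checkpoint path
tokenizer, model, image_processor, _ = load_pretrained_model(
    model_path, None, "chartcoder"
)

# Preprocess the input chart image
image = Image.open("chart.png").convert("RGB")
image_tensor = process_images([image], image_processor, model.config)

# Build the prompt; the exact template may differ in inference.py
prompt = "<image>\nGenerate the matplotlib code that reproduces this chart."
input_ids = tokenizer_image_token(prompt, tokenizer, return_tensors="pt")

with torch.inference_mode():
    output_ids = model.generate(
        input_ids.unsqueeze(0).to(model.device),
        images=image_tensor.to(model.device, dtype=torch.float16),
        max_new_tokens=2048,
    )
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```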
Please refer to our paper for detailed performance on the ChartMimic, Plot2Code, and ChartX benchmarks. Thanks to these benchmarks for their contributions to the chart-to-code field.
If you find this work useful, consider giving this repository a star ⭐️ and citing 📝 our paper as follows:
```bibtex
@misc{zhao2025chartcoderadvancingmultimodallarge,
      title={ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation},
      author={Xuanle Zhao and Xianzhen Luo and Qi Shi and Chi Chen and Shuo Wang and Wanxiang Che and Zhiyuan Liu and Maosong Sun},
      year={2025},
      eprint={2501.06598},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2501.06598},
}
```
The code is based on LLaVA-NeXT. Thanks for this great work and for open-sourcing it!