benchmarks

[TVM][BUGFIX] Change graph_runtime to graph_executor & Fix MosesNorma…

Apr 14, 2021

5ff0519 · Apr 14, 2021

Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md	Fix Benchmark (#1471 )	Jan 11, 2021
benchmark_gluonnlp.py	benchmark_gluonnlp.py	Add AMP + Update Benchmarking Script (#1405 )	Nov 6, 2020
benchmark_gluonnlp.sh	benchmark_gluonnlp.sh	[Numpy] Benchmark the backbone models + Some fixes + Always use pytho…	Aug 14, 2020
benchmark_gluonnlp_fp16.sh	benchmark_gluonnlp_fp16.sh	Add AMP + Update Benchmarking Script (#1405 )	Nov 6, 2020
benchmark_gluonnlp_tvm.sh	benchmark_gluonnlp_tvm.sh	[TVM] Add TVM Support (#1390 )	Oct 24, 2020
benchmark_hf.py	benchmark_hf.py	Fix an issue introduced in the previous benchmark fix PR (#1462 )	Dec 29, 2020
benchmark_utils.py	benchmark_utils.py	[TVM][BUGFIX] Change graph_runtime to graph_executor & Fix MosesNorma…	Apr 14, 2021
requirements.txt	requirements.txt	[Numpy] Benchmark the backbone models + Some fixes + Always use pytho…	Aug 14, 2020
run_backbone_benchmark.sh	run_backbone_benchmark.sh	Fix Benchmark (#1471 )	Jan 11, 2021

README.md

Benchmarking the Performance of NLP Backbones

We benchmark the latency and peak memory usage of a single training (forward + backward) and inference (forward-only) step of the NLP backbones. For comparison, we also provide the numbers of the models in huggingface.

Backbones in HuggingFace

We use the huggingface benchmark to benchmark the training + inference speed of common workloads in NLP.

python3 -m pip install -U -r requirements.txt
python3 benchmark_hf.py

It will generate a list of csv files:

├── pytorch_train_fp32.csv
├── pytorch_train_fp16.csv
├── pytorch_infer_fp32.csv
├── pytorch_infer_fp16.csv
├── pytorch_infer_fp32_ts.csv

GluonNLP Backbones based on MXNet-2.0

We profile three options: NT layout, NT layout with TN layout as the compute layout, and TN layout.

python3 -m pip install -U -r requirements.txt
bash benchmark_gluonnlp.sh

It will generate csv files with gluonnlp_ as the prefix

├── gluonnlp_train_fp32_NT_NT.csv
├── gluonnlp_train_fp32_NT_TN.csv
├── gluonnlp_train_fp32_TN_TN.csv
├── gluonnlp_infer_fp32_NT_NT_tvm0.csv
├── gluonnlp_infer_fp32_NT_TN_tvm0.csv
├── gluonnlp_infer_fp32_TN_TN_tvm0.csv

GluonNLP + TVM for Inference

Install TVM as described in https://tvm.apache.org/docs/install/index.html

bash benchmark_gluonnlp_tvm.sh

├── gluonnlp_infer_fp32_NT_NT_tvm1.csv
├── gluonnlp_infer_fp32_NT_TN_tvm1.csv
├── gluonnlp_infer_fp32_TN_TN_tvm1.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

benchmarks

benchmarks

README.md

Benchmarking the Performance of NLP Backbones

Backbones in HuggingFace

GluonNLP Backbones based on MXNet-2.0

GluonNLP + TVM for Inference

Generate the Benchmark Report

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmarking the Performance of NLP Backbones

Backbones in HuggingFace

GluonNLP Backbones based on MXNet-2.0

GluonNLP + TVM for Inference

Generate the Benchmark Report