Commit 9e36ffc

Update Speed Benchmark.
1 parent f89ecaa commit 9e36ffc

1 file changed, +46 -1 lines changed

recognition/arcface_torch/README.md

@@ -45,6 +45,31 @@ CUDA_VISIBLE_DEVICES=0,1 python eval_ijbc.py \
 ```
 More details see [eval.md](docs/eval.md) in docs.

+## Speed Benchmark
+![Image text](https://github.com/nttstar/insightface-resources/blob/master/images/partial_fc_speed.png)
+
+ArcFace_torch can train efficiently on large-scale face recognition datasets.
+When the training set has more than 300K classes and the model is trained long enough,
+the partial FC sampling strategy reaches the same accuracy while training several times faster and using less GPU memory.
+
+1. Training speed of different parallel methods
+
+| Method | bs128, r100, 2M identities | bs128, r50, 4M identities | bs64, r50, 8M identities |
+| :---: | :--- | :--- | :--- |
+| data parallel | 1 | 1 | 1 |
+| model parallel | 1362 | 1600 | 482 |
+| fp16 + model parallel | 2006 | 2165 | 767 |
+| fp16 + partial fc 0.1 | 3247 | 4385 | 3001 |
+
+2. GPU memory of different parallel methods
+
+| Method | bs128, r100, 2M identities | bs128, r50, 4M identities | bs64, r50, 8M identities |
+| :---: | :--- | :--- | :--- |
+| data parallel | OOM | OOM | OOM |
+| model parallel | 27382 | 30322 | 32182 |
+| fp16 + model parallel | 20310 | 26622 | 32182 |
+| fp16 + partial fc 0.1 | 11987 | 10890 | 11108 |
+

 ## Model Zoo

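As a reading aid for the two tables added in this hunk, the relative gains can be worked out directly from the reported numbers. The sketch below is illustrative only: the dictionary names and the `ratio` helper are hypothetical, the README states no units for either table, and the ratios are unit-free. The data-parallel row is left out because it reports `1` for speed and `OOM` for memory.

```python
# Numbers copied from the Speed Benchmark tables; each list holds the
# three table columns from left to right.
speed = {
    "model parallel": [1362, 1600, 482],
    "fp16 + model parallel": [2006, 2165, 767],
    "fp16 + partial fc 0.1": [3247, 4385, 3001],
}
memory = {
    "model parallel": [27382, 30322, 32182],
    "fp16 + model parallel": [20310, 26622, 32182],
    "fp16 + partial fc 0.1": [11987, 10890, 11108],
}


def ratio(table, method, baseline="model parallel"):
    """Column-wise ratio of one row to the baseline row, rounded to 2 decimals."""
    return [round(m / b, 2) for m, b in zip(table[method], table[baseline])]


# Speed relative to plain model parallel (higher means faster).
speedup = ratio(speed, "fp16 + partial fc 0.1")
# Memory relative to plain model parallel (lower means less memory).
mem_fraction = ratio(memory, "fp16 + partial fc 0.1")

print("speedup:", speedup)
print("memory fraction:", mem_fraction)
```

Under these numbers, `fp16 + partial fc 0.1` comes out roughly 2.4x to 6.2x faster than model parallel while using roughly 35% to 44% of its memory, with the largest gain in the last column (the largest identity count).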
@@ -68,4 +93,24 @@ All Model Can be found in here.
 | Glint360k-Cosface |[log](https://raw.githubusercontent.com/anxiangsir/insightface_arcface_log/master/glint360k_cosface_r100/training.log) |r100 | 96.19 | 97.39 | 98.52 | 99.26 | 99.83 |
 | Glint360k-Cosface |[log](https://raw.githubusercontent.com/anxiangsir/insightface_arcface_log/master/glint360k_cosface_r100_fp16_0.1/training.log)|r100-fp16-sample-0.1 | 95.95 | 97.35 | 98.57 | 99.30 | 99.85 |

-More details see [eval.md](docs/modelzoo.md) in docs.
+More details see [eval.md](docs/modelzoo.md) in docs.
+
+
+
+## Citation
+```
+@inproceedings{deng2019arcface,
+title={Arcface: Additive angular margin loss for deep face recognition},
+author={Deng, Jiankang and Guo, Jia and Xue, Niannan and Zafeiriou, Stefanos},
+booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
+pages={4690--4699},
+year={2019}
+}
+@inproceedings{an2020partical_fc,
+title={Partial FC: Training 10 Million Identities on a Single Machine},
+author={An, Xiang and Zhu, Xuhan and Xiao, Yang and Wu, Lan and Zhang, Ming and Gao, Yuan and Qin, Bin and
+Zhang, Debing and Fu, Ying},
+booktitle={arXiv:2010.05222},
+year={2020}
+}
+```
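The `partial fc 0.1` rows in the benchmark refer to the sampling strategy of the Partial FC paper cited above: at each step only a fraction of the class centers takes part in the softmax, and the classes present in the batch are always kept. A minimal sketch of that selection step follows, assuming plain Python lists rather than the repository's distributed GPU implementation. `sample_class_centers` and its parameters are hypothetical names chosen for illustration, not the project's API.

```python
import random


def sample_class_centers(labels, num_classes, sample_rate, rng=None):
    """Pick the class indices that take part in one training step.

    Every class that appears in `labels` (the positives) is kept, and random
    negatives fill the rest up to int(sample_rate * num_classes) classes.
    Returns the sorted sampled indices and the labels remapped into them.
    """
    rng = rng or random.Random()
    positives = set(labels)
    # Never drop a positive, even if the batch has more classes than the budget.
    num_sample = max(int(sample_rate * num_classes), len(positives))
    negatives = [c for c in range(num_classes) if c not in positives]
    chosen = positives | set(rng.sample(negatives, num_sample - len(positives)))
    index = sorted(chosen)
    # Labels must point into the reduced set of class centers.
    remap = {c: i for i, c in enumerate(index)}
    return index, [remap[y] for y in labels]


# Toy example: 1000 classes, 10% sampling, a batch of four labels.
index, new_labels = sample_class_centers(
    [7, 42, 7, 99], num_classes=1000, sample_rate=0.1, rng=random.Random(0)
)
print(len(index), new_labels)
```

With `sample_rate=0.1` only a tenth of the class centers enters the softmax at each step, which is the intuition behind the lower memory figures reported for that row.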
