Skip to content

Commit 2712566

Browse files
committed
Use TF v2.3 and docker container for 2.3 across the board
1 parent 7946265 commit 2712566

File tree

21 files changed

+184
-184
lines changed

21 files changed

+184
-184
lines changed

benchmarks/image_recognition/tensorflow/densenet169/README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -58,7 +58,7 @@ following modes/precisions:
5858
--batch-size 100 \
5959
--socket-id 0 \
6060
--in-graph /home/<user>/densenet169_fp32_pretrained_model.pb \
61-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
61+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
6262
-- input_height=224 input_width=224 warmup_steps=20 steps=100 \
6363
input_layer="input" output_layer="densenet169/predictions/Reshape_1"
6464
```
@@ -74,7 +74,7 @@ following modes/precisions:
7474
--batch-size 1 \
7575
--socket-id 0 \
7676
--in-graph /home/<user>/densenet169_fp32_pretrained_model.pb \
77-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
77+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
7878
-- input_height=224 input_width=224 warmup_steps=20 steps=100 \
7979
input_layer="input" output_layer="densenet169/predictions/Reshape_1"
8080
```
@@ -91,7 +91,7 @@ following modes/precisions:
9191
--batch-size 100 \
9292
--socket-id 0 \
9393
--in-graph /home/<user>/densenet169_fp32_pretrained_model.pb \
94-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
94+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
9595
--data-location /home/<user>/imagenet_validation_dataset \
9696
-- input_height=224 input_width=224 \
9797
input_layer="input" output_layer="densenet169/predictions/Reshape_1"

benchmarks/image_recognition/tensorflow/inceptionv3/README.md

+8-8
Original file line numberDiff line numberDiff line change
@@ -97,7 +97,7 @@ python launch_benchmark.py \
9797
--framework tensorflow \
9898
--accuracy-only \
9999
--batch-size 100 \
100-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
100+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
101101
--in-graph /home/<user>/inceptionv3_int8_pretrained_model.pb \
102102
--data-location /home/<user>/datasets/ImageNet_TFRecords
103103
```
@@ -118,7 +118,7 @@ python launch_benchmark.py \
118118
--benchmark-only \
119119
--batch-size 1 \
120120
--socket-id 0 \
121-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
121+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
122122
--in-graph /home/<user>/inceptionv3_int8_pretrained_model.pb \
123123
--data-location /home/<user>/datasets/ImageNet_TFRecords \
124124
-- warmup_steps=50 steps=500
@@ -135,7 +135,7 @@ python launch_benchmark.py \
135135
--benchmark-only \
136136
--batch-size 1 \
137137
--socket-id 0 \
138-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
138+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
139139
--in-graph /home/<user>/inceptionv3_int8_pretrained_model.pb \
140140
-- warmup_steps=50 steps=500
141141
```
@@ -151,7 +151,7 @@ python launch_benchmark.py \
151151
--benchmark-only \
152152
--batch-size 128 \
153153
--socket-id 0 \
154-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
154+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
155155
--in-graph /home/<user>/inceptionv3_int8_pretrained_model.pb \
156156
--data-location /home/<user>/datasets/ImageNet_TFRecords \
157157
-- warmup_steps=50 steps=500
@@ -168,7 +168,7 @@ python launch_benchmark.py \
168168
--benchmark-only \
169169
--batch-size 128 \
170170
--socket-id 0 \
171-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
171+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
172172
--in-graph /home/<user>/inceptionv3_int8_pretrained_model.pb \
173173
-- warmup_steps=50 steps=500
174174
```
@@ -258,7 +258,7 @@ python launch_benchmark.py \
258258
--framework tensorflow \
259259
--batch-size 1 \
260260
--socket-id 0 \
261-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
261+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
262262
--in-graph /home/<user>/inceptionv3_fp32_pretrained_model.pb
263263
```
264264
Example log tail when running for online inference:
@@ -289,7 +289,7 @@ python launch_benchmark.py \
289289
--framework tensorflow \
290290
--batch-size 128 \
291291
--socket-id 0 \
292-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
292+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
293293
--in-graph /home/<user>/inceptionv3_fp32_pretrained_model.pb
294294
```
295295
Example log tail when running for batch inference:
@@ -321,7 +321,7 @@ python launch_benchmark.py \
321321
--accuracy-only \
322322
--batch-size 100 \
323323
--data-location /dataset/Imagenet_Validation \
324-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
324+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
325325
--in-graph /home/<user>/inceptionv3_fp32_pretrained_model.pb
326326
```
327327
Example log tail when running for accuracy:

benchmarks/image_recognition/tensorflow/resnet50v1_5/README.md

+16-16
Original file line numberDiff line numberDiff line change
@@ -47,10 +47,10 @@ $ git clone https://github.com/IntelAI/models.git
4747
The optimized ResNet50v1.5 model files are attached to the [intelai/models](https://github.com/intelai/models) repo and
4848
located at `models/models/image_recognition/tensorflow/resnet50v1_5/`.
4949

50-
The docker image (`intel/intel-optimized-tensorflow:2.2.0`)
50+
The docker image (`intel/intel-optimized-tensorflow:2.3.0`)
5151
used in the commands above were built using
5252
[TensorFlow](https://github.com/tensorflow/tensorflow.git) master for TensorFlow
53-
version 2.2.0.
53+
version 2.3.0.
5454

5555
* Calculate the model accuracy, the required parameters parameters include: the `ImageNet` dataset location (from step 1),
5656
the pre-trained `resnet50v1_5_int8_pretrained_model.pb` input graph file (from step 2), and the `--accuracy-only` flag.
@@ -66,7 +66,7 @@ $ python launch_benchmark.py \
6666
--mode inference \
6767
--batch-size=100 \
6868
--accuracy-only \
69-
--docker-image intel/intel-optimized-tensorflow:2.2.0
69+
--docker-image intel/intel-optimized-tensorflow:2.3.0
7070
```
7171
The log file is saved to the value of `--output-dir`.
7272

@@ -105,7 +105,7 @@ $ python launch_benchmark.py \
105105
--mode inference \
106106
--batch-size=128 \
107107
--benchmark-only \
108-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
108+
--docker-image intel/intel-optimized-tensorflow:2.3.0
109109
-- warmup_steps=50 steps=500
110110
```
111111
The tail of the log output when the benchmarking completes should look
@@ -164,7 +164,7 @@ $ python launch_benchmark.py \
164164
--mode inference \
165165
--batch-size=1 \
166166
--socket-id=0 \
167-
--docker-image intel/intel-optimized-tensorflow:2.2.0
167+
--docker-image intel/intel-optimized-tensorflow:2.3.0
168168
```
169169

170170
The log file is saved to the value of `--output-dir`.
@@ -202,7 +202,7 @@ $ python launch_benchmark.py \
202202
--mode inference \
203203
--batch-size=128 \
204204
--socket-id=0 \
205-
--docker-image intel/intel-optimized-tensorflow:2.2.0
205+
--docker-image intel/intel-optimized-tensorflow:2.3.0
206206
```
207207

208208
The log file is saved to the value of `--output-dir`.
@@ -243,7 +243,7 @@ $ python launch_benchmark.py \
243243
--batch-size 100 \
244244
--socket-id=0 \
245245
--data-location /home/<user>/dataset/ImageNetData_directory \
246-
--docker-image intel/intel-optimized-tensorflow:2.2.0
246+
--docker-image intel/intel-optimized-tensorflow:2.3.0
247247
```
248248

249249
The log file is saved to the value of `--output-dir`.
@@ -280,7 +280,7 @@ $ python launch_benchmark.py \
280280
--batch-size 100 \
281281
--socket-id=0 \
282282
--data-location /home/<user>/dataset/ImageNetData_directory \
283-
--docker-image intel/intel-optimized-tensorflow:2.2.0
283+
--docker-image intel/intel-optimized-tensorflow:2.3.0
284284
```
285285
The results file will be written to the
286286
`models/benchmarks/common/tensorflow/logs` directory, unless another
@@ -344,7 +344,7 @@ $ python launch_benchmark.py \
344344
--mode inference \
345345
--batch-size=1 \
346346
--socket-id 0 \
347-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
347+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
348348
```
349349

350350
The log file is saved to the value of `--output-dir`.
@@ -380,7 +380,7 @@ $ python launch_benchmark.py \
380380
--mode inference \
381381
--batch-size=128 \
382382
--socket-id 0 \
383-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
383+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
384384
```
385385

386386
The log file is saved to the value of `--output-dir`.
@@ -419,7 +419,7 @@ $ python launch_benchmark.py \
419419
--batch-size 100 \
420420
--socket-id 0 \
421421
--data-location /home/<user>/dataset/ImageNetData_directory \
422-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
422+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
423423
```
424424

425425
The log file is saved to the value of `--output-dir`.
@@ -454,7 +454,7 @@ $ python launch_benchmark.py \
454454
--batch-size 100 \
455455
--socket-id 0 \
456456
--data-location /home/<user>/dataset/ImageNetData_directory \
457-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
457+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
458458
```
459459
The results file will be written to the
460460
`models/benchmarks/common/tensorflow/logs` directory, unless another
@@ -555,7 +555,7 @@ $ python launch_benchmark.py \
555555
--framework tensorflow \
556556
--checkpoint <location_to_store_training_checkpoints> \
557557
--data-location=/home/<user>/dataset/ImageNetData_directory \
558-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
558+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
559559
```
560560

561561
This run will take considerable amount of time since it is running for
@@ -585,7 +585,7 @@ $ python launch_benchmark.py \
585585
--framework tensorflow \
586586
--data-location=/home/<user>/dataset/ImageNetData_directory \
587587
--mpi_num_processes=2 \
588-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly
588+
--docker-image=intel/intel-optimized-tensorflow:2.3.0
589589
```
590590
The above distributed training runs one MPI process per socket, to maximize performance, users can run more than one (commonly two) MPI processes per socket. The following command achieves launching 4 MPI processes over 2 sockets. Note that in this case we need to reduce the OMP_NUM_THREADS and intra_op_parallelism_threads by half (minus one or two for performance sometimes, e.g. half of 28 becomes 14, and we can use 12 for good performance). This is controlled by "-a <half the amount of cores of per socket or less>". Batch size can remain the same for weak scaling or reduced by half as well for strong scaling.
591591

@@ -598,7 +598,7 @@ $ python launch_benchmark.py \
598598
--data-location=/home/<user>/dataset/ImageNetData_directory \
599599
--mpi_num_processes=4 \
600600
--mpi_num_processes_per_socket=2 \
601-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
601+
--docker-image=intel/intel-optimized-tensorflow:2.3.0 \
602602
-a <half the amount of cores per socket or less>
603603
```
604604

@@ -613,7 +613,7 @@ $ python launch_benchmark.py \
613613
--data-location=/home/<user>/dataset/ImageNetData_directory \
614614
--mpi_num_processes=2 \
615615
--mpi_num_processes_per_socket=1 \
616-
--docker-image=intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
616+
--docker-image=intel/intel-optimized-tensorflow:2.3.0 \
617617
-a <half the amount of cores per socket or less>
618618
```
619619

benchmarks/language_modeling/tensorflow/bert_large/README.md

+12-12
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ python launch_benchmark.py \
5050
--mode=training \
5151
--framework=tensorflow \
5252
--batch-size=24 \
53-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
53+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
5454
--volume $BERT_LARGE_DIR:$BERT_LARGE_DIR \
5555
--volume $SQUAD_DIR:$SQUAD_DIR \
5656
-- train_option=SQuAD \
@@ -66,7 +66,7 @@ python launch_benchmark.py \
6666
max_seq_length=384 \
6767
doc_stride=128 \
6868
optimized_softmax=True \
69-
experimental_gelu=True \
69+
experimental_gelu=False \
7070
do_lower_case=True
7171
7272
```
@@ -111,7 +111,7 @@ python launch_benchmark.py \
111111
--framework=tensorflow \
112112
--batch-size=24 \
113113
--mpi_num_processes=<num_of_sockets> \
114-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
114+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
115115
--volume $BERT_LARGE_DIR:$BERT_LARGE_DIR \
116116
--volume $SQUAD_DIR:$SQUAD_DIR \
117117
-- train_option=SQuAD \
@@ -127,7 +127,7 @@ python launch_benchmark.py \
127127
max_seq_length=384 \
128128
doc_stride=128 \
129129
optimized_softmax=True \
130-
experimental_gelu=True \
130+
experimental_gelu=False \
131131
do_lower_case=True
132132
```
133133
The results file will be written to the
@@ -148,7 +148,7 @@ python launch_benchmark.py \
148148
--mode=training \
149149
--framework=tensorflow \
150150
--batch-size=32 \
151-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
151+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
152152
--volume $BERT_BASE_DIR:$BERT_BASE_DIR \
153153
--volume $GLUE_DIR:$GLUE_DIR \
154154
-- train-option=Classifier \
@@ -163,7 +163,7 @@ python launch_benchmark.py \
163163
learning-rate=2e-5 \
164164
num-train-epochs=30 \
165165
optimized_softmax=True \
166-
experimental_gelu=True \
166+
experimental_gelu=False \
167167
do-lower-case=True
168168
169169
```
@@ -185,7 +185,7 @@ python launch_benchmark.py \
185185
--framework=tensorflow \
186186
--batch-size=32 \
187187
--mpi_num_processes=<num_of_sockets> \
188-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
188+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
189189
--volume $BERT_LARGE_DIR:$BERT_LARGE_DIR \
190190
--volume $GLUE_DIR:$GLUE_DIR \
191191
-- train-option=Classifier \
@@ -227,7 +227,7 @@ python launch_benchmark.py \
227227
--socket-id=0 \
228228
--num-intra-threads=24 \
229229
--num-inter-threads=1 \
230-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
230+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
231231
--volume $BERT_LARGE_DIR:$BERT_LARGE_DIR \
232232
--volume $PRETRAINING_DATA_DIR:$PRETRAINING_DATA_DIR \
233233
-- train-option=Pretraining \
@@ -262,7 +262,7 @@ python launch_benchmark.py \
262262
--num-intra-threads=22 \
263263
--num-inter-threads=1 \
264264
--mpi_num_processes=4 \
265-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
265+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
266266
--volume $BERT_LARGE_DIR:$BERT_LARGE_DIR \
267267
--volume $PRETRAINING_DATA_DIR:$PRETRAINING_DATA_DIR \
268268
-- train-option=Pretraining \
@@ -344,7 +344,7 @@ FP32 training instructions are the same as Bfloat16 training instructions above,
344344
--checkpoint /home/<user>/bert_large_checkpoints \
345345
--output-dir /home/<user>/bert-squad-output \
346346
--benchmark-only \
347-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
347+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
348348
-- infer_option=SQuAD
349349
```
350350
@@ -359,7 +359,7 @@ FP32 training instructions are the same as Bfloat16 training instructions above,
359359
--data-location /home/<user>/wwm_uncased_L-24_H-1024_A-16 \
360360
--checkpoint /home/<user>/bert_large_checkpoints \
361361
--output-dir /home/<user>/bert-squad-output \
362-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
362+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
363363
-- profile=True infer_option=SQuAD
364364
```
365365
@@ -374,7 +374,7 @@ FP32 training instructions are the same as Bfloat16 training instructions above,
374374
--data-location /home/<user>/wwm_uncased_L-24_H-1024_A-16 \
375375
--checkpoint /home/<user>/bert_large_checkpoints \
376376
--output-dir /home/<user>/bert-squad-output \
377-
--docker-image intel/intel-optimized-tensorflow:tensorflow-2.2-bf16-nightly \
377+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
378378
--accuracy-only \
379379
-- infer_option=SQuAD
380380
```

benchmarks/language_translation/tensorflow/bert/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -75,7 +75,7 @@ You can also use the helper script [download_glue_data.py](https://gist.github.c
7575
learning-rate=2e-5 \
7676
num_train_epochs=3.0
7777
```
78-
Using `--docker-image intel/intel-optimized-tensorflow:2.2.0` to run with Intel optimized TensorFlow docker container.
78+
Using `--docker-image intel/intel-optimized-tensorflow:2.3.0` to run with Intel optimized TensorFlow docker container.
7979

8080

8181
6. The log file is saved to the `models/benchmarks/common/tensorflow/logs` directory. Below are examples of what the tail of your log file should look like for the different configs.

benchmarks/language_translation/tensorflow/mlperf_gnmt/README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ python launch_benchmark.py \
6969
--batch-size 1 \
7070
--socket-id 0 \
7171
--data-location /home/<user>/nmt/data \
72-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
72+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
7373
--in-graph /home/<user>/mlperf_gnmt_fp32_pretrained_model.pb \
7474
--benchmark-only
7575
```
@@ -84,7 +84,7 @@ python launch_benchmark.py \
8484
--batch-size 32 \
8585
--socket-id 0 \
8686
--data-location /home/<user>/nmt/data \
87-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
87+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
8888
--in-graph /home/<user>/mlperf_gnmt_fp32_pretrained_model.pb \
8989
--benchmark-only
9090
```
@@ -99,7 +99,7 @@ python launch_benchmark.py \
9999
--batch-size 32 \
100100
--socket-id 0 \
101101
--data-location /home/<user>/nmt/data \
102-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
102+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
103103
--in-graph /home/<user>/mlperf_gnmt_fp32_pretrained_model.pb \
104104
--accuracy-only
105105
```

benchmarks/language_translation/tensorflow/transformer_lt_official/README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ python launch_benchmark.py \
5252
--framework tensorflow \
5353
--batch-size 1 \
5454
--socket-id 0 \
55-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
55+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
5656
--in-graph /home/<user>/transformer_lt_official_fp32_pretrained_model/graph/fp32_graphdef.pb \
5757
--data-location /home/<user>/transformer_lt_official_fp32_pretrained_model/data \
5858
-- file=newstest2014.en \
@@ -71,7 +71,7 @@ python launch_benchmark.py \
7171
--framework tensorflow \
7272
--batch-size 64 \
7373
--socket-id 0 \
74-
--docker-image intel/intel-optimized-tensorflow:2.2.0 \
74+
--docker-image intel/intel-optimized-tensorflow:2.3.0 \
7575
--in-graph /home/<user>/transformer_lt_official_fp32_pretrained_model/graph/fp32_graphdef.pb \
7676
--data-location /home/<user>/transformer_lt_official_fp32_pretrained_model/data \
7777
-- file=newstest2014.en \

0 commit comments

Comments
 (0)