[cleanup/fix] Make inference parameters explicit, etc. #198
Conversation
xy12181 commented on Feb 19, 2025 (edited):
- Rename `max_num_seq` to `batch_size` (a sketch of the rename follows this list)
- Fix a process runner typo
- ......
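For illustration, a minimal sketch of the rename, assuming a hypothetical `InferenceConfig` dataclass (the class name and default value are not from this PR):

```python
import dataclasses

@dataclasses.dataclass
class InferenceConfig:  # hypothetical class, for illustration only
    # Previously named `max_num_seq`; `batch_size` states directly that
    # this is the number of sequences processed together per step.
    batch_size: int = 8
```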
```
@@ -114,7 +114,7 @@ class LinearParallelConfig:
 @dataclasses.dataclass
 class RMSNormParallelConfig:
     mesh: jax.sharding.Mesh
     activation_shared: bool = False
```
@zhihaoshan-google: this should be `sharded`, not `shared`, right?
Correct! Sorry for the mistake.
LGTM, thanks for the correction!
@vipannalla could you please help review this PR?
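With the fix agreed above, the field would read `activation_sharded`. A minimal sketch of the corrected dataclass and how a config might be built from a device mesh (the usage and mesh axis name are assumptions, not from this PR):

```python
import dataclasses

import jax
import numpy as np

@dataclasses.dataclass
class RMSNormParallelConfig:
    mesh: jax.sharding.Mesh
    # Corrected spelling: the flag says whether activations are *sharded*
    # across the mesh, not shared.
    activation_sharded: bool = False

# Assumed usage: build a 1D mesh over all local devices and enable sharding.
devices = np.array(jax.devices())
mesh = jax.sharding.Mesh(devices, axis_names=("model",))
config = RMSNormParallelConfig(mesh=mesh, activation_sharded=True)
```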
Mini offline benchmarking run for this change:

```
$ python experimental/jax/inference/entrypoint/mini_offline_benchmarking.py
Offline inference begins: 2025-02-19 18:51:29.910043
...
Offline inference ends: 2025-02-19 19:15:14.701318
Benchmarking result:
Total requests: 24000
Total input tokens: 5311141
Total output tokens: 7030596
Input token thruput: 3727.18 tokens/sec
Output token thruput: 4933.84 tokens/sec
```
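As a sanity check on figures like these, throughput is just token count over wall-clock duration. A minimal sketch of that arithmetic, using the timestamps and counts from the log above (small gaps versus the reported figures would come from the exact timing window the benchmark itself measures):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
start = datetime.strptime("2025-02-19 18:51:29.910043", FMT)
end = datetime.strptime("2025-02-19 19:15:14.701318", FMT)
duration_s = (end - start).total_seconds()  # ~1424.8 seconds

input_tokens = 5311141
output_tokens = 7030596

# Tokens per second over the whole run; the benchmark's own numbers may
# use a slightly narrower timing window, so expect small differences.
print(f"Input token thruput:  {input_tokens / duration_s:.2f} tokens/sec")
print(f"Output token thruput: {output_tokens / duration_s:.2f} tokens/sec")
```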