Commit 0ca641d

Various fixes to CPU benchmark jobs (#108)
* [no ci] Minor tweak for CPU jobs
* [no ci] Another attempt
* [no ci] Disable aarch64 CPU bm for now
* [no ci] The heck
* [no ci] WTF
* [no ci] Testing
* [no ci] Ugrh
* [no ci] Moving files around?
* [no ci] DEBUG
* [no ci] DEBUG
* Fix a bug in deup model
* Clean up debug
* [no ci] Should work now

---------

Signed-off-by: Huy Do <huydhn@gmail.com>
1 parent de7bec9 commit 0ca641d

9 files changed: +472 -225 lines changed

.github/scripts/generate_vllm_benchmark_matrix.py

Lines changed: 6 additions & 2 deletions
@@ -60,7 +60,7 @@
     "linux.rocm.gpu.gfx942.8": "rocm",
     "linux.24xl.spr-metal": "cpu",
     "linux.24xl.gnr": "cpu",
-    "linux.arm64.m7g.4xlarge": "cpu",
+    "linux.arm64.m7g.4xlarge": "arm64-cpu",
     "linux.hpu.gaudi3.8": "hpu",
 }
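
The hunk above gives the arm64 runner its own device label, so it no longer shares the `cpu` bucket with the two x86 runners. A minimal sketch of what that means for a lookup follows. It assumes the hypothetical names `RUNNER_TO_DEVICE`, `device_for_runner`, and `runners_for_device`; the diff shows only the entries, not the dictionary's name or how the script consumes it.

```python
from typing import List, Optional

# Hypothetical name: the diff shows only these entries, not the dict's name,
# and the real mapping may contain more runners than the hunk displays.
RUNNER_TO_DEVICE = {
    "linux.rocm.gpu.gfx942.8": "rocm",
    "linux.24xl.spr-metal": "cpu",
    "linux.24xl.gnr": "cpu",
    "linux.arm64.m7g.4xlarge": "arm64-cpu",  # was "cpu" before this commit
    "linux.hpu.gaudi3.8": "hpu",
}


def device_for_runner(runner: str) -> Optional[str]:
    """Return the device label for a runner, or None if it is not listed."""
    return RUNNER_TO_DEVICE.get(runner)


def runners_for_device(device: str) -> List[str]:
    """Return every listed runner that maps to the given device label."""
    return [r for r, d in RUNNER_TO_DEVICE.items() if d == device]
```

With the new label, asking for `cpu` runners returns only the two x86 entries, and the arm64 runner has to be requested as `arm64-cpu`.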
@@ -77,6 +77,9 @@
 # and not h100. This also serves as a knob to tune CI behavior. TODO (huydhn):
 # Figure out how to set this in the JSON benchmark configuration instead
 PLATFORM_SKIPS = {
+    "meta-llama/Llama-3.1-8B-Instruct": [
+        "linux.24xl.spr-metal",  # Timed out
+    ],
     # Already been covered in both A100 and H100
     "meta-llama/Meta-Llama-3.1-8B-Instruct": [
         "linux.dgx.b200",
@@ -272,7 +275,7 @@ def generate_benchmark_matrix(
 # Dedup
 if model in selected_models:
     continue
-# and only choose the selected model:
+# and only choose the selected model
 if models and model not in models:
     continue
 selected_models.append(model)
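
The context lines in this hunk show a two-step filter: drop models that were already selected, then keep only models named in the optional `models` filter. A self-contained sketch of that logic follows. The wrapper function `select_models` and its `candidates` parameter are hypothetical; in the actual script these lines sit inside `generate_benchmark_matrix`, whose surrounding loop is not visible in the diff.

```python
from typing import List, Optional


def select_models(
    candidates: List[str], models: Optional[List[str]] = None
) -> List[str]:
    """Hypothetical wrapper around the filter shown in the hunk."""
    selected_models: List[str] = []
    for model in candidates:
        # Dedup
        if model in selected_models:
            continue
        # and only choose the selected model
        if models and model not in models:
            continue
        selected_models.append(model)
    return selected_models
```

An empty or missing `models` filter selects every distinct candidate, because `if models and ...` short-circuits on a falsy filter.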
@@ -324,6 +327,7 @@ def main() -> None:
         models,
         runners,
     )
+    print(benchmark_matrix)
     set_output("benchmark_matrix", benchmark_matrix)
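
The added `print` makes the generated matrix visible in the job log before it is handed to `set_output`. The body of `set_output` is not part of this diff. As a rough sketch of what such a helper typically does on GitHub Actions, the hypothetical function `write_step_output` below appends a `name=value` line to the file named by the `GITHUB_OUTPUT` environment variable and falls back to printing when that variable is unset. It assumes the value is already a single-line string.

```python
import os


def write_step_output(name: str, value: str) -> None:
    """Hypothetical stand-in for set_output; not the script's actual code.

    Handles single-line values only. Multi-line values need the delimiter
    syntax that GitHub Actions defines for GITHUB_OUTPUT.
    """
    line = f"{name}={value}"
    output_path = os.getenv("GITHUB_OUTPUT")
    if not output_path:
        # Not running under GitHub Actions: show the value instead.
        print(line)
        return
    with open(output_path, "a", encoding="utf-8") as f:
        f.write(line + "\n")
```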