Skip to content
This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Commit 73a9ee4

Browse files
authored
Add Latest News section (NVIDIA#368)
1 parent 7ce7e1d commit 73a9ee4

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,7 @@ TensorRT-LLM
1919
## Latest News
2020
* [2023/11/13] [**H200** achieves nearly **12,000 tok/sec on Llama2-13B**](./docs/source/blogs/H200launch.md)
2121

22-
<img src="./docs/source/blogs/media/H200launch_Llama70B_tps.png" alt="H200 Llama2 70B" width="250" height="auto">
23-
<img src="./docs/source/blogs/media/H200launch_GPT175B_tps.png" alt="H200 GPT3 175B" width="250" height="auto">
22+
<img src="./docs/source/blogs/media/H200launch_tps.png" alt="H200 TPS" width="500" height="auto">
2423

2524
H200 FP8 achieves 11,819 tok/s on Llama2-13B on a single GPU, and is up to 1.9x faster than H100.
2625

0 commit comments

Comments
 (0)