We use greedy decoding as an example to show how to evaluate the generated code samples via the remote API.

> [!Note]
>
> Remotely executing on `BigCodeBench-Full` typically takes 6-7 minutes, and on `BigCodeBench-Hard` typically takes 4-5 minutes.

```bash
# greedy decoding by default
bigcodebench.evaluate \
  --model meta-llama/Meta-Llama-3.1-8B-Instruct \
  --split [complete|instruct] \
  ...
```

- The evaluation results will be stored in a file named `[model_name]--bigcodebench-[instruct|complete]--[backend]-[temp]-[n_samples]-sanitized_calibrated_eval_results.json`.
- The pass@k results will be stored in a file named `[model_name]--bigcodebench-[instruct|complete]--[backend]-[temp]-[n_samples]-sanitized_calibrated_pass_at_k.json`; a quick way to inspect it is sketched below.
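
Both files are plain JSON, so the scores can be checked from the shell. A minimal sketch, assuming `jq` is available and that the pass@k report exposes top-level keys such as `pass@1` (the file name below is a hypothetical instance of the naming pattern above):

```bash
# Hypothetical output file following the naming pattern above; substitute your own.
RESULTS="meta-llama--Meta-Llama-3.1-8B-Instruct--bigcodebench-instruct--vllm-0-1-sanitized_calibrated_pass_at_k.json"

# List the top-level keys to see which pass@k metrics were computed.
jq 'keys' "$RESULTS"

# Read a single metric; "pass@1" is an assumed key name.
jq '."pass@1"' "$RESULTS"
```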

> [!Note]
>
> BigCodeBench uses different prompts for base and chat models.
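
In practice this note ties to the split you pick: `complete` uses code-completion prompts, which base models can follow, while `instruct` uses natural-language instructions aimed at chat models. A minimal sketch of the two invocations (the base-model name is an illustrative assumption, and the remaining flags are elided as above):

```bash
# Code-completion prompts (`complete` split), e.g. for a base model.
bigcodebench.evaluate \
  --model meta-llama/Meta-Llama-3.1-8B \
  --split complete \
  ...

# Natural-language instruction prompts (`instruct` split), e.g. for a chat model.
bigcodebench.evaluate \
  --model meta-llama/Meta-Llama-3.1-8B-Instruct \
  --split instruct \
  ...
```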