ibolmo (Collaborator) commented Dec 22, 2025

No description provided.

ibolmo self-assigned this Dec 22, 2025

github-actions bot commented Dec 22, 2025

Braintrust eval report

Autoevals (python-3.10-1767050903)

| Score | Average | Improvements | Regressions |
| --- | --- | --- | --- |
| NumericDiff | 74.5% (+0pp) | - | - |
| Time_to_first_token | 1.35tok (-0.1tok) | 108 🟢 | 11 🔴 |
| Llm_calls | 1.55 (+0) | - | - |
| Tool_calls | 0 (+0) | - | - |
| Errors | 0 (+0) | - | - |
| Llm_errors | 0 (+0) | - | - |
| Tool_errors | 0 (+0) | - | - |
| Prompt_tokens | 279.25tok (+0tok) | - | - |
| Prompt_cached_tokens | 0tok (+0tok) | - | - |
| Prompt_cache_creation_tokens | 0tok (+0tok) | - | - |
| Completion_tokens | 18.38tok (+0tok) | - | - |
| Completion_reasoning_tokens | 0tok (+0tok) | - | - |
| Total_tokens | 297.62tok (+0tok) | - | - |
| Estimated_cost | 0$ (+0$) | - | - |
| Duration | 1.38s (-0.13s) | 201 🟢 | 17 🔴 |
| Llm_duration | 2.76s (-0.26s) | 112 🟢 | 7 🔴 |

cpinn commented Dec 29, 2025

I'm not an approver, but I think this would be a good change to merge before my zod upgrade attempt.

ibolmo mentioned this pull request Dec 29, 2025

3 participants