
Conversation

@Mattdl (Collaborator) commented Jan 9, 2026

Model evaluation is ideal for checking whether a new contribution reproduces the results from a paper. However, we don't want every PR to trigger a full evaluation of all models, so it is now disabled by default and documented in the Model contributing documentation.
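
A minimal sketch of one way "disabled by default" could be wired up, assuming the test suite uses pytest; the `reproduce` marker and `--run-reproduce` flag are illustrative names, not necessarily what the repository actually uses:

```python
# conftest.py -- hypothetical sketch; the repository may use a different mechanism
import pytest


def pytest_addoption(parser):
    # Opt-in flag that the manually triggered workflow can pass;
    # regular PR runs omit it, so reproduction tests are skipped.
    parser.addoption(
        "--run-reproduce",
        action="store_true",
        default=False,
        help="run model-reproduction tests (slow: evaluates models on the benchmark)",
    )


def pytest_configure(config):
    # Register the marker so pytest does not warn about an unknown mark.
    config.addinivalue_line(
        "markers", "reproduce: full model-evaluation reproduction test"
    )


def pytest_collection_modifyitems(config, items):
    if config.getoption("--run-reproduce"):
        return  # reproduction explicitly requested: run everything
    skip_marker = pytest.mark.skip(reason="reproduction tests are disabled by default")
    for item in items:
        if "reproduce" in item.keywords:
            item.add_marker(skip_marker)
```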

Reproducing model results on a benchmark now lives in a separate GitHub workflow that can be triggered manually. It is excluded by default so the test suite does not keep growing with every model that is added.
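
The manually triggered workflow would then opt back in, e.g. by passing the flag sketched above. A hedged, illustrative example of what a reproduction test carrying that marker might look like (the metric values are placeholders, not from any real paper):

```python
# test_reproduce_results.py -- hypothetical example, not the repository's actual test
import pytest


@pytest.mark.reproduce  # skipped unless pytest is invoked with --run-reproduce
def test_model_reproduces_paper_results():
    # Placeholder for the real evaluation: load the model, run it on the
    # benchmark, and compare against the metric reported in the paper.
    reported_accuracy = 0.87   # illustrative value only
    measured_accuracy = 0.87   # stand-in for an actual evaluation run
    assert abs(measured_accuracy - reported_accuracy) < 0.01
```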
@Mattdl merged commit 44dbe7e into main on Jan 9, 2026
2 checks passed
@Mattdl deleted the refactor-contributions branch on January 9, 2026 at 11:54