Draft
24 commits
479437a
Add transcriptformer_mlflow method component
lazappi Jul 8, 2025
95ced9b
Adjust transcriptformer_mlflow dependencies
lazappi Jul 8, 2025
6e8dce2
Output embedding in transcriptformer_mlflow
lazappi Jul 8, 2025
440a18b
Install transcriptformer requirements from file
lazappi Jul 8, 2025
5332e00
Add extracting model archive to transcriptformer
lazappi Jul 9, 2025
b84a40a
Add transcriptformer_mlflow to benchmark workflow
lazappi Jul 9, 2025
74be855
Install openproblems package for transcriptformer
lazappi Jul 9, 2025
8693f35
Style transcriptformer_mlflow script
lazappi Jul 9, 2025
b7dae0b
Increase transcriptformer memory label
lazappi Jul 29, 2025
5d3c6e0
Add scvi_mlflow method
lazappi Aug 13, 2025
0243f8d
Add geneformer_mlflow method
lazappi Aug 13, 2025
04d8a5d
Add scgpt_mlflow method
lazappi Aug 13, 2025
5c46c4c
Add uce_mlflow method
lazappi Aug 13, 2025
9c82ff4
Add unpack_directory() utils helper
lazappi Aug 13, 2025
2a2a61b
Add unpack helper to scgpt_mlflow
lazappi Aug 13, 2025
8143651
Add unpack helper to scvi_mlflow
lazappi Aug 13, 2025
f179613
Merge remote-tracking branch 'origin/main' into feature/no-ref/add-ml…
lazappi Aug 13, 2025
5e49b7c
Update scvi_mlflow
lazappi Sep 18, 2025
374d455
Merge remote-tracking branch 'origin/main' into feature/no-ref/add-ml…
rcannood Oct 11, 2025
1884f81
use helper functions in mlflow methods
rcannood Oct 12, 2025
28bb336
use biggpu for uce?
rcannood Oct 12, 2025
e480211
disable old methods
rcannood Oct 12, 2025
b4e2ec8
Merge remote-tracking branch 'origin/main' into feature/no-ref/add-ml…
rcannood Oct 19, 2025
329d25e
fix wf
rcannood Oct 19, 2025
2 changes: 1 addition & 1 deletion scripts/run_benchmark/run_full_local.sh
@@ -26,7 +26,7 @@ input_states: resources/datasets/**/state.yaml
rename_keys: 'input_dataset:output_dataset;input_solution:output_solution'
output_state: "state.yaml"
publish_dir: "$publish_dir"
-settings: '{"methods_exclude": ["uce", "scgpt_finetuned"]}'
+settings: '{"methods_exclude": ["uce", "scgpt_finetuned", "transcriptformer_mlflow"]}'
HERE

# run the benchmark
2 changes: 1 addition & 1 deletion scripts/run_benchmark/run_test_local.sh
@@ -21,7 +21,7 @@ input_states: resources_test/task_batch_integration/**/state.yaml
rename_keys: 'input_dataset:output_dataset;input_solution:output_solution'
output_state: "state.yaml"
publish_dir: "$publish_dir"
-settings: '{"methods_exclude": ["uce", "scgpt_finetuned"]}'
+settings: '{"methods_exclude": ["uce", "scgpt_finetuned", "transcriptformer_mlflow"]}'
HERE

nextflow run . \
51 changes: 51 additions & 0 deletions src/methods/geneformer_mlflow/config.vsh.yaml
@@ -0,0 +1,51 @@
__merge__: ../../api/base_method.yaml

name: geneformer_mlflow
label: Geneformer (MLflow model)
summary: Geneformer is a foundation transformer model pretrained on a large-scale corpus of single cell transcriptomes
description: |
  Geneformer is a foundation transformer model pretrained on a large-scale
  corpus of single cell transcriptomes to enable context-aware predictions in
  network biology. For this task, Geneformer is used to create a batch-corrected
  cell embedding.

  Here, we use a version packaged as an MLflow model.
references:
  doi:
    - 10.1038/s41586-023-06139-9
    - 10.1101/2024.08.16.608180
links:
  documentation: https://geneformer.readthedocs.io/en/latest/index.html
  repository: https://huggingface.co/ctheodoris/Geneformer

info:
  method_types: [embedding]
  preferred_normalization: counts

arguments:
  - name: --model
    type: file
    description: |
      An MLflow model URI for the Geneformer model. If it is a .zip or
      .tar.gz file it will be extracted to a temporary directory.
    required: true

resources:
  - type: python_script
    path: script.py
  - path: /src/utils/read_anndata_partial.py
  - path: /src/utils/exit_codes.py
  - path: /src/utils/unpack.py
  - path: /src/utils/mlflow.py
  - path: requirements.txt

engines:
  - type: docker
    image: openproblems/base_pytorch_nvidia:1
    __merge__: /src/utils/mlflow_docker_setup.yaml

runners:
  - type: executable
  - type: nextflow
    directives:
      label: [hightime, highmem, midcpu, gpu]
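The `--model` description above says a .zip or .tar.gz archive is extracted to a temporary directory before use. The contents of `/src/utils/unpack.py` are not shown in this diff, so the following is only a minimal sketch of that behaviour using the Python standard library. The function name `unpack_model` and its return shape are hypothetical and are not taken from the PR.

```python
import shutil
import tempfile
from pathlib import Path


def unpack_model(model_path):
    """Hypothetical helper (not the PR's unpack.py).

    Returns (model_dir, tmp). `tmp` is the TemporaryDirectory holding the
    extracted files, or None when `model_path` was already a directory.
    The caller must keep `tmp` alive while the model is in use.
    """
    path = Path(model_path)
    if path.is_dir():
        # Already an unpacked model directory: use it as-is.
        return path, None

    name = path.name.lower()
    if not (name.endswith(".zip") or name.endswith(".tar.gz")):
        raise ValueError(f"Expected a directory, .zip or .tar.gz: {path}")

    tmp = tempfile.TemporaryDirectory()
    # shutil.unpack_archive picks the archive format from the file extension.
    shutil.unpack_archive(str(path), tmp.name)
    return Path(tmp.name), tmp
```

Returning the `TemporaryDirectory` object alongside the path lets the calling script decide when to call `tmp.cleanup()`. This sketch assumes the archive holds the model files at its top level; an archive that wraps them in a single folder would need one extra step to descend into it.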