Description
Describe the bug
I ran the KSTAR Nextflow pipeline on the Rivanna HPC. The process stopped during the second step, which calculates hypergeometric values after the data are binarized.

Data log showed that all data columns were empty.
To Reproduce
Steps to reproduce the behavior:
The file was saved as a CSV using pandas, with df.to_csv("path") (attached):
TCR_mapped_1UPidNoDups_ACC_SITE.csv
Expected behavior
I expected KSTAR to run through the full kinase activity calculation as usual. When I saved the same data as a TSV with df.to_csv(path, sep='\t', index=False), KSTAR ran successfully. I also omitted the index in this second save (index=True is the default), so that could be the cause as well.
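For reference, here is a minimal sketch of how the two saves differ. The table and its column names are made up for illustration and are not KSTAR's required schema. I have not confirmed which of the two differences (delimiter or index column) KSTAR is sensitive to.

```python
import io

import pandas as pd

# Hypothetical example table; column names are placeholders, not KSTAR's schema.
df = pd.DataFrame({
    "accession": ["P06239", "P43403"],
    "site": ["Y394", "Y493"],
    "data:sample1": [1.5, 0.2],
})

# First save: pandas defaults (comma-delimited, index written as an unnamed first column).
csv_default = df.to_csv()

# Second save: tab-delimited, no index column.
tsv_no_index = df.to_csv(sep="\t", index=False)

csv_header = csv_default.splitlines()[0]
tsv_header = tsv_no_index.splitlines()[0]
print(repr(csv_header))  # header starts with an empty field for the index
print(repr(tsv_header))  # header contains only the real columns

# If a reader assumes tabs, the comma-delimited file collapses into a single column.
misread = pd.read_csv(io.StringIO(csv_default), sep="\t")
print(misread.shape)
```

If a tool parses the comma-delimited file as tab-delimited, every row ends up in one column, so no named data columns would be found. That would be consistent with the empty data columns in the log, but it is only a guess.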
Additional context
I did check that the threshold was low enough to leave sites in the binarized data file. ~100+ sites were left in each column.