Modified scripts to be agnostic to input variantID format #3

dtaylo95 · 2023-09-26T18:50:08Z

As is, the tool requires that variants IDs (both in the input --eqtl file, and --vcf file) be formatted <chrom>_<pos>.... As far as I can tell, there are two reasons for this:

It allows the program to parse the variant's position from its ID.
It meets the formatting requirements used by lmfit in the fitting step.

I am proposing changes that make the tool agnostic to the format of the variant IDs (I can imagine some users have VCFs that use dbSNP rsIDs, for example). Briefly, the changes are as follows:

the --eqtl file now must include two additional columns: variant_chr and variant_pos that describe the (1-based) position of each variant. This information is then used to fetch the genotypes from the tabix-indexed VCF
Variants are assigned unique temporary IDs (a new variant_id_clean column) that meet the formatting requirements of lmfit and are used when fitting the model.
I've also updated the gene_id_clean functionality to match that of the new variant_id_clean column. This assumes no specific formatting of the input gene IDs.

…dated formatting behavior and 2) was causing a Cython type error on compile

dtaylo95 and others added 3 commits September 26, 2023 14:29

Modified scripts to be agnostic to input variantID format

f3ff68c

Tab-separated output

07eab64

Removed function in parse.pyx 1) because it is no longer used with up…

cca1cea

…dated formatting behavior and 2) was causing a Cython type error on compile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Modified scripts to be agnostic to input variantID format #3

Modified scripts to be agnostic to input variantID format #3

Uh oh!

dtaylo95 commented Sep 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Modified scripts to be agnostic to input variantID format #3

Are you sure you want to change the base?

Modified scripts to be agnostic to input variantID format #3

Uh oh!

Conversation

dtaylo95 commented Sep 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant