Feature/refactor DIMS PeakFinding #76

mraves2 · 2025-07-11T15:33:59Z

The PeakFinding step in the DIMS pipeline has been refactored. Instead of averaging the intensities for technical replicates and doing PeakFinding on the averages, the new method will do PeakFinding for each technical replicate and then average the peak intensities for every biological sample. To do this, several scripts have been modified:

MakeInit: changed variable name 'tmp' to 'replicates_persample'
GenerateBreaks: breaks and trim parameters put into separate RData files
AssignToBins: trim parameters read in separately, 'sample_name' changed to 'techrep_name', weighted mean for half-bad TICs
AverageTechReplicates: averaging part removed, script renamed to EvaluateTics. Update txt file with info on samples and corresponding tech reps
PeakFinding: new simplified way to find peaks for every technical replicate. First step: find regions of interest (roi) with some intensity; second step: integrate intensity in each roi by fitting a Gaussian curve
- preprocessing/peak_finding_functions: new functions for PeakFinding. Note that two functions are borrowed from other R packages which will be included in the Docker image in the future. These two functions may not adhere to our coding standards
AveragePeaks: averaging technical replicates after PeakFinding. Information for technical replicates from a txt file with scanmode included.
CollectAveraged: collect averaged peaks for all biological samples
PeakGrouping: input from CollectAveraged
tests/testthat/test_peak_finding_functions: unit tests for PeakFinding funtions. Note: no unit tests have been added for functions from external packages.

…essing

… file with scanmode

…akFinding method

fdekievit

Veel werk!

Ik heb hier en daar wat comments achter gelaten :)

DIMS/CollectAveraged.nf

DIMS/AssignToBins.R

DIMS/AveragePeaks.R

DIMS/tests/testthat/test_peak_finding_functions.R

fdekievit

Big improvement!

Left some minor comments :)

DIMS/preprocessing/evaluate_tics_functions.R

…ions.R

…sample_name

fdekievit

Een aantal functies kan ik niet volgen omdat ik niet kan zien waar ze vandaan komen (omdat er door sources functies opeens 'bestaan') dus hier heb ik geen comments voor achter gelaten.

Er is een linter toegevoegd, maar zonder de linter warnings toe te passen voegt de linter nu nog niks toe. In de python repo's is volgens mij besloten om error on warning op true te zetten waardoor je geforceerd wordt om deze warnings aan te pakken (anders voegt het natuurlijk nog niet zoveel toe). De linter geeft nu > 500 warnings.

Als laatste wordt er soms over dataframes gelooped zonder te weten of ze gevuld zijn. Als dit geen probleem is, laat dit dan a.u.b. zien door een unittest te runnen op een leeg object b.v.

DIMS/AveragePeaks.R

DIMS/preprocessing/evaluate_tics_functions.R

DIMS/tests/testthat/test_average_peaks.R

DIMS/tests/testthat/test_evaluate_tics.R

mraves2 · 2025-12-22T15:11:21Z

Hai Frank, Je hebt gelijk denk ik, dit is iets dat we in het vervolg als we meerdere stappen gaan samenvoegen en het datamodel aanpassen niet meer nodig zullen hebben, dus ik zal het nu zo laten, maar in het achterhoofd houden wat er allemaal fout zou kunnen gaan. Groetjes, Mia. From: fdekievit ***@***.***> Date: Monday, 22 December 2025 at 16:08 To: UMCUGenetics/CustomModules ***@***.***> Cc: Pras-Raves-2, M.L. (Mia) ***@***.***>, Author ***@***.***> Subject: Re: [UMCUGenetics/CustomModules] Feature/refactor DIMS PeakFinding (PR #76) @fdekievit commented on this pull request.

________________________________ In DIMS/AveragePeaks.R<#76 (comment)>:

@@ -0,0 +1,37 @@

+library(dplyr) + +# define parameters +cmd_args <- commandArgs(trailingOnly = TRUE) + +sample_name <- cmd_args[1] +techreps <- cmd_args[2] +scanmode <- cmd_args[3] +preprocessing_scripts_dir <- cmd_args[4] +tech_reps <- strsplit(techreps, ";")[[1]] maar daarom kan het toch geen kwaad om het toch in een try-catch te zetten? je gaat er vanuit dat het niet voor kan komen (gelukkig maar), maar puur technisch gezien kan de code prima kapot (als je het verkeerd aanroept) en dus is defensief programmeren hier best justified in mijn ogen. Maargoed, ik laat t varen... — Reply to this email directly, view it on GitHub<#76 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/A32BYK5TIDNCV4L2F5GWVS34DACOVAVCNFSM6AAAAACBKEH6L6VHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZTMMBUGQZTMNBUGA>. You are receiving this because you authored the thread.

…

________________________________ De informatie opgenomen in dit bericht kan vertrouwelijk zijn en is uitsluitend bestemd voor de geadresseerde. Indien u dit bericht onterecht ontvangt, wordt u verzocht de inhoud niet te gebruiken en de afzender direct te informeren door het bericht te retourneren. Het Universitair Medisch Centrum Utrecht is een publiekrechtelijke rechtspersoon in de zin van de W.H.W. (Wet Hoger Onderwijs en Wetenschappelijk Onderzoek) en staat geregistreerd bij de Kamer van Koophandel voor Midden-Nederland onder nr. 30244197. Denk s.v.p aan het milieu voor u deze e-mail afdrukt.

________________________________ This message may contain confidential information and is intended exclusively for the addressee. If you receive this message unintentionally, please do not use the contents but notify the sender immediately by return e-mail. University Medical Center Utrecht is a legal person by public law and is registered at the Chamber of Commerce for Midden-Nederland under no. 30244197. Please consider the environment before printing this e-mail.

fdekievit

Approved!

mraves2 added 17 commits January 31, 2025 17:22

GenerateBreaks output split into 2 files

0f724c4

AverageTechReplicates replaced by EvaluateTics

de495e3

refactored PeakFinding, peak finding funtions moved to folder preproc…

434a41f

…essing

refactor DIMS PeakFinding, flow between scripts

9d7a69f

added unit tests for DIMS peak finding

62fc225

changed variable name tmp to replicates_persample

5d311a3

omitted obsolete lines

708b872

added weighted mean for half-bad TICs

0521390

replaced AverageTechReplicates step by EvaluateTics

15403cd

replaced AverageTechReplicates step by EvaluateTics

6a7ada6

removed breaks as input for PeakFinding

6438e0e

changed PeakFinding to new two-step method

1ef197d

functions for new two-step PeakFinding method

9272ec3

unit tests for new two-step PeakFinding method

e006160

information for averaging peaks for technical replicates based on txt…

8102bdb

… file with scanmode

modified input for PeakGrouping corresponding to new PeakFinding method

d0ed769

collect averaged peaks per biological sample, corresponding to new Pe…

a35c4ca

…akFinding method

mraves2 marked this pull request as draft July 11, 2025 15:40

DIMS CustomModules merge conflicts resolved

d903e1b

mraves2 marked this pull request as ready for review July 15, 2025 10:26

fixed path to DIMS peak_finding_functions

295e460

fdekievit requested changes Jul 25, 2025

View reviewed changes

mraves2 added 7 commits October 2, 2025 11:48

created function for averaging peaks in DIMS/AveragePeaks.R

109d664

added unit tests for average_peaks_functions

cb33aba

moved parameters matrix and nr_replicates from workflow into params

db8633e

refactored DIMS/EvaluateTics

004e3e9

moved functions for DIMS/EvaluateTics to separate file

06e5e1a

added unit tests for DIMS/EvaluateTics

3f24b6d

modifications suggested in code review DIMS/PeakFinding

c2c65dd

fdekievit requested changes Oct 16, 2025

View reviewed changes

DIMS/preprocessing/evaluate_tics_functions.R Show resolved Hide resolved

DIMS/preprocessing/evaluate_tics_functions.R Outdated Show resolved Hide resolved

mraves2 added 6 commits October 16, 2025 16:21

removed two obsolete lines

15a25ba

resolved merge conflict in DIMS/EvaluateTics.R

527acf6

moved parameter ppm_peak from DIMS/AveragePeaks.R to inside function

007bea4

added parameter sample_name to DIMS/preprocessing/average_peaks_funct…

e58640b

…ions.R

modified DIMS/tests/testthat/test_average_peaks.R for extra variable …

f0763a0

…sample_name

added fixture files for unit test for DIMS/EvaluateTics

ac9f43f

fdekievit requested changes Dec 1, 2025

View reviewed changes

added unit test for empty peaklist

083aeac

fdekievit approved these changes Dec 22, 2025

View reviewed changes

Feature/refactor DIMS PeakFinding #76

Are you sure you want to change the base?

Feature/refactor DIMS PeakFinding #76

Uh oh!

Conversation

mraves2 commented Jul 11, 2025

Uh oh!

fdekievit left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fdekievit left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

fdekievit left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mraves2 commented Dec 22, 2025 via email

Uh oh!

fdekievit left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants