Skip to content

Could the read name modification be improved? #52

@TomSmithCGAT

Description

@TomSmithCGAT

Hello,

Is there are reason why the read name information needs to be concatenated here?

read.name.replace(" ", "_").replace("/", "").replace("\\", "")

I ask because it breaks downstream tools like umi_tools dedup which use the read names to identify read pairs. Would it be tolerable to add the 'rbc:` string to the end of the first element of the space-delimited read name?

Happy to issue a PR if that would work too. I note this would be a non-backward compatibile change if any other downstream tool/code is dependent upon the current read naming convention.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions