Allow custom options for URL normalisation #40

@falkecarlsen

Note that the www domain label alone is enough to make two URLs compare as unequal under the Url package. This is expected behaviour, since the label does change the semantics of the URL.

For scraping, some links may include the www label and others may omit it: human-written links would probably omit it, while automatically generated links would probably include it for completeness. This could result in a page with two seemingly different tasks that actually point to the same page.
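
To illustrate the kind of option being requested, here is a minimal sketch. It uses Python's standard `urllib.parse` as a stand-in, not the Url package this issue refers to. The function name `normalise_url` and the `strip_www` flag are hypothetical, chosen only for this example.

```python
from urllib.parse import urlsplit, urlunsplit


def normalise_url(url: str, strip_www: bool = False) -> str:
    """Return a comparable form of `url`.

    `strip_www` is a hypothetical opt-in flag: when set, a leading
    "www." label is removed from the host before comparison.
    """
    parts = urlsplit(url)
    # `hostname` is already lower-cased by urlsplit; guard against None.
    host = parts.hostname or ""
    if strip_www and host.startswith("www."):
        host = host[len("www."):]
    # Rebuild the network location, keeping an explicit port if present.
    # Note: userinfo is dropped and IPv6 literals are not handled here.
    netloc = host if parts.port is None else f"{host}:{parts.port}"
    return urlunsplit(
        (parts.scheme.lower(), netloc, parts.path, parts.query, parts.fragment)
    )


a = "https://www.example.com/page"
b = "https://example.com/page"

# Default behaviour: the www label is significant, so the URLs differ.
strict_equal = normalise_url(a) == normalise_url(b)

# With the option enabled, both links collapse to the same task key.
relaxed_equal = normalise_url(a, strip_www=True) == normalise_url(b, strip_www=True)
```

Keeping the flag off by default preserves the current, semantically strict comparison, while a scraper could opt in to avoid queueing the same page twice.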

Labels: enhancement (New feature or request), req/could-have (could be a nice to finish before deadline)
