Extract Editable Text from an ODT #5

@orcmid

Description

Using an Apache OpenOffice ODT file as input, convert it to editable text files with pandoc. Demonstrate the conversion and make it reproducible.

  1. Determine what happens to the document's pages and images.
  2. See how to obtain finer-grained text pages from the conversion so that they can be edited easily.
  3. Work out how cross-references are reflected in the output and whether they are preserved.
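
As a starting point for items 1 and 2, here is a minimal sketch in Python. It assumes pandoc is on the PATH and that GitHub-Flavored Markdown is an acceptable target format. The file names, the `media` folder, and the helper names (`build_pandoc_command`, `convert`, `split_by_heading`) are hypothetical and chosen only for illustration. Splitting at level-1 headings is one possible way to get smaller editable files, not something pandoc does on its own.

```python
import re
import subprocess


def build_pandoc_command(odt_path, out_path, media_dir):
    """Return the pandoc argument list for an ODT -> Markdown conversion."""
    return [
        "pandoc",
        str(odt_path),
        "--from=odt",
        "--to=gfm",                       # assumed target: GitHub-Flavored Markdown
        f"--extract-media={media_dir}",   # write embedded images into this folder
        "--wrap=none",                    # do not hard-wrap paragraphs
        f"--output={out_path}",
    ]


def convert(odt_path, out_path, media_dir="media"):
    """Run pandoc; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_pandoc_command(odt_path, out_path, media_dir), check=True)


def split_by_heading(markdown_text):
    """Split Markdown into chunks, starting a new chunk at each line that begins with '# '.

    Caveat: a '# ' line inside a fenced code block would also start a chunk.
    """
    chunks = re.split(r"(?m)^(?=# )", markdown_text)
    return [chunk for chunk in chunks if chunk.strip()]
```

Calling `convert("example.odt", "example.md")` and then passing the contents of `example.md` to `split_by_heading` would give one chunk per top-level heading. How page breaks and cross-references come through still has to be checked against the actual output.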

Repeat these steps until we have a decent assessment of how well an ODT file's content can be preserved while being converted into suitable editable text forms.

Create a reproducible case in the repository so that someone else can perform the same operations.

Provide some guidance on how to install pandoc and how its usage fits here. Include a screen capture of the command-line operation.
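
Before running the conversion, it helps to confirm that pandoc is installed and to record which version was used, since output can differ between releases. The sketch below is one way to do that in Python. The function name `pandoc_version` is hypothetical, and the only pandoc behavior it relies on is that `pandoc --version` prints version information.

```python
import shutil
import subprocess


def pandoc_version():
    """Return the first line of `pandoc --version`, or None if pandoc is not on PATH."""
    exe = shutil.which("pandoc")
    if exe is None:
        return None
    result = subprocess.run(
        [exe, "--version"], capture_output=True, text=True, check=True
    )
    return result.stdout.splitlines()[0]


if __name__ == "__main__":
    version = pandoc_version()
    print(version or "pandoc not found; install it before running the conversion")
```

Recording the reported version alongside the converted files would make the reproducible case easier for others to repeat.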

Labels: task (actions requirement to accomplish a particular task)