Collaborator

@jeremyestein jeremyestein commented Dec 18, 2025

Split into three containers, each with its own config files:

  • waveform-controller: the Emap -> CSV part of the pipeline.
  • waveform-exporter: converts to Parquet (original and pseudonymised) and uploads via FTPS.
  • waveform-hasher: from PIXL. Currently not used.

There is a toy hasher built into our code just to get the full pipeline going. This will be replaced with waveform-hasher when we get our credentials for the keyvault.
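
As an illustration only, a toy hasher of this kind could be as small as the sketch below. The PR does not show the built-in hasher, so the function name `toy_hash`, the keyed SHA-256 approach, and the 16-character truncation are all assumptions. Nothing here is suitable for real pseudonymisation.

```python
import hashlib
import hmac


def toy_hash(identifier: str, salt: bytes, length: int = 16) -> str:
    """Return a deterministic, truncated keyed hash of an identifier.

    Hypothetical stand-in: the real pipeline is expected to call
    waveform-hasher instead of hashing locally.
    """
    digest = hmac.new(salt, identifier.encode("utf-8"), hashlib.sha256).hexdigest()
    # Truncate to keep the pseudonym short; this raises the collision risk.
    return digest[:length]
```

The same identifier and salt always give the same pseudonym, which is what lets the rest of the pipeline be exercised before the real hasher is available.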

The exporter container runs cron, but the crontab doesn't schedule anything yet. Each function can still be run as an individual command.
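
One way to expose functions as individual commands is a subcommand-style entry point, sketched here under assumptions: the subcommand names `convert` and `upload` and the program name are hypothetical, not taken from the PR. A crontab entry could later call the same commands.

```python
import argparse
from typing import Optional, Sequence


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with one subcommand per exporter step (names are hypothetical)."""
    parser = argparse.ArgumentParser(prog="exporter-sketch")
    subcommands = parser.add_subparsers(dest="command", required=True)
    subcommands.add_parser("convert", help="convert CSV files to Parquet")
    subcommands.add_parser("upload", help="upload files via FTPS")
    return parser


def main(argv: Optional[Sequence[str]] = None) -> str:
    """Parse the command line and report which step would run."""
    args = build_parser().parse_args(argv)
    # A real implementation would dispatch to the matching function here.
    print(f"would run step: {args.command}")
    return args.command
```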

All data files are accessed through the mounted directory, with careful separation between pseudonymised files and originals.
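
A minimal sketch of one way to enforce that kind of separation in code, assuming each category of file has its own subdirectory under the mount. The helper name `path_within` and the directory names in the usage note are hypothetical.

```python
from pathlib import Path


def path_within(root: Path, relative: str) -> Path:
    """Resolve `relative` under `root`, refusing paths that escape `root`."""
    resolved_root = root.resolve()
    candidate = (resolved_root / relative).resolve()
    # Requires Python 3.9+ for Path.is_relative_to.
    if not candidate.is_relative_to(resolved_root):
        raise ValueError(f"{relative!r} escapes {resolved_root}")
    return candidate
```

For example, `path_within(Path("mount/pseudonymised"), "../original/a.csv")` raises `ValueError`, so code that writes pseudonymised output cannot reach the originals by accident.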

@jeremyestein jeremyestein marked this pull request as ready for review December 18, 2025 15:07
Collaborator

@thompson318 thompson318 left a comment

I've had a go at running this. I like the structure with the controller, hasher, and exporter in separate containers. I'll keep playing around with it.
It needs some instructions on installing PIXL (or maybe add the PIXL installation to docker compose?).
Also, the config files had to be in the parent directory (../config, not ./config). I also couldn't get the exporter crontab working.

("location", pa.string()),
# decimal32 can have a maximum of 9 significant digits.
# We can go to 64 if needed but let's try and keep it compact.
# But they're not exposed?? Use 128 instead.
Collaborator

I don't understand this comment.

Collaborator Author

Thanks. Clarified
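
For readers following the thread, the trade-off in the quoted code comment can be sketched with the precision limits of Arrow's decimal types: 9 significant digits for decimal32, 18 for decimal64, and 38 for decimal128. The helper names below are illustrative, and the sketch uses only the Python standard library rather than pyarrow.

```python
from decimal import Decimal

# Maximum precision (significant decimal digits) for each Arrow decimal width.
MAX_PRECISION = {32: 9, 64: 18, 128: 38, 256: 76}


def significant_digits(value: str) -> int:
    """Count the digits in a decimal literal's coefficient, e.g. '123.450' has 6."""
    return len(Decimal(value).as_tuple().digits)


def smallest_decimal_width(precision: int) -> int:
    """Return the narrowest decimal bit width that can hold `precision` digits."""
    for width in sorted(MAX_PRECISION):
        if precision <= MAX_PRECISION[width]:
            return width
    raise ValueError(f"precision {precision} exceeds decimal256")
```

A value needing 9 digits fits the narrowest type, but if the narrower types are not available in the library version in use, decimal128 is a fallback that still holds the same values.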

Collaborator

@thompson318 thompson318 left a comment

Looks good, thank you.

@jeremyestein jeremyestein merged commit 166b090 into dev Dec 23, 2025
1 check passed
@jeremyestein jeremyestein deleted the jeremy/pseudon branch December 23, 2025 13:31

3 participants