@saksham-jain177

Embedding models were previously forced onto the GPU via set_device(0), which
can cause stale CUDA context errors in long-running sessions (as described in #6).

This change makes the embedding device configurable via the EMBEDDING_DEVICE
environment variable (default: CPU), while preserving optional GPU usage for
users who want it.
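
For reviewers, the lookup could be sketched roughly as below. This is an illustration and not the code in the diff: the helper name `resolve_embedding_device` is hypothetical, and lowercasing the value and treating an empty string as unset are assumptions. Only the `EMBEDDING_DEVICE` variable name and the CPU default come from this PR.

```python
import os


def resolve_embedding_device(environ=None, default="cpu"):
    """Return the device string to use for the embedding model.

    Hypothetical helper: reads EMBEDDING_DEVICE and falls back to CPU
    when the variable is unset or blank.
    """
    env = os.environ if environ is None else environ
    # Normalise so "CUDA:0" and " cuda:0 " are treated alike (an assumption).
    value = env.get("EMBEDDING_DEVICE", "").strip().lower()
    return value or default


if __name__ == "__main__":
    # Prints the default ("cpu") unless EMBEDDING_DEVICE is set in the shell.
    print(resolve_embedding_device())
```

The returned string would then be handed to whatever device argument the embedding loader accepts, so the GPU is only touched when a user opts in explicitly.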
