Skip to content

Add UCODE_USE_MODEL_SERVICES opt-in for UC model-services discovery#151

Open
AarushiShah-db wants to merge 1 commit into
mainfrom
model-services-discovery
Open

Add UCODE_USE_MODEL_SERVICES opt-in for UC model-services discovery#151
AarushiShah-db wants to merge 1 commit into
mainfrom
model-services-discovery

Conversation

@AarushiShah-db

Copy link
Copy Markdown
Collaborator

When UCODE_USE_MODEL_SERVICES is set, discover models via the Unity Catalog model-services API and address them as system.ai. instead of the per-family AI Gateway listings. Base URLs are unchanged — only the model name differs.

The flag is sticky: it's persisted into state at configure time so launches honor it without re-exporting the env var (an explicit env var still wins). Discovery uses a small page size with per-page retries because the metastore listing is slow and frequently 504s, and filters to system.ai.* so model services from other schemas don't leak into the family buckets.

Co-authored-by: Isaac

When UCODE_USE_MODEL_SERVICES is set, discover models via the Unity Catalog
model-services API and address them as system.ai.<model-name> instead of the
per-family AI Gateway listings. Base URLs are unchanged — only the model name
differs.

The flag is sticky: it's persisted into state at configure time so launches
honor it without re-exporting the env var (an explicit env var still wins).
Discovery uses a small page size with per-page retries because the metastore
listing is slow and frequently 504s, and filters to system.ai.* so model
services from other schemas don't leak into the family buckets.

Co-authored-by: Isaac
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant