What would you like to be added:
See the whole list: https://github.com/InftyAI/llmaz/milestone/3
We'll focus on three main things:
Glad to have like:
Why is this needed:
Completion requirements:
This enhancement requires the following artifacts:
The artifacts should be linked in subsequent comments.
What would you like to be added:
See the whole list: https://github.com/InftyAI/llmaz/milestone/3
We'll focus on three main things:
xPyD serving with heterogeneous devices, we need a new orchestration layer build on top of lws
More advanced routing policies, e.g. based on request profile & GPU type
GPU spot instances scaling ready for production env
Glad to have like:
Why is this needed:
Completion requirements:
This enhancement requires the following artifacts:
The artifacts should be linked in subsequent comments.