examples: add kthena.io/model-name annotation to ModelServing samples by WHOIM1205 · Pull Request #871 · volcano-sh/kthena

WHOIM1205 · 2026-04-07T07:36:36Z

Summary

Add kthena.io/model-name annotation to ModelServing example manifests
Helps map ModelServing resources to the model label in Prometheus metrics

Scope

Examples only (no code or behavioral changes)

Test Plan

Verified manifests apply successfully using:
kubectl apply -f examples/model-serving/sample.yaml --dry-run=client

ref: Proposal: Prometheus + KEDA autoscaling for ModelServing #868

This PR implements the first step from the proposal:
Proposal: Prometheus + KEDA autoscaling for ModelServing #868

It introduces annotation-based model mapping in examples and docs.

Signed-off-by: WHOIM1205 <rathourprateek8@gmail.com>

Copilot

Pull request overview

This PR adds the kthena.io/model-name annotation to selected ModelServing example manifests to demonstrate the recommended pattern for mapping ModelServing resources to Prometheus metrics. The annotation helps associate a ModelServing resource with its corresponding model label in metrics-based autoscaling scenarios.

Changes:

Added kthena.io/model-name annotation to examples/model-serving/sample.yaml and examples/model-serving/data-parallel-deployment.yaml
Added the same annotation to the documentation example at docs/kthena/docs/assets/examples/model-serving/sample.yaml
Included clarifying comment explaining the annotation's purpose

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
examples/model-serving/sample.yaml	Added kthena.io/model-name annotation with value "sample-model"
examples/model-serving/data-parallel-deployment.yaml	Added kthena.io/model-name annotation with value "my-model"
docs/kthena/docs/assets/examples/model-serving/sample.yaml	Added kthena.io/model-name annotation with value "sample-model" (documentation mirror)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-07T07:38:18Z

+  annotations:
+    # Identifies the model for metrics-based autoscaling (must match Prometheus "model" label).
+    kthena.io/model-name: "sample-model"


This PR adds the kthena.io/model-name annotation to some ModelServing examples but appears incomplete. Other example files in the same directory lack this annotation: gangPolicy.yaml, gpu-pd-disaggregation.yaml, multi-node.yaml, network-topology.yaml, prefill-decode-disaggregation.yaml, and rollingupdate.yaml. For consistency, all ModelServing example manifests should include this annotation to demonstrate the recommended pattern.

gemini-code-assist

Code Review

This pull request adds the kthena.io/model-name annotation to several ModelServing example configuration files to enable metrics-based autoscaling. Feedback was provided regarding the consistency of the annotation value with the resource name in the sample.yaml files to maintain clarity across the documentation.

gemini-code-assist · 2026-04-07T07:38:43Z

  namespace: default
+  annotations:
+    # Identifies the model for metrics-based autoscaling (must match Prometheus "model" label).
+    kthena.io/model-name: "sample-model"


The value of the kthena.io/model-name annotation (sample-model) is inconsistent with the resource name (sample). In other examples, such as data-parallel-deployment.yaml, the annotation value matches the resource name. For better clarity and consistency across the documentation and examples, it is recommended to use the resource name as the model name value unless there is a specific reason to demonstrate they can differ.

kthena.io/model-name: "sample"

gemini-code-assist · 2026-04-07T07:38:43Z

  namespace: default
+  annotations:
+    # Identifies the model for metrics-based autoscaling (must match Prometheus "model" label).
+    kthena.io/model-name: "sample-model"


The value of the kthena.io/model-name annotation (sample-model) is inconsistent with the resource name (sample). In other examples, such as data-parallel-deployment.yaml, the annotation value matches the resource name. For better clarity and consistency across the documentation and examples, it is recommended to use the resource name as the model name value unless there is a specific reason to demonstrate they can differ.

kthena.io/model-name: "sample"

LiZhenCheng9527 · 2026-04-07T12:41:53Z

/lgtm

hzxuzhonghu

Wait: if this annotation requires user to set, why do we need to add to our docs

WHOIM1205 · 2026-04-08T09:07:39Z

hey @hzxuzhonghu
the idea was just to make the mapping to the prometheus model label visible in examples since autoscaling will depend on it it’s not required just a convention for clarity

hzxuzhonghu · 2026-04-09T12:07:44Z

So actually it can be any label key, right? Please only add the label when you document autosclaing example.

…mples Signed-off-by: WHOIM1205 <rathourprateek8@gmail.com>

volcano-sh-bot · 2026-04-09T18:55:26Z

New changes are detected. LGTM label has been removed.

volcano-sh-bot · 2026-04-09T18:55:30Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

WHOIM1205 · 2026-04-09T18:55:57Z

hey @hzxuzhonghu
Updated the PR based on feedback removed the annotation from examples for now will introduce it later along with autoscaling support

LiZhenCheng9527 · 2026-04-15T07:49:07Z

Will there be any further updates to this PR?

WHOIM1205 · 2026-04-15T10:35:43Z

Will there be any further updates to this PR?

no further updates planned for this pr

this was part of an earlier approach but based on feedback i moved away from adding annotations directly

ive opened a new pr with the updated direction focusing on autoscaling example using user defined labels

examples: add kthena.io/model-name annotation to ModelServing samples

946a612

Signed-off-by: WHOIM1205 <rathourprateek8@gmail.com>

Copilot AI review requested due to automatic review settings April 7, 2026 07:36

volcano-sh-bot requested review from LiZhenCheng9527 and git-malu April 7, 2026 07:36

volcano-sh-bot added the size/XS label Apr 7, 2026

Copilot started reviewing on behalf of WHOIM1205 April 7, 2026 07:37 View session

Copilot AI reviewed Apr 7, 2026

View reviewed changes

gemini-code-assist Bot reviewed Apr 7, 2026

View reviewed changes

volcano-sh-bot assigned LiZhenCheng9527 Apr 7, 2026

volcano-sh-bot added the lgtm label Apr 7, 2026

hzxuzhonghu reviewed Apr 8, 2026

View reviewed changes

examples: remove kthena.io/model-name annotation from ModelServing sa…

04293d0

…mples Signed-off-by: WHOIM1205 <rathourprateek8@gmail.com>

volcano-sh-bot removed the lgtm label Apr 9, 2026

Conversation

WHOIM1205 commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Scope

Test Plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

LiZhenCheng9527 commented Apr 7, 2026

Uh oh!

hzxuzhonghu left a comment

Choose a reason for hiding this comment

Uh oh!

WHOIM1205 commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hzxuzhonghu commented Apr 9, 2026

Uh oh!

volcano-sh-bot commented Apr 9, 2026

Uh oh!

volcano-sh-bot commented Apr 9, 2026

Uh oh!

WHOIM1205 commented Apr 9, 2026

Uh oh!

LiZhenCheng9527 commented Apr 15, 2026

Uh oh!

WHOIM1205 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

WHOIM1205 commented Apr 7, 2026 •

edited

Loading

WHOIM1205 commented Apr 8, 2026 •

edited

Loading