Configuring borrowing and lending limits

When ClusterQueues belong to the same cohort, they can share borrowable resources. By default, the unused nominal quota of all the ClusterQueues in a cohort is available for borrowing by other ClusterQueues. You can use borrowing and lending limits to control how much each ClusterQueue can borrow from or lend to other ClusterQueues in the cohort.

Borrowing limit

The borrowingLimit field on a resource within a ClusterQueue defines the maximum amount of unused quota that the ClusterQueue can borrow from the cohort. If not set, there is no borrowing limit: the ClusterQueue can borrow any unused quota that the cohort makes available.

Example: ClusterQueue with borrowing limit

apiVersion: kueue.x-k8s.io/v1beta2
kind: ClusterQueue
metadata:
  name: team-a-queue
spec:
  namespaceSelector: {}
  cohort: shared-cohort
  resourceGroups:
  - coveredResources: ["cpu", "memory", "nvidia.com/gpu"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 8
        borrowingLimit: 4
      - name: "memory"
        nominalQuota: 32Gi
        borrowingLimit: 16Gi
      - name: "nvidia.com/gpu"
        nominalQuota: 2
        borrowingLimit: 2

borrowingLimit for CPU: This ClusterQueue can borrow up to 4 additional CPU cores from the cohort, for a total usage of up to 12 CPU cores (8 nominal + 4 borrowed).
borrowingLimit for memory: This ClusterQueue can borrow up to 16Gi of additional memory, for a total usage of up to 48Gi (32Gi nominal + 16Gi borrowed).
borrowingLimit for GPU: This ClusterQueue can borrow up to 2 additional GPUs, for a total usage of up to 4 GPUs (2 nominal + 2 borrowed).
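The arithmetic behind these limits can be sketched in a few lines of Python. This is an illustrative model only, not Kueue code: the max_usage helper and the plain-integer units (CPU cores, GiB, GPU count) are assumptions made for the example, and the amount actually borrowed also depends on what the cohort has available.

```python
from typing import Optional


def max_usage(nominal_quota: int, borrowing_limit: Optional[int]) -> Optional[int]:
    """Upper bound on usage for one resource (hypothetical helper).

    Returns None when borrowingLimit is unset, because the field then
    imposes no cap of its own.
    """
    if borrowing_limit is None:
        return None
    return nominal_quota + borrowing_limit


# (nominalQuota, borrowingLimit) pairs from the team-a-queue example above.
team_a = {"cpu": (8, 4), "memory_gi": (32, 16), "nvidia.com/gpu": (2, 2)}

caps = {name: max_usage(nominal, limit) for name, (nominal, limit) in team_a.items()}
print(caps)  # {'cpu': 12, 'memory_gi': 48, 'nvidia.com/gpu': 4}
```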

Lending limit

The lendingLimit field on a resource within a ClusterQueue defines the maximum amount of unused quota that the ClusterQueue is willing to lend to other ClusterQueues in the cohort. If not set, the entire unused quota is available for lending.

Example: ClusterQueue with lending limit

apiVersion: kueue.x-k8s.io/v1beta2
kind: ClusterQueue
metadata:
  name: team-b-queue
spec:
  namespaceSelector: {}
  cohort: shared-cohort
  resourceGroups:
  - coveredResources: ["cpu", "memory", "nvidia.com/gpu"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 16
        lendingLimit: 8
      - name: "memory"
        nominalQuota: 64Gi
        lendingLimit: 32Gi
      - name: "nvidia.com/gpu"
        nominalQuota: 4
        lendingLimit: 0

lendingLimit for CPU: This ClusterQueue lends at most 8 unused CPU cores to other ClusterQueues in the cohort, reserving a minimum of 8 CPU cores for itself (16 nominal - 8 lendable).
lendingLimit for memory: This ClusterQueue lends at most 32Gi of unused memory, reserving a minimum of 32Gi for itself.
lendingLimit of 0 for GPU: This ClusterQueue does not lend any GPUs to other ClusterQueues, even when its GPUs are unused. This is useful for reserving expensive GPU resources exclusively for a specific team.
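To see how the lending limit interacts with the queue's own usage, the following Python sketch models the amount on offer to the cohort as the smaller of the unused quota and the lendingLimit. The lendable helper and the usage figures are hypothetical, chosen only to illustrate the rule described above; this is not how Kueue is implemented internally.

```python
from typing import Optional


def lendable(nominal_quota: int, used: int, lending_limit: Optional[int]) -> int:
    """Quota this queue offers to the cohort for one resource (hypothetical helper)."""
    unused = max(nominal_quota - used, 0)
    if lending_limit is None:
        # lendingLimit unset: the entire unused quota is available for lending.
        return unused
    return min(unused, lending_limit)


# CPU on team-b-queue: nominalQuota 16, lendingLimit 8.
idle_cpu = lendable(16, used=0, lending_limit=8)    # 8: capped by lendingLimit
busy_cpu = lendable(16, used=12, lending_limit=8)   # 4: only 4 cores are unused

# GPU on team-b-queue: lendingLimit 0 means nothing is lent, even when idle.
idle_gpu = lendable(4, used=0, lending_limit=0)     # 0

print(idle_cpu, busy_cpu, idle_gpu)  # 8 4 0
```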

Combined example: Two teams sharing a cohort

The following example shows two teams sharing resources within a cohort, with controlled borrowing and lending:

Team A ClusterQueue:

apiVersion: kueue.x-k8s.io/v1beta2
kind: ClusterQueue
metadata:
  name: team-a-queue
spec:
  namespaceSelector:
    matchLabels:
      kubernetes.io/metadata.name: team-a
  cohort: shared-cohort
  resourceGroups:
  - coveredResources: ["cpu", "memory", "nvidia.com/gpu"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 8
        borrowingLimit: 8
        lendingLimit: 4
      - name: "memory"
        nominalQuota: 32Gi
        borrowingLimit: 32Gi
        lendingLimit: 16Gi
      - name: "nvidia.com/gpu"
        nominalQuota: 2
        borrowingLimit: 2
        lendingLimit: 1

Team B ClusterQueue:

apiVersion: kueue.x-k8s.io/v1beta2
kind: ClusterQueue
metadata:
  name: team-b-queue
spec:
  namespaceSelector:
    matchLabels:
      kubernetes.io/metadata.name: team-b
  cohort: shared-cohort
  resourceGroups:
  - coveredResources: ["cpu", "memory", "nvidia.com/gpu"]
    flavors:
    - name: "default-flavor"
      resources:
      - name: "cpu"
        nominalQuota: 16
        borrowingLimit: 4
        lendingLimit: 8
      - name: "memory"
        nominalQuota: 64Gi
        borrowingLimit: 16Gi
        lendingLimit: 32Gi
      - name: "nvidia.com/gpu"
        nominalQuota: 4
        borrowingLimit: 1
        lendingLimit: 2

In this configuration:

Team A has 8 CPU nominal quota, can borrow up to 8 more, and will lend up to 4 when unused.
Team B has 16 CPU nominal quota, can borrow up to 4 more, and will lend up to 8 when unused.
GPU lending is tightly controlled: Team A lends at most 1 GPU, Team B lends at most 2 GPUs.
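As a rough check that the two configurations line up, the sketch below pairs each team's borrowingLimit with the other team's lendingLimit. It assumes a cohort containing only these two ClusterQueues and models the borrowable amount as the smaller of the borrower's limit and what the lender offers; the Quota class and borrowable_from function are illustrative names, not part of the Kueue API.

```python
from dataclasses import dataclass


@dataclass
class Quota:
    """Per-resource settings of one ClusterQueue (illustrative model)."""
    nominal: int
    borrowing_limit: int
    lending_limit: int


def borrowable_from(borrower: Quota, lender: Quota, lender_used: int = 0) -> int:
    """How much `borrower` could take from `lender` in a two-queue cohort."""
    lender_unused = max(lender.nominal - lender_used, 0)
    on_offer = min(lender_unused, lender.lending_limit)
    return min(borrower.borrowing_limit, on_offer)


# CPU and GPU values copied from the two ClusterQueues above.
team_a = {"cpu": Quota(8, 8, 4), "gpu": Quota(2, 2, 1)}
team_b = {"cpu": Quota(16, 4, 8), "gpu": Quota(4, 1, 2)}

# Each lender is assumed to be idle (lender_used defaults to 0).
a_from_b = {r: borrowable_from(team_a[r], team_b[r]) for r in team_a}
b_from_a = {r: borrowable_from(team_b[r], team_a[r]) for r in team_b}

print(a_from_b)  # {'cpu': 8, 'gpu': 2}
print(b_from_a)  # {'cpu': 4, 'gpu': 1}
```

In this example each team's borrowingLimit equals the other team's lendingLimit, so neither limit leaves lendable quota stranded when the other team is idle.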

TIP

Use borrowing and lending limits to prevent one team from consuming all the shared resources in a cohort. This is especially important for expensive resources like GPUs.
