Kueue Configuration API
Resource Types
ClientConnection
Appears in:
Field | Description |
---|---|
qps [Required]float32
|
QPS controls the number of queries per second allowed for K8S api server connection. |
burst [Required]int32
|
Burst allows extra queries to accumulate when a client is exceeding its rate. |
ClusterQueueVisibility
Appears in:
Field | Description |
---|---|
maxCount [Required]int32
|
MaxCount indicates the maximal number of pending workloads exposed in the cluster queue status. When the value is set to 0, then ClusterQueue visibility updates are disabled. The maximal value is 4000. Defaults to 10. |
Configuration
Configuration is the Schema for the kueueconfigurations API
Field | Description |
---|---|
namespace [Required]string
|
Namespace is the namespace in which kueue is deployed. It is used as part of DNSName of the webhook Service. If not set, the value is set from the file /var/run/secrets/kubernetes.io/serviceaccount/namespace If the file doesn't exist, default value is kueue-system. |
ControllerManager [Required]ControllerManager
|
(Members of ControllerManager are embedded into this type.)
ControllerManager returns the configurations for controllers |
manageJobsWithoutQueueName [Required]bool
|
ManageJobsWithoutQueueName controls whether or not Kueue reconciles batch/v1.Jobs that don't set the annotation kueue.x-k8s.io/queue-name. If set to true, then those jobs will be suspended and never started unless they are assigned a queue and eventually admitted. This also applies to jobs created before starting the kueue controller. Defaults to false; therefore, those jobs are not managed and if they are created unsuspended, they will start immediately. |
internalCertManagement [Required]InternalCertManagement
|
InternalCertManagement is configuration for internalCertManagement |
waitForPodsReady [Required]WaitForPodsReady
|
WaitForPodsReady is configuration to provide simple all-or-nothing scheduling semantics for jobs to ensure they get resources assigned. This is achieved by blocking the start of new jobs until the previously started job has all pods running (ready). |
clientConnection [Required]ClientConnection
|
ClientConnection provides additional configuration options for Kubernetes API server client. |
integrations [Required]Integrations
|
Integrations provide configuration options for AI/ML/Batch frameworks integrations (including K8S job). |
queueVisibility [Required]QueueVisibility
|
QueueVisibility is configuration to expose the information about the top pending workloads. |
multiKueue [Required]MultiKueue
|
MultiKueue controls the behaviour of the MultiKueue AdmissionCheck Controller. |
ControllerConfigurationSpec
Appears in:
ControllerConfigurationSpec defines the global configuration for controllers registered with the manager.
Field | Description |
---|---|
groupKindConcurrency map[string]int
|
GroupKindConcurrency is a map from a Kind to the number of concurrent reconciliation allowed for that controller. When a controller is registered within this manager using the builder utilities, users have to specify the type the controller reconciles in the For(...) call. If the object's kind passed matches one of the keys in this map, the concurrency for that controller is set to the number specified. The key is expected to be consistent in form with GroupKind.String(),
e.g. ReplicaSet in apps group (regardless of version) would be |
cacheSyncTimeout time.Duration
|
CacheSyncTimeout refers to the time limit set to wait for syncing caches. Defaults to 2 minutes if not set. |
ControllerHealth
Appears in:
ControllerHealth defines the health configs.
Field | Description |
---|---|
healthProbeBindAddress string
|
HealthProbeBindAddress is the TCP address that the controller should bind to for serving health probes It can be set to "0" or "" to disable serving the health probe. |
readinessEndpointName string
|
ReadinessEndpointName, defaults to "readyz" |
livenessEndpointName string
|
LivenessEndpointName, defaults to "healthz" |
ControllerManager
Appears in:
Field | Description |
---|---|
webhook ControllerWebhook
|
Webhook contains the controllers webhook configuration |
leaderElection k8s.io/component-base/config/v1alpha1.LeaderElectionConfiguration
|
LeaderElection is the LeaderElection config to be used when configuring the manager.Manager leader election |
metrics ControllerMetrics
|
Metrics contains the controller metrics configuration |
health ControllerHealth
|
Health contains the controller health configuration |
pprofBindAddress string
|
PprofBindAddress is the TCP address that the controller should bind to for serving pprof. It can be set to "" or "0" to disable the pprof serving. Since pprof may contain sensitive information, make sure to protect it before exposing it to public. |
controller ControllerConfigurationSpec
|
Controller contains global configuration options for controllers registered within this manager. |
ControllerMetrics
Appears in:
ControllerMetrics defines the metrics configs.
Field | Description |
---|---|
bindAddress string
|
BindAddress is the TCP address that the controller should bind to for serving prometheus metrics. It can be set to "0" to disable the metrics serving. |
enableClusterQueueResources bool
|
EnableClusterQueueResources, if true the cluster queue resource usage and quotas metrics will be reported. |
ControllerWebhook
Appears in:
ControllerWebhook defines the webhook server for the controller.
Field | Description |
---|---|
port int
|
Port is the port that the webhook server serves at. It is used to set webhook.Server.Port. |
host string
|
Host is the hostname that the webhook server binds to. It is used to set webhook.Server.Host. |
certDir string
|
CertDir is the directory that contains the server key and certificate. if not set, webhook server would look up the server key and certificate in {TempDir}/k8s-webhook-server/serving-certs. The server key and certificate must be named tls.key and tls.crt, respectively. |
Integrations
Appears in:
Field | Description |
---|---|
frameworks [Required][]string
|
List of framework names to be enabled. Possible options:
|
podOptions [Required]PodIntegrationOptions
|
PodOptions defines kueue controller behaviour for pod objects |
InternalCertManagement
Appears in:
Field | Description |
---|---|
enable [Required]bool
|
Enable controls whether to enable internal cert management or not. Defaults to true. If you want to use a third-party management, e.g. cert-manager, set it to false. See the user guide for more information. |
webhookServiceName [Required]string
|
WebhookServiceName is the name of the Service used as part of the DNSName. Defaults to kueue-webhook-service. |
webhookSecretName [Required]string
|
WebhookSecretName is the name of the Secret used to store CA and server certs. Defaults to kueue-webhook-server-cert. |
MultiKueue
Appears in:
Field | Description |
---|---|
gcInterval k8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
GCInterval defines the time interval between two consecutive garbage collection runs. Defaults to 1min. If 0, the garbage collection is disabled. |
origin string
|
Origin defines a label value used to track the creator of workloads in the worker clusters. This is used by multikueue in components like its garbage collector to identify remote objects that ware created by this multikueue manager cluster and delete them if their local counterpart no longer exists. |
workerLostTimeout k8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
WorkerLostTimeout defines the time a local workload's multikueue admission check state is kept Ready if the connection with its reserving worker cluster is lost. Defaults to 15 minutes. |
PodIntegrationOptions
Appears in:
Field | Description |
---|---|
namespaceSelector [Required]k8s.io/apimachinery/pkg/apis/meta/v1.LabelSelector
|
NamespaceSelector can be used to omit some namespaces from pod reconciliation |
podSelector [Required]k8s.io/apimachinery/pkg/apis/meta/v1.LabelSelector
|
PodSelector can be used to choose what pods to reconcile |
QueueVisibility
Appears in:
Field | Description |
---|---|
clusterQueues [Required]ClusterQueueVisibility
|
ClusterQueues is configuration to expose the information about the top pending workloads in the cluster queue. |
updateIntervalSeconds [Required]int32
|
UpdateIntervalSeconds specifies the time interval for updates to the structure of the top pending workloads in the queues. The minimum value is 1. Defaults to 5. |
RequeuingStrategy
Appears in:
Field | Description |
---|---|
timestamp RequeuingTimestamp
|
Timestamp defines the timestamp used for re-queuing a Workload that was evicted due to Pod readiness. The possible values are:
|
backoffLimitCount int32
|
BackoffLimitCount defines the maximum number of re-queuing retries.
Once the number is reached, the workload is deactivated ( Every backoff duration is about "1.41284738^(n-1)+Rand" where the "n" represents the "workloadStatus.requeueState.count", and the "Rand" represents the random jitter. During this time, the workload is taken as an inadmissible and other workloads will have a chance to be admitted. For example, when the "waitForPodsReady.timeout" is the default, the workload deactivation time is as follows: {backoffLimitCount, workloadDeactivationSeconds} ~= {1, 601}, {2, 902}, ...,{5, 1811}, ...,{10, 3374}, ...,{20, 8730}, ...,{30, 86400(=24 hours)}, ... Defaults to null. |
RequeuingTimestamp
(Alias of string
)
Appears in:
WaitForPodsReady
Appears in:
Field | Description |
---|---|
enable [Required]bool
|
Enable when true, indicates that each admitted workload
blocks the admission of all other workloads from all queues until it is in the
|
timeout k8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
Timeout defines the time for an admitted workload to reach the PodsReady=true condition. When the timeout is reached, the workload admission is cancelled and requeued in the same cluster queue. Defaults to 5min. |
blockAdmission [Required]bool
|
BlockAdmission when true, cluster queue will block admissions for all subsequent jobs until the jobs reach the PodsReady=true condition. It defaults to false if Enable is false and defaults to true otherwise. |
requeuingStrategy RequeuingStrategy
|
RequeuingStrategy defines the strategy for requeuing a Workload. |
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.