Helm Installation

The Helm project has ended support for Helm 2 as of November, 2020. As of Spring Cloud Data Flow 2.7.0, the chart is based on Helm 3, dropping support for Helm 2.

You must migrate from Helm 2 to Helm 3. To prepare for the migration, you should read the Helm v2 to v3 Migration Guide. Additionally, you can find some helpful tips on data migration and upgrades in the post-migration issues article.

As of Spring Cloud Data Flow 2.6.1, the Bitnami team maintains the Helm chart. To report bugs or request features, use the Bitnami Issue Tracker.

Spring Cloud Data Flow offers a Helm Chart for deploying the Spring Cloud Data Flow server and its required services to a Kubernetes Cluster.

The following sections cover how to initialize Helm and install Spring Cloud Data Flow on a Kubernetes cluster.

If you use Minikube, see Setting Minikube Resources for details on CPU and RAM resource requirements.

Installing Spring Cloud Data Flow Server and Required Services

You should review the following documentation and adjust any parameter customizations that have been made for your environment or how they may differ from the legacy official Helm chart. Value names, defaults, and so on may have changed during the Bitnami chart migration. You can find more information in the Parameter tables and the Upgrading and Notable Changes sections.

Spring Cloud Data Flow packaged by Bitnami

Spring Cloud Data Flow is a microservices-based toolkit for building streaming and batch data processing pipelines in Cloud Foundry and Kubernetes.

Overview of Spring Cloud Data Flow

TL;DR

helm repo add bitnami https://charts.bitnami.com/bitnami
helm install my-release bitnami/spring-cloud-dataflow

Introduction

This chart bootstraps a Spring Cloud Data Flow deployment on a Kubernetes cluster using the Helm package manager.

Bitnami charts can be used with Kubeapps for deployment and management of Helm Charts in clusters.

Prerequisites

  • Kubernetes 1.19+
  • Helm 3.2.0+
  • PV provisioner support in the underlying infrastructure

Installing the Chart

To install the chart with the release name my-release:

helm repo add bitnami https://charts.bitnami.com/bitnami
helm install my-release bitnami/spring-cloud-dataflow

These commands deploy Spring Cloud Data Flow on the Kubernetes cluster with the default configuration. The parameters section lists the parameters that can be configured during installation.

Tip: List all releases using helm list

Uninstalling the Chart

To uninstall/delete the my-release chart:

helm uninstall my-release

Parameters

Global parameters

Name Description Value
global.imageRegistry Global Docker image registry ""
global.imagePullSecrets Global Docker registry secret names as an array []
global.storageClass Global StorageClass for Persistent Volume(s) ""

Common parameters

Name Description Value
nameOverride String to partially override scdf.fullname template (will maintain the release name). ""
fullnameOverride String to fully override scdf.fullname template. ""
commonAnnotations Annotations to add to all deployed objects {}
commonLabels Labels to add to all deployed objects {}
kubeVersion Force target Kubernetes version (using Helm capabilities if not set) ""
clusterDomain Default Kubernetes cluster domain cluster.local
extraDeploy Array of extra objects to deploy with the release []

Dataflow Server parameters

Name Description Value
server.image.registry Spring Cloud Dataflow image registry docker.io
server.image.repository Spring Cloud Dataflow image repository bitnami/spring-cloud-dataflow
server.image.tag Spring Cloud Dataflow image tag (immutable tags are recommended) 2.9.4-debian-11-r0
server.image.pullPolicy Spring Cloud Dataflow image pull policy IfNotPresent
server.image.pullSecrets Specify docker-registry secret names as an array []
server.image.debug Enable image debug mode false
server.hostAliases Deployment pod host aliases []
server.composedTaskRunner.image.registry Spring Cloud Dataflow Composed Task Runner image registry docker.io
server.composedTaskRunner.image.repository Spring Cloud Dataflow Composed Task Runner image repository bitnami/spring-cloud-dataflow-composed-task-runner
server.composedTaskRunner.image.tag Spring Cloud Dataflow Composed Task Runner image tag (immutable tags are recommended) 2.9.4-debian-11-r0
server.configuration.streamingEnabled Enables or disables streaming data processing true
server.configuration.batchEnabled Enables or disables batch data (tasks and schedules) processing true
server.configuration.accountName The name of the account to configure for the Kubernetes platform default
server.configuration.trustK8sCerts Trust K8s certificates when querying the Kubernetes API false
server.configuration.containerRegistries Container registries configuration {}
server.configuration.grafanaInfo Endpoint to the grafana instance (Deprecated: use the metricsDashboard instead) ""
server.configuration.metricsDashboard Endpoint to the metricsDashboard instance ""
server.configuration.defaultSpringApplicationJSON Injects default values for environment variable SPRING_APPLICATION_JSON true
server.existingConfigmap ConfigMap with Spring Cloud Dataflow Server Configuration ""
server.command Override default container command (useful when using custom images) []
server.args Override default container args (useful when using custom images) []
server.lifecycleHooks for the Dataflow server container(s) to automate configuration before or after startup {}
server.extraEnvVars Extra environment variables to be set on Dataflow server container []
server.extraEnvVarsCM ConfigMap with extra environment variables ""
server.extraEnvVarsSecret Secret with extra environment variables ""
server.replicaCount Number of Dataflow server replicas to deploy 1
server.podAffinityPreset Dataflow server pod affinity preset. Ignored if server.affinity is set. Allowed values: soft or hard ""
server.podAntiAffinityPreset Dataflow server pod anti-affinity preset. Ignored if server.affinity is set. Allowed values: soft or hard soft
server.containerPort Dataflow server port 8080
server.nodeAffinityPreset.type Dataflow server node affinity preset type. Ignored if server.affinity is set. Allowed values: soft or hard ""
server.nodeAffinityPreset.key Dataflow server node label key to match Ignored if server.affinity is set. ""
server.nodeAffinityPreset.values Dataflow server node label values to match. Ignored if server.affinity is set. []
server.affinity Dataflow server affinity for pod assignment {}
server.nodeSelector Dataflow server node labels for pod assignment {}
server.tolerations Dataflow server tolerations for pod assignment []
server.podAnnotations Annotations for Dataflow server pods {}
server.updateStrategy.type Deployment strategy type for Dataflow server pods. RollingUpdate
server.podLabels Extra labels for Dataflow Server pods {}
server.priorityClassName Dataflow Server pods' priority ""
server.schedulerName Name of the k8s scheduler (other than default) ""
server.topologySpreadConstraints Topology Spread Constraints for pod assignment []
server.podSecurityContext.enabled Enabled Dataflow Server pods' Security Context true
server.podSecurityContext.fsGroup Group ID for the volumes of the pod 1001
server.containerSecurityContext.enabled Enabled Dataflow Server containers' Security Context true
server.containerSecurityContext.runAsUser Set Dataflow Server container's Security Context runAsUser 1001
server.resources.limits The resources limits for the Dataflow server container {}
server.resources.requests The requested resources for the Dataflow server container {}
server.startupProbe.enabled Enable startupProbe false
server.startupProbe.initialDelaySeconds Initial delay seconds for startupProbe 120
server.startupProbe.periodSeconds Period seconds for startupProbe 20
server.startupProbe.timeoutSeconds Timeout seconds for startupProbe 1
server.startupProbe.failureThreshold Failure threshold for startupProbe 6
server.startupProbe.successThreshold Success threshold for startupProbe 1
server.livenessProbe.enabled Enable livenessProbe true
server.livenessProbe.initialDelaySeconds Initial delay seconds for livenessProbe 120
server.livenessProbe.periodSeconds Period seconds for livenessProbe 20
server.livenessProbe.timeoutSeconds Timeout seconds for livenessProbe 1
server.livenessProbe.failureThreshold Failure threshold for livenessProbe 6
server.livenessProbe.successThreshold Success threshold for livenessProbe 1
server.readinessProbe.enabled Enable readinessProbe true
server.readinessProbe.initialDelaySeconds Initial delay seconds for readinessProbe 120
server.readinessProbe.periodSeconds Period seconds for readinessProbe 20
server.readinessProbe.timeoutSeconds Timeout seconds for readinessProbe 1
server.readinessProbe.failureThreshold Failure threshold for readinessProbe 6
server.readinessProbe.successThreshold Success threshold for readinessProbe 1
server.customStartupProbe Override default startup probe {}
server.customLivenessProbe Override default liveness probe {}
server.customReadinessProbe Override default readiness probe {}
server.service.type Kubernetes service type ClusterIP
server.service.port Service HTTP port 8080
server.service.nodePort Specify the nodePort value for the LoadBalancer and NodePort service types ""
server.service.clusterIP Dataflow server service cluster IP ""
server.service.externalTrafficPolicy Enable client source IP preservation Cluster
server.service.loadBalancerIP Load balancer IP if service type is LoadBalancer ""
server.service.loadBalancerSourceRanges Addresses that are allowed when service is LoadBalancer []
server.service.extraPorts Extra ports to expose (normally used with the sidecar value) []
server.service.annotations Provide any additional annotations which may be required. Evaluated as a template. {}
server.service.sessionAffinity Session Affinity for Kubernetes service, can be "None" or "ClientIP" None
server.service.sessionAffinityConfig Additional settings for the sessionAffinity {}
server.ingress.enabled Enable ingress controller resource false
server.ingress.path The Path to WordPress. You may need to set this to '/*' in order to use this with ALB ingress controllers. /
server.ingress.apiVersion Force Ingress API version (automatically detected if not set) ""
server.ingress.pathType Ingress path type ImplementationSpecific
server.ingress.hostname Default host for the ingress resource dataflow.local
server.ingress.annotations Additional annotations for the Ingress resource. To enable certificate autogeneration, place here your cert-manager annotations. {}
server.ingress.tls Enable TLS configuration for the hostname defined at ingress.hostname parameter false
server.ingress.certManager Add the corresponding annotations for cert-manager integration false
server.ingress.extraHosts The list of additional hostnames to be covered with this ingress record. []
server.ingress.extraPaths An array with additional arbitrary paths that may need to be added to the ingress under the main host []
server.ingress.extraTls The tls configuration for additional hostnames to be covered with this ingress record. []
server.ingress.secrets If you're providing your own certificates, please use this to add the certificates as secrets []
server.ingress.ingressClassName IngressClass that will be be used to implement the Ingress (Kubernetes 1.18+) ""
server.ingress.extraRules Additional rules to be covered with this ingress record []
server.initContainers Add init containers to the Dataflow Server pods []
server.sidecars Add sidecars to the Dataflow Server pods []
server.pdb.create Enable/disable a Pod Disruption Budget creation false
server.pdb.minAvailable Minimum number/percentage of pods that should remain scheduled 1
server.pdb.maxUnavailable Maximum number/percentage of pods that may be made unavailable ""
server.autoscaling.enabled Enable autoscaling for Dataflow server false
server.autoscaling.minReplicas Minimum number of Dataflow server replicas ""
server.autoscaling.maxReplicas Maximum number of Dataflow server replicas ""
server.autoscaling.targetCPU Target CPU utilization percentage ""
server.autoscaling.targetMemory Target Memory utilization percentage ""
server.extraVolumes Extra Volumes to be set on the Dataflow Server Pod []
server.extraVolumeMounts Extra VolumeMounts to be set on the Dataflow Container []
server.jdwp.enabled Set to true to enable Java debugger false
server.jdwp.port Specify port for remote debugging 5005
server.proxy Add proxy configuration for SCDF server {}
server.applicationProperties Specify common application properties added by SCDF server to streams and/or tasks {}

Dataflow Skipper parameters

Name Description Value
skipper.enabled Enable Spring Cloud Skipper component true
skipper.hostAliases Deployment pod host aliases []
skipper.image.registry Spring Cloud Skipper image registry docker.io
skipper.image.repository Spring Cloud Skipper image repository bitnami/spring-cloud-skipper
skipper.image.tag Spring Cloud Skipper image tag (immutable tags are recommended) 2.8.4-debian-11-r0
skipper.image.pullPolicy Spring Cloud Skipper image pull policy IfNotPresent
skipper.image.pullSecrets Specify docker-registry secret names as an array []
skipper.image.debug Enable image debug mode false
skipper.configuration.accountName The name of the account to configure for the Kubernetes platform default
skipper.configuration.trustK8sCerts Trust K8s certificates when querying the Kubernetes API false
skipper.existingConfigmap Name of existing ConfigMap with Skipper server configuration ""
skipper.command Override default container command (useful when using custom images) []
skipper.args Override default container args (useful when using custom images) []
skipper.lifecycleHooks for the Skipper container(s) to automate configuration before or after startup {}
skipper.extraEnvVars Extra environment variables to be set on Skipper server container []
skipper.extraEnvVarsCM Name of existing ConfigMap containing extra environment variables ""
skipper.extraEnvVarsSecret Name of existing Secret containing extra environment variables ""
skipper.replicaCount Number of Skipper server replicas to deploy 1
skipper.podAffinityPreset Skipper pod affinity preset. Ignored if skipper.affinity is set. Allowed values: soft or hard ""
skipper.podAntiAffinityPreset Skipper pod anti-affinity preset. Ignored if skipper.affinity is set. Allowed values: soft or hard soft
skipper.nodeAffinityPreset.type Skipper node affinity preset type. Ignored if skipper.affinity is set. Allowed values: soft or hard ""
skipper.nodeAffinityPreset.key Skipper node label key to match Ignored if skipper.affinity is set. ""
skipper.nodeAffinityPreset.values Skipper node label values to match. Ignored if skipper.affinity is set. []
skipper.affinity Skipper affinity for pod assignment {}
skipper.nodeSelector Skipper node labels for pod assignment {}
skipper.tolerations Skipper tolerations for pod assignment []
skipper.podAnnotations Annotations for Skipper server pods {}
skipper.updateStrategy.type Deployment strategy type for Skipper server pods. RollingUpdate
skipper.podLabels Extra labels for Skipper pods {}
skipper.priorityClassName Controller priorityClassName ""
skipper.schedulerName Name of the k8s scheduler (other than default) ""
skipper.topologySpreadConstraints Topology Spread Constraints for pod assignment []
skipper.podSecurityContext.enabled Enabled Skipper pods' Security Context true
skipper.podSecurityContext.fsGroup Group ID for the volumes of the pod 1001
skipper.containerSecurityContext.enabled Enabled Datafkiw Skipper containers' Security Context true
skipper.containerSecurityContext.runAsUser Set Dataflow Skipper container's Security Context runAsUser 1001
skipper.resources.limits The resources limits for the Skipper server container {}
skipper.resources.requests The requested resources for the Skipper server container {}
skipper.startupProbe.enabled Enable startupProbe false
skipper.startupProbe.initialDelaySeconds Initial delay seconds for startupProbe 120
skipper.startupProbe.periodSeconds Period seconds for startupProbe 20
skipper.startupProbe.timeoutSeconds Timeout seconds for startupProbe 1
skipper.startupProbe.failureThreshold Failure threshold for startupProbe 6
skipper.startupProbe.successThreshold Success threshold for startupProbe 1
skipper.livenessProbe.enabled Enable livenessProbe true
skipper.livenessProbe.initialDelaySeconds Initial delay seconds for livenessProbe 120
skipper.livenessProbe.periodSeconds Period seconds for livenessProbe 20
skipper.livenessProbe.timeoutSeconds Timeout seconds for livenessProbe 1
skipper.livenessProbe.failureThreshold Failure threshold for livenessProbe 6
skipper.livenessProbe.successThreshold Success threshold for livenessProbe 1
skipper.readinessProbe.enabled Enable readinessProbe true
skipper.readinessProbe.initialDelaySeconds Initial delay seconds for readinessProbe 120
skipper.readinessProbe.periodSeconds Period seconds for readinessProbe 20
skipper.readinessProbe.timeoutSeconds Timeout seconds for readinessProbe 1
skipper.readinessProbe.failureThreshold Failure threshold for readinessProbe 6
skipper.readinessProbe.successThreshold Success threshold for readinessProbe 1
skipper.customStartupProbe Override default startup probe {}
skipper.customLivenessProbe Override default liveness probe {}
skipper.customReadinessProbe Override default readiness probe {}
skipper.service.type Kubernetes service type ClusterIP
skipper.service.port Service HTTP port 80
skipper.service.nodePort Service HTTP node port ""
skipper.service.clusterIP Skipper server service cluster IP ""
skipper.service.externalTrafficPolicy Enable client source IP preservation Cluster
skipper.service.loadBalancerIP Load balancer IP if service type is LoadBalancer ""
skipper.service.loadBalancerSourceRanges Address that are allowed when service is LoadBalancer []
skipper.service.extraPorts Extra ports to expose (normally used with the sidecar value) []
skipper.service.annotations Annotations for Skipper server service {}
skipper.service.sessionAffinity Session Affinity for Kubernetes service, can be "None" or "ClientIP" None
skipper.service.sessionAffinityConfig Additional settings for the sessionAffinity {}
skipper.initContainers Add init containers to the Dataflow Skipper pods []
skipper.sidecars Add sidecars to the Skipper pods []
skipper.pdb.create Enable/disable a Pod Disruption Budget creation false
skipper.pdb.minAvailable Minimum number/percentage of pods that should remain scheduled 1
skipper.pdb.maxUnavailable Maximum number/percentage of pods that may be made unavailable ""
skipper.autoscaling.enabled Enable autoscaling for Skipper server false
skipper.autoscaling.minReplicas Minimum number of Skipper server replicas ""
skipper.autoscaling.maxReplicas Maximum number of Skipper server replicas ""
skipper.autoscaling.targetCPU Target CPU utilization percentage ""
skipper.autoscaling.targetMemory Target Memory utilization percentage ""
skipper.extraVolumes Extra Volumes to be set on the Skipper Pod []
skipper.extraVolumeMounts Extra VolumeMounts to be set on the Skipper Container []
skipper.jdwp.enabled Enable Java Debug Wire Protocol (JDWP) false
skipper.jdwp.port JDWP TCP port for remote debugging 5005
externalSkipper.host Host of a external Skipper Server localhost
externalSkipper.port External Skipper Server port number 7577

Deployer parameters

Name Description Value
deployer.resources.limits Streaming applications resource limits {}
deployer.resources.requests Streaming applications resource requests {}
deployer.readinessProbe.initialDelaySeconds Initial delay seconds for readinessProbe 120
deployer.livenessProbe.initialDelaySeconds Initial delay seconds for livenessProbe 90
deployer.nodeSelector The node selectors to apply to the streaming applications deployments in "key:value" format ""
deployer.tolerations Streaming applications tolerations []
deployer.volumeMounts Streaming applications extra volume mounts {}
deployer.volumes Streaming applications extra volumes {}
deployer.environmentVariables Streaming applications environment variables []
deployer.podSecurityContext.enabled Enabled pods' Security Context of the deployed pods batch or stream pods true
deployer.podSecurityContext.runAsUser Set Dataflow Streams container's Security Context runAsUser 1001
deployer.imagePullSecrets Streaming applications imagePullSecrets []
deployer.secretRefs Streaming applications secretRefs []
deployer.entryPointStyle An entry point style affects how application properties are passed to the container to be deployed. Allowed values: exec (default), shell, boot exec
deployer.imagePullPolicy An image pull policy defines when a Docker image should be pulled to the local registry. Allowed values: IfNotPresent (default), Always, Never IfNotPresent

RBAC parameters

Name Description Value
serviceAccount.create Enable the creation of a ServiceAccount for Dataflow server and Skipper server pods true
serviceAccount.name Name of the created serviceAccount. If not set and create is true, a name is generated using the scdf.fullname template ""
serviceAccount.automountServiceAccountToken Automount service account token for the server service account true
serviceAccount.annotations Annotations for service account. Evaluated as a template. Only used if create is true. {}
rbac.create Whether to create and use RBAC resources or not true

Metrics parameters

Name Description Value
metrics.enabled Enable Prometheus metrics false
metrics.image.registry Prometheus Rsocket Proxy image registry docker.io
metrics.image.repository Prometheus Rsocket Proxy image repository bitnami/prometheus-rsocket-proxy
metrics.image.tag Prometheus Rsocket Proxy image tag (immutable tags are recommended) 1.4.0-debian-11-r0
metrics.image.pullPolicy Prometheus Rsocket Proxy image pull policy IfNotPresent
metrics.image.pullSecrets Specify docker-registry secret names as an array []
metrics.resources.limits The resources limits for the Prometheus Rsocket Proxy container {}
metrics.resources.requests The requested resources for the Prometheus Rsocket Proxy container {}
metrics.replicaCount Number of Prometheus Rsocket Proxy replicas to deploy 1
metrics.podAffinityPreset Prometheus Rsocket Proxy pod affinity preset. Ignored if metrics.affinity is set. Allowed values: soft or hard ""
metrics.podAntiAffinityPreset Prometheus Rsocket Proxy pod anti-affinity preset. Ignored if metrics.affinity is set. Allowed values: soft or hard soft
metrics.nodeAffinityPreset.type Prometheus Rsocket Proxy node affinity preset type. Ignored if metrics.affinity is set. Allowed values: soft or hard ""
metrics.nodeAffinityPreset.key Prometheus Rsocket Proxy node label key to match Ignored if metrics.affinity is set. ""
metrics.nodeAffinityPreset.values Prometheus Rsocket Proxy node label values to match. Ignored if metrics.affinity is set. []
metrics.affinity Prometheus Rsocket Proxy affinity for pod assignment {}
metrics.nodeSelector Prometheus Rsocket Proxy node labels for pod assignment {}
metrics.hostAliases Prometheus Proxy pods host aliases []
metrics.tolerations Prometheus Rsocket Proxy tolerations for pod assignment []
metrics.podAnnotations Annotations for Prometheus Rsocket Proxy pods {}
metrics.podLabels Extra labels for Prometheus Proxy pods {}
metrics.podSecurityContext.enabled Enabled Prometheus Proxy pods' Security Context false
metrics.podSecurityContext.fsGroup Set Prometheus Proxy pod's Security Context fsGroup 1001
metrics.containerSecurityContext.enabled Enabled Prometheus Proxy containers' Security Context false
metrics.containerSecurityContext.runAsUser Set Prometheus Proxy containers' Security Context runAsUser 1001
metrics.command Override default container command (useful when using custom images) []
metrics.args Override default container args (useful when using custom images) []
metrics.lifecycleHooks for the Prometheus Proxy container(s) to automate configuration before or after startup {}
metrics.extraEnvVars Array with extra environment variables to add to Prometheus Proxy nodes []
metrics.extraEnvVarsCM Name of existing ConfigMap containing extra env vars for Prometheus Proxy nodes ""
metrics.extraEnvVarsSecret Name of existing Secret containing extra env vars for Prometheus Proxy nodes ""
metrics.extraVolumes Optionally specify extra list of additional volumes for the Prometheus Proxy pod(s) []
metrics.extraVolumeMounts Optionally specify extra list of additional volumeMounts for the Prometheus Proxy container(s) []
metrics.containerPorts.http Prometheus Proxy HTTP container port 8080
metrics.containerPorts.rsocket Prometheus Proxy Rsocket container port 7001
metrics.sidecars Add additional sidecar containers to the Prometheus Proxy pod(s) []
metrics.initContainers Add additional init containers to the Prometheus Proxy pod(s) []
metrics.updateStrategy.type Prometheus Proxy deployment strategy type. RollingUpdate
metrics.priorityClassName Prometheus Rsocket Proxy pods' priority. ""
metrics.schedulerName Name of the k8s scheduler (other than default) ""
metrics.topologySpreadConstraints Topology Spread Constraints for pod assignment []
metrics.service.type Prometheus Proxy service type ClusterIP
metrics.service.ports.http Prometheus Rsocket Proxy HTTP port 8080
metrics.service.ports.rsocket Prometheus Rsocket Proxy Rsocket port 7001
metrics.service.nodePorts.http Node port for HTTP ""
metrics.service.nodePorts.rsocket Node port for Rsocket ""
metrics.service.clusterIP Prometheys Proxy service Cluster IP ""
metrics.service.loadBalancerIP Prometheys Proxy service Load Balancer IP ""
metrics.service.loadBalancerSourceRanges Prometheys Proxy service Load Balancer sources []
metrics.service.externalTrafficPolicy Prometheys Proxy service external traffic policy Cluster
metrics.service.extraPorts Extra ports to expose (normally used with the sidecar value) []
metrics.service.annotations Annotations for the Prometheus Rsocket Proxy service {}
metrics.service.sessionAffinity Session Affinity for Kubernetes service, can be "None" or "ClientIP" None
metrics.service.sessionAffinityConfig Additional settings for the sessionAffinity {}
metrics.serviceMonitor.enabled if true, creates a Prometheus Operator ServiceMonitor (also requires metrics.enabled to be true) false
metrics.serviceMonitor.namespace Namespace in which ServiceMonitor is created if different from release ""
metrics.serviceMonitor.jobLabel The name of the label on the target service to use as the job name in prometheus. ""
metrics.serviceMonitor.interval Interval at which metrics should be scraped. ""
metrics.serviceMonitor.scrapeTimeout Timeout after which the scrape is ended ""
metrics.serviceMonitor.relabelings RelabelConfigs to apply to samples before scraping []
metrics.serviceMonitor.metricRelabelings MetricRelabelConfigs to apply to samples before ingestion []
metrics.serviceMonitor.selector ServiceMonitor selector labels {}
metrics.serviceMonitor.labels Extra labels for the ServiceMonitor {}
metrics.serviceMonitor.honorLabels honorLabels chooses the metric's labels on collisions with target labels false
metrics.pdb.create Enable/disable a Pod Disruption Budget creation false
metrics.pdb.minAvailable Minimum number/percentage of pods that should remain scheduled 1
metrics.pdb.maxUnavailable Maximum number/percentage of pods that may be made unavailable ""
metrics.autoscaling.enabled Enable autoscaling for Prometheus Rsocket Proxy false
metrics.autoscaling.minReplicas Minimum number of Prometheus Rsocket Proxy replicas ""
metrics.autoscaling.maxReplicas Maximum number of Prometheus Rsocket Proxy replicas ""
metrics.autoscaling.targetCPU Target CPU utilization percentage ""
metrics.autoscaling.targetMemory Target Memory utilization percentage ""

Init Container parameters

Name Description Value
waitForBackends.enabled Wait for the database and other services (such as Kafka or RabbitMQ) used when enabling streaming true
waitForBackends.image.registry Init container wait-for-backend image registry docker.io
waitForBackends.image.repository Init container wait-for-backend image name bitnami/kubectl
waitForBackends.image.tag Init container wait-for-backend image tag 1.24.1-debian-11-r0
waitForBackends.image.pullPolicy Init container wait-for-backend image pull policy IfNotPresent
waitForBackends.image.pullSecrets Specify docker-registry secret names as an array []
waitForBackends.resources.limits Init container wait-for-backend resource limits {}
waitForBackends.resources.requests Init container wait-for-backend resource requests {}

Database parameters

Name Description Value
mariadb.enabled Enable/disable MariaDB chart installation true
mariadb.architecture MariaDB architecture. Allowed values: standalone or replication standalone
mariadb.auth.rootPassword Password for the MariaDB root user ""
mariadb.auth.username Username of new user to create dataflow
mariadb.auth.password Password for the new user change-me
mariadb.auth.database Database name to create dataflow
mariadb.auth.forcePassword Force users to specify required passwords in the database false
mariadb.auth.usePasswordFiles Mount credentials as a file instead of using an environment variable false
mariadb.initdbScripts Specify dictionary of scripts to be run at first boot {}
flyway.enabled Enable/disable flyway running Dataflow and Skipper Database creation scripts on startup true
externalDatabase.host Host of the external database localhost
externalDatabase.port External database port number 3306
externalDatabase.driver The fully qualified name of the JDBC Driver class ""
externalDatabase.scheme The scheme is a vendor-specific or shared protocol string that follows the "jdbc:" of the URL ""
externalDatabase.password Password for the above username ""
externalDatabase.existingPasswordSecret Existing secret with database password ""
externalDatabase.existingPasswordKey Key of the existing secret with database password, defaults to datasource-password ""
externalDatabase.dataflow.url JDBC URL for dataflow server. Overrides external scheme, host, port, database, and jdbc parameters. ""
externalDatabase.dataflow.database Name of the existing database to be used by Dataflow server dataflow
externalDatabase.dataflow.username Existing username in the external db to be used by Dataflow server dataflow
externalDatabase.skipper.url JDBC URL for skipper. Overrides external scheme, host, port, database, and jdbc parameters. ""
externalDatabase.skipper.database Name of the existing database to be used by Skipper server skipper
externalDatabase.skipper.username Existing username in the external db to be used by Skipper server skipper
externalDatabase.hibernateDialect Hibernate Dialect used by Dataflow/Skipper servers ""

RabbitMQ chart parameters

Name Description Value
rabbitmq.enabled Enable/disable RabbitMQ chart installation true
rabbitmq.auth.username RabbitMQ username user
externalRabbitmq.enabled Enable/disable external RabbitMQ false
externalRabbitmq.host Host of the external RabbitMQ localhost
externalRabbitmq.port External RabbitMQ port number 5672
externalRabbitmq.username External RabbitMQ username guest
externalRabbitmq.password External RabbitMQ password. It will be saved in a kubernetes secret guest
externalRabbitmq.vhost External RabbitMQ virtual host. It will be saved in a kubernetes secret ""
externalRabbitmq.existingPasswordSecret Existing secret with RabbitMQ password. It will be saved in a kubernetes secret ""

Kafka chart parameters

Name Description Value
kafka.enabled Enable/disable Kafka chart installation false
kafka.replicaCount Number of Kafka brokers 1
kafka.offsetsTopicReplicationFactor Kafka Secret Key 1
kafka.zookeeper.replicaCount Number of Zookeeper replicas 1
externalKafka.enabled Enable/disable external Kafka false
externalKafka.brokers External Kafka brokers localhost:9092
externalKafka.zkNodes External Zookeeper nodes localhost:2181

Specify each parameter using the --set key=value[,key=value] argument to helm install. For example,

helm install my-release --set server.replicaCount=2 bitnami/spring-cloud-dataflow

The above command install Spring Cloud Data Flow chart with 2 Dataflow server replicas.

Alternatively, a YAML file that specifies the values for the parameters can be provided while installing the chart. For example,

helm install my-release -f values.yaml bitnami/spring-cloud-dataflow

Tip: You can use the default values.yaml

Configuration and installation details

Rolling VS Immutable tags

It is strongly recommended to use immutable tags in a production environment. This ensures your deployment does not change automatically if the same tag is updated with a different image.

Bitnami will release a new chart updating its containers if a new version of the main container, significant changes, or critical vulnerabilities exist.

Features

If you only need to deploy tasks and schedules, streaming and Skipper can be disabled:

server.configuration.batchEnabled=true
server.configuration.streamingEnabled=false
skipper.enabled=false
rabbitmq.enabled=false

If you only need to deploy streams, tasks and schedules can be disabled:

server.configuration.batchEnabled=false
server.configuration.streamingEnabled=true
skipper.enabled=true
rabbitmq.enabled=true

NOTE: Both server.configuration.batchEnabled and server.configuration.streamingEnabled should not be set to false at the same time.

Messaging solutions

There are two supported messaging solutions in this chart:

  • RabbitMQ (default)
  • Kafka

To change the messaging layer to Kafka, use the the following parameters:

rabbitmq.enabled=false
kafka.enabled=true

Only one messaging layer can be used at a given time.

Using an external database

Sometimes you may want to have Spring Cloud components connect to an external database rather than installing one inside your cluster, e.g. to use a managed database service, or use run a single database server for all your applications. To do this, the chart allows you to specify credentials for an external database under the externalDatabase parameter. You should also disable the MariaDB installation with the mariadb.enabled option. For example with the following parameters:

mariadb.enabled=false
externalDatabase.scheme=mariadb
externalDatabase.host=myexternalhost
externalDatabase.port=3306
externalDatabase.password=mypassword
externalDatabase.dataflow.user=mydataflowuser
externalDatabase.dataflow.database=mydataflowdatabase
externalDatabase.dataflow.user=myskipperuser
externalDatabase.dataflow.database=myskipperdatabase

NOTE: When using the indidual properties (scheme, host, port, database, an optional jdbcParameters) this chart will format the JDBC URL as jdbc:{scheme}://{host}:{port}/{database}{jdbcParameters}. The URL format follows that of the MariaDB database drive but may not work for other database vendors.

To use an alternate database vendor (other than MariaDB) you can use the externalDatabase.dataflow.url and externalDatabase.skipper.url properties to provide the JDBC URLs for the dataflow server and skipper respectively. If these properties are defined, they will take precedence over the individual attributes. As an example of configuring an external MS SQL Server database:

mariadb.enabled=false
externalDatabase.password=mypassword
externalDatabase.dataflow.url=jdbc:sqlserver://mssql-server:1433
externalDatabase.dataflow.user=mydataflowuser
externalDatabase.skipper.url=jdbc:sqlserver://mssql-server:1433
externalDatabase.skipper.user=myskipperuser
externalDatabase.hibernateDialect=org.hibernate.dialect.SQLServer2012Dialect

NOTE: If you disable MariaDB per above you MUST supply values for the externalDatabase connection.

Adding extra flags

In case you want to add extra environment variables to any Spring Cloud component, you can use XXX.extraEnvs parameter(s), where XXX is placeholder you need to replace with the actual component(s). For instance, to add extra flags to Spring Cloud Data Flow, use:

server:
  extraEnvs:
    - name: FOO
      value: BAR

Using custom Dataflow configuration

This helm chart supports using custom configuration for Dataflow server.

You can specify the configuration for Dataflow server setting the server.existingConfigmap parameter to an external ConfigMap with the configuration file.

Using custom Skipper configuration

This helm chart supports using custom configuration for Skipper server.

You can specify the configuration for Skipper server setting the skipper.existingConfigmap parameter to an external ConfigMap with the configuration file.

Sidecars and Init Containers

If you have a need for additional containers to run within the same pod as Dataflow or Skipper components (e.g. an additional metrics or logging exporter), you can do so via the XXX.sidecars parameter(s), where XXX is placeholder you need to replace with the actual component(s). Simply define your container according to the Kubernetes container spec.

server:
  sidecars:
    - name: your-image-name
      image: your-image
      imagePullPolicy: Always
      ports:
        - name: portname
          containerPort: 1234

Similarly, you can add extra init containers using the XXX.initContainers parameter(s).

server:
  initContainers:
    - name: your-image-name
      image: your-image
      imagePullPolicy: Always
      ports:
        - name: portname
          containerPort: 1234

Ingress

This chart provides support for ingress resources. If you have an ingress controller installed on your cluster, such as nginx-ingress or traefik you can utilize the ingress controller to serve your Spring Cloud Data Flow server.

To enable ingress integration, please set server.ingress.enabled to true

Hosts

Most likely you will only want to have one hostname that maps to this Spring Cloud Data Flow installation. If that's your case, the property server.ingress.hostname will set it. However, it is possible to have more than one host. To facilitate this, the server.ingress.extraHosts object is can be specified as an array. You can also use server.ingress.extraTLS to add the TLS configuration for extra hosts.

For each host indicated at server.ingress.extraHosts, please indicate a name, path, and any annotations that you may want the ingress controller to know about.

For annotations, please see this document. Not all annotations are supported by all ingress controllers, but this document does a good job of indicating which annotation is supported by many popular ingress controllers.

TLS

This chart will facilitate the creation of TLS secrets for use with the ingress controller, however, this is not required. There are four common use cases:

  • Helm generates/manages certificate secrets based on the parameters.
  • User generates/manages certificates separately.
  • Helm creates self-signed certificates and generates/manages certificate secrets.
  • An additional tool (like cert-manager) manages the secrets for the application. In the first two cases, it's needed a certificate and a key. We would expect them to look like this:
  • certificate files should look like (and there can be more than one certificate if there is a certificate chain)

    -----BEGIN CERTIFICATE-----
    MIID6TCCAtGgAwIBAgIJAIaCwivkeB5EMA0GCSqGSIb3DQEBCwUAMFYxCzAJBgNV
    ...
    jScrvkiBO65F46KioCL9h5tDvomdU1aqpI/CBzhvZn1c0ZTf87tGQR8NK7v7
    -----END CERTIFICATE-----
  • keys should look like:

    -----BEGIN RSA PRIVATE KEY-----
    MIIEogIBAAKCAQEAvLYcyu8f3skuRyUgeeNpeDvYBCDcgq+LsWap6zbX5f8oLqp4
    ...
    wrj2wDbCDCFmfqnSJ+dKI3vFLlEz44sAV8jX/kd4Y6ZTQhlLbYc=
    -----END RSA PRIVATE KEY-----
  • If you are going to use Helm to manage the certificates based on the parameters, please copy these values into the certificate and key values for a given server.ingress.secrets entry.
  • In case you are going to manage TLS secrets separately, please know that you must create a TLS secret with name INGRESS_HOSTNAME-tls (where INGRESS_HOSTNAME is a placeholder to be replaced with the hostname you set using the server.ingress.hostname parameter).
  • To use self-signed certificates created by Helm, set server.ingress.tls to true and server.ingress.certManager to false.
  • If your cluster has a cert-manager add-on to automate the management and issuance of TLS certificates, set server.ingress.certManager boolean to true to enable the corresponding annotations for cert-manager.

Setting Pod's affinity

This chart allows you to set your custom affinity using the XXX.affinity parameter(s). Find more information about Pod's affinity in the kubernetes documentation.

As an alternative, you can use of the preset configurations for pod affinity, pod anti-affinity, and node affinity available at the bitnami/common chart. To do so, set the XXX.podAffinityPreset, XXX.podAntiAffinityPreset, or XXX.nodeAffinityPreset parameters.

Troubleshooting

Find more information about how to deal with common errors related to Bitnami Helm charts in this troubleshooting guide.

Upgrading

If you enabled RabbitMQ chart to be used as the messaging solution for Skipper to manage streaming content, then it's necessary to set the rabbitmq.auth.password and rabbitmq.auth.erlangCookie parameters when upgrading for readiness/liveness probes to work properly. Inspect the RabbitMQ secret to obtain the password and the Erlang cookie, then you can upgrade your chart using the command below:

To 10.0.0

This major updates the Kafka subchart to it newest major, 18.0.0. No major issues are expected during the upgrade.

To 11.0.0

This chart bumps the RabbitMQ version to 3.10.x. No issues are expected during the upgrade.

To 10.0.0

This major updates the Kafka subchart to it newest major, 17.0.0. No major issues are expected during the upgrade.

To 9.0.0

This major updates the RabbitMQ subchart to it newest major, 9.0.0. Here you can find more information about the changes introduced in that version.

To 8.0.0

This major release bumps the MariaDB version to 10.6. Follow the upstream instructions for upgrading from MariaDB 10.5 to 10.6. No major issues are expected during the upgrade.

To 7.0.0

This major updates the Kafka subchart to it newest major, 16.0.0. Here you can find more information about the changes introduced in that version.

To 6.0.0

This major release updates the Kafka subchart to its newest major 15.x.x, which contain several changes in the supported values and bumps Kafka major version to 3.x series (check the upgrade notes to obtain more information).

To upgrade to 6.0.0 from 5.x using Kafka as messaging solution, it should be done maintaining the Kafka 2.x series. To do so, follow the instructions below (the following example assumes that the release name is scdf and the release namespace default):

  1. Obtain the credentials on your current release:
export MARIADB_ROOT_PASSWORD=$(kubectl get secret --namespace default scdf-mariadb -o jsonpath="{.data.mariadb-root-password}" | base64 -d)
export MARIADB_PASSWORD=$(kubectl get secret --namespace default scdf-mariadb -o jsonpath="{.data.mariadb-password}" | base64 -d)
  1. Upgrade your release using the same Kafka version:
export CURRENT_KAFKA_VERSION=$(kubectl exec scdf-kafka-0 -- bash -c 'echo $BITNAMI_IMAGE_VERSION')
helm upgrade scdf bitnami/spring-cloud-dataflow \
  --set rabbitmq.enabled=false \
  --set kafka.enabled=true \
  --set kafka.image.tag=$CURRENT_KAFKA_VERSION \
  --set mariadb.auth.password=$MARIADB_PASSWORD \
  --set mariadb.auth.rootPassword=$MARIADB_ROOT_PASSWORD

To 5.0.0

This major release renames several values in this chart and adds missing features, in order to be inline with the rest of assets in the Bitnami charts repository.

Affected values:

  • serviceMonitor.extraLabels renamed as serviceMonitor.labels.
  • server.strategyType and skikker.strategyType have been replaced by server.updateStrategy and skikker.updateStrategy respectively. While strategyType was interpreted as a String, while updateStrategy is interpreted as an object.
  • The service account token is now set to automount false by default. To change this, set the value serviceAccount.automountServiceAccountToken to true.

Additionally updates the MariaDB subchart to it newest major, 10.0.0, which contains similar changes. Here you can find more information about the changes introduced in this version.

To 4.0.0

This major updates the Kafka subchart its newest major, 14.0.0. Here you can find more information about the changes introduced in this version.

To 3.0.0

This major updates the Kafka subchart to its newest major 13.0.0. For more information on this subchart's major, please refer to kafka upgrade notes.

To 2.0.0

On November 13, 2020, Helm v2 support was formally finished, this major version is the result of the required changes applied to the Helm Chart to be able to incorporate the different features added in Helm v3 and to be consistent with the Helm project itself regarding the Helm v2 EOL.

What changes were introduced in this major version?

  • Previous versions of this Helm Chart use apiVersion: v1 (installable by both Helm 2 and 3), this Helm Chart was updated to apiVersion: v2 (installable by Helm 3 only). Here you can find more information about the apiVersion field.
  • Move dependency information from the requirements.yaml to the Chart.yaml
  • After running helm dependency update, a Chart.lock file is generated containing the same structure used in the previous requirements.lock
  • The different fields present in the Chart.yaml file has been ordered alphabetically in a homogeneous way for all the Bitnami Helm Charts

Considerations when upgrading to this version

  • If you want to upgrade to this version from a previous one installed with Helm v3, you shouldn't face any issues
  • If you want to upgrade to this version using Helm v2, this scenario is not supported as this version doesn't support Helm v2 anymore
  • If you installed the previous version with Helm v2 and wants to upgrade to this version with Helm v3, please refer to the official Helm documentation about migrating from Helm v2 to v3

Useful links

v0.x.x

helm upgrade my-release bitnami/spring-cloud-dataflow --set mariadb.rootUser.password=[MARIADB_ROOT_PASSWORD] --set rabbitmq.auth.password=[RABBITMQ_PASSWORD] --set rabbitmq.auth.erlangCookie=[RABBITMQ_ERLANG_COOKIE]

v1.x.x

helm upgrade my-release bitnami/spring-cloud-dataflow --set mariadb.auth.rootPassword=[MARIADB_ROOT_PASSWORD] --set rabbitmq.auth.password=[RABBITMQ_PASSWORD] --set rabbitmq.auth.erlangCookie=[RABBITMQ_ERLANG_COOKIE]

To 1.0.0

MariaDB dependency version was bumped to a new major version that introduces several incompatibilities. Therefore, backwards compatibility is not guaranteed unless an external database is used. Check MariaDB Upgrading Notes for more information.

To upgrade to 1.0.0, you will need to reuse the PVC used to hold the MariaDB data on your previous release. To do so, follow the instructions below (the following example assumes that the release name is dataflow):

NOTE: Please, create a backup of your database before running any of those actions.

Obtain the credentials and the name of the PVC used to hold the MariaDB data on your current release:

export MARIADB_ROOT_PASSWORD=$(kubectl get secret --namespace default dataflow-mariadb -o jsonpath="{.data.mariadb-root-password}" | base64 -d)
export MARIADB_PASSWORD=$(kubectl get secret --namespace default dataflow-mariadb -o jsonpath="{.data.mariadb-password}" | base64 -d)
export MARIADB_PVC=$(kubectl get pvc -l app=mariadb,component=master,release=dataflow -o jsonpath="{.items[0].metadata.name}")
export RABBITMQ_PASSWORD=$(kubectl get secret --namespace default dataflow-rabbitmq -o jsonpath="{.data.rabbitmq-password}" | base64 -d)
export RABBITMQ_ERLANG_COOKIE=$(kubectl get secret --namespace default dataflow-rabbitmq -o jsonpath="{.data.rabbitmq-erlang-cookie}" | base64 -d)

Upgrade your release (maintaining the version) disabling MariaDB and scaling Data Flow replicas to 0:

helm upgrade dataflow bitnami/spring-cloud-dataflow --version 0.7.4 \
  --set server.replicaCount=0 \
  --set skipper.replicaCount=0 \
  --set mariadb.enabled=false \
  --set rabbitmq.auth.password=$RABBITMQ_PASSWORD \
  --set rabbitmq.auth.erlangCookie=$RABBITMQ_ERLANG_COOKIE

Finally, upgrade you release to 1.0.0 reusing the existing PVC, and enabling back MariaDB:

helm upgrade dataflow bitnami/spring-cloud-dataflow \
  --set mariadb.primary.persistence.existingClaim=$MARIADB_PVC \
  --set mariadb.auth.rootPassword=$MARIADB_ROOT_PASSWORD \
  --set mariadb.auth.password=$MARIADB_PASSWORD \
  --set rabbitmq.auth.password=$RABBITMQ_PASSWORD \
  --set rabbitmq.auth.erlangCookie=$RABBITMQ_ERLANG_COOKIE

You should see the lines below in MariaDB container logs:

$ kubectl logs $(kubectl get pods -l app.kubernetes.io/instance=dataflow,app.kubernetes.io/name=mariadb,app.kubernetes.io/component=primary -o jsonpath="{.items[0].metadata.name}")
...
mariadb 12:13:24.98 INFO  ==> Using persisted data
mariadb 12:13:25.01 INFO  ==> Running mysql_upgrade
...

License

Copyright © 2022 Bitnami

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Expected output

After issuing the helm install command, you should see output similar to the following:

NAME: my-release
LAST DEPLOYED: Sun Nov 22 21:12:29 2020
NAMESPACE: default
STATUS: deployed
REVISION: 1
TEST SUITE: None
NOTES:
** Please be patient while the chart is being deployed **

Spring Cloud Data Flow chart was deployed by using the following components:

  • Spring Cloud Data Flow server
  • Spring Cloud Skipper server

You can access Spring Cloud Data Flow through the following DNS name from within your cluster:

my-release-spring-cloud-dataflow-server.default.svc.cluster.local (port 8080)

To access Spring Cloud Data Flow dashboard from outside the cluster, run the following commands:

  1. Get the Data Flow dashboard URL by running the following commands:

    export SERVICEPORT=$(kubectl get --namespace default -o jsonpath="{.spec.ports[0].port}" services my-release-spring-cloud-dataflow-server) kubectl port-forward --namespace default svc/my-release-spring-cloud-dataflow-server ${SERVICEPORT}:${SERVICEPORT} & echo "http://127.0.0.1:${SERVICEPORT}/dashboard"

  2. Open a browser and access the Data Flow dashboard by using the obtained URL.

If you prefer, you can change the Spring Cloud Data Flow service type by passing the following set argument to helm install:

--set server.service.type=ServiceType

Where ServiceType would be a valid service name (for example LoadBalancer, NodePort, and so on).

It may take a few minutes for the LoadBalancer IP to be available. You can watch the status of the server by running kubectl get svc -w my-release-spring-cloud-dataflow-server

If you use Minikube without load balancer support, you can use the following command to get the URL for the server:

minikube service --url my-release-spring-cloud-dataflow-server

You have now created a new release in the default namespace of your Kubernetes cluster. It takes a couple of minutes for the application and its required services to start. You can check on the status by issuing a kubectl get pod -w command. You need to wait for the READY column to show 1/1 for all pods.

When all pods are ready, you can access the Spring Cloud Data Flow dashboard by accessing http://<SERVICE_ADDRESS>/dashboard, where <SERVICE_ADDRESS> is the address returned by either the kubectl or minikube commands shown earlier.

Version Compatibility

The following listing shows Spring Cloud Data Flow’s version compatibility with the respective Helm Chart releases:

Deprecated chart mappings from official Helm repository:

SCDF Version Chart Version
SCDF-K8S-Server 1.7.x 1.0.x
SCDF-K8S-Server 2.0.x 2.2.x
SCDF-K8S-Server 2.1.x 2.3.x
SCDF-K8S-Server 2.2.x 2.4.x
SCDF-K8S-Server 2.3.x 2.5.x
SCDF-K8S-Server 2.4.x 2.6.x
SCDF-K8S-Server 2.5.x 2.7.x
SCDF-K8S-Server 2.6.x 2.8.x

Bitnami chart mappings:

SCDF Version Chart Version
SCDF-K8S-Server 2.6.x 1.1.x
SCDF-K8S-Server 2.7.x 2.0.x

Registering Prebuilt Applications

All of the prebuilt streaming applications:

  • Are available as Apache Maven artifacts or Docker images.
  • Use RabbitMQ or Apache Kafka.
  • Support monitoring via Prometheus and InfluxDB.
  • Contain metadata for application properties used in the UI and code completion in the shell.

You can register applications individually by using the app register functionality or as a group by using the app import functionality. There are also dataflow.spring.io links that represent the group of prebuilt applications for a specific release, which is useful for getting started.

You can register applications by using either the UI or the shell. Even though we use only two prebuilt applications, we register the full set of prebuilt applications.

The easiest way to install Data Flow on Kubernetes is to use the Helm chart that uses RabbitMQ as the default messaging middleware. The command to import the RabbitMQ version of the applications is as follows:

dataflow:>app import --uri https://dataflow.spring.io/rabbitmq-docker-latest

Change rabbitmq to kafka in the above URL if you set kafka.enabled=true in the helm chart or followed the manual kubectl based installation instructions for installing Data Flow on Kubernetes and chose to use Kafka as the messaging middleware.

Only applications registered with a --uri property pointing to a Docker resource are supported by the Data Flow Server for Kubernetes. However, we do support Maven resources for the --metadata-uri property, which is used to list the properties supported by each application. For example, the following application registration is valid:

app register --type source --name time --uri docker://springcloudstream/time-source-rabbit:{docker-time-source-rabbit-version} --metadata-uri maven://org.springframework.cloud.stream.app:time-source-rabbit:jar:metadata:{docker-time-source-rabbit-version}

Any application registered with a Maven, HTTP, or File resource or the executable jar (by using a --uri property prefixed with maven://, http:// or file://) is not supported.

Application and Server Properties

This section covers how you can customize the deployment of your applications. You can use a number of properties to influence settings for the applications that are deployed. Properties can be applied on a per-application basis or in the appropriate server configuration for all deployed applications.

Properties set on a per-application basis always take precedence over properties set as the server configuration. This arrangement lets you override global server-level properties on a per-application basis.

Properties to be applied for all deployed Tasks are defined in the src/kubernetes/server/server-config.yaml file and for Streams in src/kubernetes/skipper/skipper-config-(binder).yaml. Replace (binder) with the messaging middleware you are using —- for example, rabbit or kafka.

Memory and CPU Settings

Applications are deployed with default memory and CPU settings. If needed, these values can be adjusted. The following example shows how to set Limits to 1000m for CPU and 1024Mi for memory and Requests to 800m for CPU and 640Mi for memory:

deployer.<app>.kubernetes.limits.cpu=1000m
deployer.<app>.kubernetes.limits.memory=1024Mi
deployer.<app>.kubernetes.requests.cpu=800m
deployer.<app>.kubernetes.requests.memory=640Mi

Those values results in the following container settings being used:

Limits:
cpu: 1
memory: 1Gi
Requests:
cpu: 800m
memory: 640Mi

You can also control the default values to which to set the cpu and memory globally.

The following example shows how to set the CPU and memory for streams and tasks:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    limits:
                      memory: 640mi
                      cpu: 500m
data:
  application.yaml: |-
    spring:
      cloud:
        dataflow:
          task:
            platform:
              kubernetes:
                accounts:
                  default:
                    limits:
                      memory: 640mi
                      cpu: 500m

The settings we have used so far affect only the settings for the container. They do not affect the memory setting for the JVM process in the container. If you would like to set JVM memory settings, you can provide an environment variable to do so. See the next section for details.

Environment Variables

To influence the environment settings for a given application, you can use the spring.cloud.deployer.kubernetes.environmentVariables deployer property. For example, a common requirement in production settings is to influence the JVM memory arguments. You can do so by using the JAVA_TOOL_OPTIONS environment variable, as follows:

deployer.<app>.kubernetes.environmentVariables=JAVA_TOOL_OPTIONS=-Xmx1024m

The environmentVariables property accepts a comma-delimited string. If an environment variable contains a value that is also a comma-delimited string, it must be enclosed in single quotation marks —- for example,

spring.cloud.deployer.kubernetes.environmentVariables=spring.cloud.stream.kafka.binder.brokers='somehost:9092, anotherhost:9093'

Doin so overrides the JVM memory setting for the desired <app> (replace <app> with the name of your application).

Liveness and Readiness Probes

The liveness and readiness probes use paths called /health and /info, respectively. They use a delay of 10 for both and a period of 60 and 10, respectively. You can change these defaults when you deploy the stream by using deployer properties. The liveness and readiness probes are applied only to streams.

The following example changes the liveness probe (replace <app> with the name of your application) by setting deployer properties:

deployer.<app>.kubernetes.livenessProbePath=/health
deployer.<app>.kubernetes.livenessProbeDelay=120
deployer.<app>.kubernetes.livenessProbePeriod=20

You can declare the same as part of the server global configuration for streams, as follows:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    livenessProbePath: /health
                    livenessProbeDelay: 120
                    livenessProbePeriod: 20

Similarly, you can swap liveness for readiness to override the default readiness settings.

By default, port 8080 is used as the probe port. You can change the defaults for both liveness and readiness probe ports by setting deployer properties, as follows:

deployer.<app>.kubernetes.readinessProbePort=7000
deployer.<app>.kubernetes.livenessProbePort=7000

You can declare the same as part of the global configuration for streams, as follows:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    readinessProbePort: 7000
                    livenessProbePort: 7000

By default, the liveness and readiness probe paths use Spring Boot 2.x+ actuator endpoints. To use Spring Boot 1.x actuator endpoint paths, you must adjust the liveness and readiness values, as follows (replace <app> with the name of your application):

deployer.<app>.kubernetes.livenessProbePath=/health
deployer.<app>.kubernetes.readinessProbePath=/info

To automatically set both liveness and readiness endpoints on a per-application basis to the default Spring Boot 1.x paths, you can set the following property:

deployer.<app>.kubernetes.bootMajorVersion=1

You can access secured probe endpoints by using credentials stored in a Kubernetes secret. You can use an existing secret, provided the credentials are contained under the credentials key name of the secret’s data block. You can configure probe authentication on a per-application basis. When enabled, it is applied to both the liveness and readiness probe endpoints by using the same credentials and authentication type. Currently, only Basic authentication is supported.

To create a new secret:

  1. Generate the base64 string with the credentials used to access the secured probe endpoints.

    Basic authentication encodes a username and password as a base64 string in the format of username:password.

    The following example (which includes output and in which you should replace user and pass with your values) shows how to generate a base64 string:

    echo -n "user:pass" | base64
    dXNlcjpwYXNz
  2. With the encoded credentials, create a file (for example, myprobesecret.yml) with the following contents:

    apiVersion: v1
    kind: Secret
    metadata:
    name:
    myprobesecret type:
    Opaque data:
    credentials: GENERATED_BASE64_STRING
  3. Replace GENERATED_BASE64_STRING with the base64-encoded value generated earlier.
  4. Create the secret by using kubectl, as follows:

    kubectl create -f ./myprobesecret.yml
    secret "myprobesecret" created
  5. Set the following deployer properties to use authentication when accessing probe endpoints, as follows:

    deployer.<app>.kubernetes.probeCredentialsSecret=myprobesecret

    Replace <app> with the name of the application to which to apply authentication.

Using SPRING_APPLICATION_JSON

You can use a SPRING_APPLICATION_JSON environment variable to set Data Flow server properties (including the configuration of Maven repository settings) that are common across all of the Data Flow server implementations. These settings go at the server level in the container env section of a deployment YAML. The following example shows how to do so:

env:
  - name: SPRING_APPLICATION_JSON
    value: |-
    {
      "maven": {
        "local-repository": null,
        "remote-repositories": {
          "repo1": {
            "url": "https://repo.spring.io/libs-snapshot"
          }
        }
      }
    }

Private Docker Registry

You can pull Docker images from a private registry on a per-application basis. First, you must create a secret in the cluster. Follow the Pull an Image from a Private Registry guide to create the secret.

Once you have created the secret, you can use the imagePullSecret property to set the secret to use, as follows:

deployer.<app>.kubernetes.imagePullSecret=mysecret

Replace <app> with the name of your application and mysecret with the name of the secret you created earlier.

You can also configure the image pull secret at the global server level.

The following example shows how to do so for streams and tasks:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    imagePullSecret: mysecret
data:
  application.yaml: |-
    spring:
      cloud:
        dataflow:
          task:
            platform:
              kubernetes:
                accounts:
                  default:
                    imagePullSecret: mysecret

Replace mysecret with the name of the secret you created earlier.

Volume Mounted Secretes

Data Flow uses the application metadata stored in a container image label. To access the metadata labels in a private registry, you have to extend the Data Flow deployment configuration and mount the registry secrets as a Secrets PropertySource:

    spec:
      containers:
      - name: scdf-server
        ...
        volumeMounts:
          - name: mysecret
            mountPath: /etc/secrets/mysecret
            readOnly: true
        ...
      volumes:
        - name: mysecret
          secret:
            secretName: mysecret

Annotations

You can add annotations to Kubernetes objects on a per-application basis. The supported object types are pod Deployment, Service, and Job. Annotations are defined in a key:value format, allowing for multiple annotations separated by a comma. For more information and use cases on annotations, see Annotations.

The following example shows how you can configure applications to use annotations:

deployer.<app>.kubernetes.podAnnotations=annotationName:annotationValue
deployer.<app>.kubernetes.serviceAnnotations=annotationName:annotationValue,annotationName2:annotationValue2
deployer.<app>.kubernetes.jobAnnotations=annotationName:annotationValue

Replace <app> with the name of your application and the value of your annotations.

Entry Point Style

An entry point style affects how application properties are passed to the container to be deployed. Currently, three styles are supported:

  • exec (default): Passes all application properties and command line arguments in the deployment request as container arguments. Application properties are transformed into the format of --key=value.
  • shell: Passes all application properties and command line arguments as environment variables. Each of the application and command line argument properties is transformed into an uppercase string, and . characters are replaced with _.
  • boot: Creates an environment variable called SPRING_APPLICATION_JSON that contains a JSON representation of all application properties. Command line arguments from the deployment request are set as container args.

When using the docker shell entrypoint on a container be aware of its impact on SIGTERM of an application, as the excerpt from the Docker Documentation discusses.

In all cases, environment variables defined at the server-level configuration and on a per-application basis are set onto the container as is.

You can configure applications as follows:

deployer.<app>.kubernetes.entryPointStyle=<Entry Point Style>

Replace <app> with the name of your application and <Entry Point Style> with your desired entry point style.

You can also configure the entry point style at the global server level.

The following example shows how to do so for streams:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    entryPointStyle: entryPointStyle

The following example shows how to do so for tasks:

data:
  application.yaml: |-
    spring:
      cloud:
        dataflow:
          task:
            platform:
              kubernetes:
                accounts:
                  default:
                    entryPointStyle: entryPointStyle

Replace entryPointStyle with the desired entry point style.

You should choose an Entry Point Style of either exec or shell, to correspond to how the ENTRYPOINT syntax is defined in the container’s Dockerfile. For more information and uses cases on exec versus shell, see the ENTRYPOINT section of the Docker documentation.

Using the boot entry point style corresponds to using the exec style ENTRYPOINT. Command line arguments from the deployment request are passed to the container, with the addition of application properties being mapped into the SPRING_APPLICATION_JSON environment variable rather than command line arguments.

It is recommended that you use exec entrypoint style when launching Spring Batch/Boot Applications or implement your own method of setting job parameters in your application. This is because the shell entry point style, converts command line args to environment variables, this prevents Spring Boot from creating job parameters from the command line args. This can be seen here.

When you use the boot Entry Point Style, the deployer.<app>.kubernetes.environmentVariables property must not contain SPRING_APPLICATION_JSON.

Deployment Service Account

You can configure a custom service account for application deployments through properties. You can use an existing service account or create a new one. One way to create a service account is by using kubectl, as follows:

kubectl create serviceaccount myserviceaccountname
serviceaccount "myserviceaccountname" created

Then you can configure individual applications, as follows:

deployer.<app>.kubernetes.deploymentServiceAccountName=myserviceaccountname

Replace <app> with the name of your application and myserviceaccountname with your service account name.

You can also configure the service account name at the global server level.

The following example shows how to do so for streams:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    deploymentServiceAccountName: myserviceaccountname

The following example shows how to do so for tasks:

data:
  application.yaml: |-
    spring:
      cloud:
        dataflow:
          task:
            platform:
              kubernetes:
                accounts:
                  default:
                    deploymentServiceAccountName: myserviceaccountname

Replace myserviceaccountname with the service account name to be applied to all deployments.

Image Pull Policy

An image pull policy defines when a Docker image should be pulled to the local registry. Currently, three policies are supported:

  • IfNotPresent (default): Do not pull an image if it already exists.
  • Always: Always pull the image regardless of whether it already exists.
  • Never: Never pull an image. Use only an image that already exists.

The following example shows how you can individually configure applications:

deployer.<app>.kubernetes.imagePullPolicy=Always

Replace <app> with the name of your application and Always with your desired image pull policy.

You can configure an image pull policy at the global server level.

The following example shows how to do so for streams:

data:
  application.yaml: |-
    spring:
      cloud:
        skipper:
          server:
            platform:
              kubernetes:
                accounts:
                  default:
                    imagePullPolicy: Always

The following example shows how to do so for tasks:

data:
  application.yaml: |-
    spring:
      cloud:
        dataflow:
          task:
            platform:
              kubernetes:
                accounts:
                  default:
                    imagePullPolicy: Always

Replace Always with your desired image pull policy.

Deployment Labels

You can set custom labels on objects related to Deployment. See Labels for more information on labels. Labels are specified in key:value format.

The following example shows how you can individually configure applications:

deployer.<app>.kubernetes.deploymentLabels=myLabelName:myLabelValue

Replace <app> with the name of your application, myLabelName with your label name, and myLabelValue with the value of your label.

Additionally, you can apply multiple labels, as follows:

deployer.<app>.kubernetes.deploymentLabels=myLabelName:myLabelValue,myLabelName2:myLabelValue2

NodePort

Applications are deployed with a Service type of ClusterIP, which is the default Kubernetes Service type if not defined otherwise. ClusterIP services are only reachable from within the cluster itself.

To expose the deployed application to be available externally, one option is to use NodePort. See the NodePort documentation for more information.

The following example shows how you can individually configure applications by using Kubernetes assigned ports:

deployer.<app>.kubernetes.createNodePort=true

Replace <app> with the name of your application.

Additionally, you can define the port to use for the NodePort Service, as follows:

deployer.<app>.kubernetes.createNodePort=31101

Replace <app> with the name of your application and the value of 31101 with your desired port.

When defining the port manually, the port must not already be in use and be within the defined NodePort range. Per NodePort the default port range is 30000-32767.

Monitoring

To learn more about the monitoring experience in Data Flow with Prometheus running on Kubernetes, see the Stream Monitoring or Task Monitoring guides.