Kafka Consumer Autoscaling with KEDA

Zhimin Wen

Published in ITNEXT

5 min read · Mar 13, 2022

Let's explore Kafka consumer autoscaling in Kubernetes.

If a consumer's current offset lags too far behind the partition's latest offset (the Log End Offset), exceeding some threshold, additional consumer replicas should be created to speed up processing of the Kafka topic. When the lag drops back below the threshold, the number of consumers should scale down again.
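To make the lag idea concrete, here is a minimal Go sketch that sums per-partition lag and compares it with a threshold. The `PartitionOffsets` type, the function names, and the sample numbers are all illustrative assumptions, not part of any Kafka library or of the original setup.

```go
package main

import "fmt"

// PartitionOffsets holds the two offsets that define lag for one partition.
// The type and its field names are hypothetical, chosen for illustration.
type PartitionOffsets struct {
	LogEndOffset    int64 // latest offset in the partition
	CommittedOffset int64 // offset the consumer group has reached
}

// TotalLag sums (log end offset - committed offset) over all partitions.
// Negative differences are ignored so a stale reading cannot reduce the total.
func TotalLag(parts []PartitionOffsets) int64 {
	var total int64
	for _, p := range parts {
		if lag := p.LogEndOffset - p.CommittedOffset; lag > 0 {
			total += lag
		}
	}
	return total
}

// ExceedsThreshold reports whether the total lag is above the threshold,
// which is the condition for adding consumer replicas.
func ExceedsThreshold(parts []PartitionOffsets, threshold int64) bool {
	return TotalLag(parts) > threshold
}

func main() {
	// Sample offsets for a three-partition topic (made-up values).
	parts := []PartitionOffsets{
		{LogEndOffset: 1200, CommittedOffset: 1000}, // lag 200
		{LogEndOffset: 900, CommittedOffset: 880},   // lag 20
		{LogEndOffset: 500, CommittedOffset: 500},   // lag 0
	}
	fmt.Println("total lag:", TotalLag(parts))
	fmt.Println("scale out:", ExceedsThreshold(parts, 100))
}
```

With these sample values the total lag is 220, so a threshold of 100 would call for more consumers.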

This is a typical Horizontal Pod Autoscaling use case that can be achieved with custom metrics. Instead of building that ourselves, we use KEDA, which has a readily available Kafka scaler, to autoscale the Kafka consumer. Let’s test it out.
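As a rough illustration of how a lag metric turns into a replica count, the sketch below applies the Horizontal Pod Autoscaler rule for an average-value target: the ceiling of total lag divided by the per-replica threshold, clamped between a minimum and a maximum. The function name, parameters, and numbers are assumptions for illustration. This is not KEDA's actual code, and the real scaler has further options.

```go
package main

import "fmt"

// DesiredReplicas is a hypothetical helper that mirrors the HPA rule for an
// average-value target: ceil(totalLag / lagThreshold), clamped to
// [minReplicas, maxReplicas]. totalLag is assumed to be non-negative.
func DesiredReplicas(totalLag, lagThreshold int64, minReplicas, maxReplicas int) int {
	if lagThreshold <= 0 {
		return minReplicas // guard against a meaningless threshold
	}
	n := int((totalLag + lagThreshold - 1) / lagThreshold) // integer ceiling
	if n < minReplicas {
		return minReplicas
	}
	if n > maxReplicas {
		return maxReplicas
	}
	return n
}

func main() {
	// Assumed setup: threshold of 100 messages per replica and a topic with
	// 6 partitions. Consumers beyond the partition count would sit idle in a
	// consumer group, so the maximum is set to 6 here.
	for _, lag := range []int64{0, 50, 220, 5000} {
		fmt.Printf("lag=%d -> replicas=%d\n", lag, DesiredReplicas(lag, 100, 1, 6))
	}
}
```

A total lag of 220 with a threshold of 100 yields 3 replicas, while a lag of 5000 is capped at the assumed maximum of 6.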

For the test environment, the Kubernetes engine is the OpenShift Container Platform (OCP). For Kafka, I am using IBM Event Streams (based on the Strimzi operator).

Install KEDA

We install KEDA as an operator in OCP.

First create a namespace named keda. Then create the following OperatorGroup and Subscription:

apiVersion: operators.coreos.com/v1
kind: OperatorGroup
metadata:
  name: keda-og
  namespace: keda
spec:
  targetNamespaces:
---
apiVersion: operators.coreos.com/v1alpha1
kind: Subscription
metadata:
  name: keda-operator
  namespace: keda
spec:
  name: keda
  source: community-operators
  sourceNamespace: openshift-marketplace

The operator is installed in the keda namespace and manages all namespaces, because targetNamespaces is left empty.

Once the operator is installed, create the following KedaController custom resource, omitting the fields that keep their default values.

apiVersion: keda.sh/v1alpha1
kind: KedaController
metadata:
  name: keda
  namespace: keda
spec:
  watchNamespace: ""
  logEncoder: console
  logLevel: info
  logLevelMetrics: '0'

The KEDA controller now watches all namespaces for KEDA custom resources. The KEDA deployment is complete.

Kafka Producer and Consumer

The Kafka producer and consumer are written in Go using the Kafka library github.com/segmentio/kafka-go.
