Direktiv — Kubernetes-Native Engine

TL;DR

Event bus optimizations and observability updates for a Kubernetes-native serverless workflow engine.

The Problem

The workflow engine experienced latency spikes and event drops under high concurrent load, with insufficient runtime visibility for debugging active workflows.

The Implementation

Re-architected the internal event routing, decoupled long-running database queries, and implemented a structured slog logging system in Go. Built open-source observability pipelines using OpenTelemetry, Fluent Bit, and Prometheus.

The Stack

Go, Kubernetes, Knative, PostgreSQL, OpenTelemetry, Prometheus, Grafana.

The Outcome

Increased event processing throughput by 10x and reduced workflow execution latency on critical routing paths to under 5ms.

Contributed open-source PRs: