Direktiv — Kubernetes-Native Engine
TL;DR
Event bus optimizations and observability updates for a Kubernetes-native serverless workflow engine.
The Problem
Under high concurrent load, the workflow engine suffered latency spikes and dropped events, and it exposed too little runtime information to debug active workflows.
The Implementation
Re-architected the internal event routing, decoupled long-running database queries, and implemented structured logging with Go's log/slog package. Built observability pipelines from open-source tooling: OpenTelemetry, Fluent Bit, and Prometheus.
The Stack
Go, Kubernetes, Knative, PostgreSQL, OpenTelemetry, Fluent Bit, Prometheus, Grafana.
The Outcome
Increased event processing throughput by 10x and reduced workflow execution latency on critical routing paths to under 5ms.
Contributed open-source PRs: