Frank Ittermann

Senior Platform Engineer

linkedin.com/in/frank-ittermann github.com/fr12k medium.com/@frank.ittermann_46267

Platform engineer with 15+ years of experience designing and operating cloud-native infrastructure, Kubernetes platforms, and distributed systems at scale. Focused on developer productivity, self-service platforms, and site reliability. Creator of Franky (streaming LLM agent framework), engine-ci (container-native CI/CD engine), and DuneBot (GitHub App for automated dependency management). Writes about CI/CD, platform engineering, and developer tooling on Medium.

Working Principles

Meeting
Change
Efficiency
Tools / People
Leadership

Education

Diploma, Applied Computer Sciene — Fachhochschule für Technik und Wirtschaft Berlin HTW (formerly FHTW) (2001 - 2006)

Interests

beach volleyball
science and technology
books
leadership

Frank Ittermann

Senior Platform Engineer

medium.com/@frank.ittermann_46267 frank-ittermann fr12k fr123k

Professional Summary

Experience

Jul, 2024 – present

ContainifyCI

Open Source Developer

Building LLM infrastructure, platform tooling, and container-native CI/CD. Author of Franky (streaming LLM agent framework), zompress (token compression), and engine-ci (container-native CI/CD).

franky — A provider-agnostic, streaming LLM agent and framework written in Zig
engine-ci — Reduced per-project CI setup from hours to minutes by building a container-native pipeline engine in Go (103 releases) that runs identically locally and in CI via Docker or Podman
DuneBot — Eliminated 10+ hours/week of manual Dependabot PR triage by building a GitHub App (20 releases) that automates approvals and dependency merges
Authored a 4-part tech blog series (12K+ reads) documenting the journey from GitHub Actions frustrations, through Dagger.io evaluation, to building engine-ci
go-file — Published a Go file abstraction package with lazy initialization, buffer I/O, and test-friendly error injection for simplified file testing

Go (Golang) Python Shell Zig Docker GitHub Actions Goreleaser Podman Trivy LLM Provider APIs SSE / HTTP Streaming

Mar, 2023 – present

Flink SE

Senior Platform Engineer

Overseeing the production infrastructure of a cloud-native, event-driven platform serving 1M+ users across Europe on Google Kubernetes Engine (GKE) and GCP.

5x faster deployment velocity for 30+ developers by architecting a self-service platform on Argo CD and Temporal, enabling independent microservice deployments
Cut infrastructure provisioning from days to minutes by designing Infrastructure as Code for 200+ GCP resources using Terraform and Config Connector
Zero-downtime GKE upgrade across production clusters spanning v1.26 to v1.29 within 3 months
50+ daily deployments across dev, staging, and production by engineering CI/CD pipelines with GitHub Actions and automated quality gates

Go (Golang) Helm YAML Argo CD GCP Config Connector GitHub Actions HashiCorp Terraform SonarQube Temporal Google Cloud Platform Kubernetes (GKE)

Feb, 2022 – Feb 2023

Planetly GmbH

Senior Site Reliability Engineer

Managed AWS and Azure infrastructure for an enterprise-scale carbon management platform. Focused on reliability, cost optimization, and cloud migration.

99.95% platform uptime and 20% cloud cost reduction by managing 50+ AWS resources via Terraform with reserved instance optimization and right-sizing
Zero data loss migration of 30+ production workloads from AWS to Azure within 4 months, achieving less than 2 hours total downtime
Reduced mean time to detection from 30 minutes to under 5 minutes by integrating Datadog APM with custom alerting thresholds
60% faster infrastructure deployments by designing Terraform CI/CD pipelines with CircleCI and automated provisioning workflows

Bash Go (Golang) Python YAML Ansible CircleCI HashiCorp Terraform AWS Azure Kubernetes PostgreSQL

Jan, 2020 – Oct 2021

Data4Life

Team Lead / Senior Site Reliability Engineer

Led a 4-person SRE team for a health-data platform serving 500K+ users. Transitioned from IC to team lead, managing people growth, ISO 27001 compliance, and infrastructure reliability.

Zero major findings in ISO 27001 re-audit by leading a 4-person SRE team through German BSI certification, implementing compliance controls across the infrastructure stack
Halved junior SRE onboarding time from 6 to 3 months through structured mentorship program covering tooling, runbooks, and incident response
PostgreSQL cluster provisioning cut from 4 hours to 30 minutes by designing automated cluster management with Ansible playbooks

Bash Go (Golang) Python Ansible HashiCorp Packer HashiCorp Terraform HashiCorp Vault Jenkins AWS Azure Kubernetes OpenStack PostgreSQL

Apr, 2015 – Aug, 2017

QualityPark

Senior Java Software Engineer

Developed and customized the Micro Focus Dimensions RM (Requirement Management) solution for enterprise customers. Responsible for designing, implementing, and testing Java-based components integrated into the core platform.

60% reduction in legacy system dependency by reverse engineering database access and web service APIs from C/C++ to Java
Increased deployment frequency from monthly to weekly by designing CI/CD pipeline with Jenkins and automated testing

C++ Java JavaScript Apache Maven Jenkins Oracle Database Tomcat Apache Jersey JUnit

Aug, 2006 – Feb, 2011

AWIN AG (formerly zanox AG)

Senior Java Software Developer

Migrated core services from legacy C/C++ to Java EE at AWIN (10M+ API requests/month). Technical lead for the public WebServices team building a scalable public REST/SOAP API.

10M+ API requests/month served after migrating core services from legacy C/C++ to Java EE platform, enabling horizontal scaling and modern CI/CD
5K+ requests/second at peak as technical lead for public REST/SOAP API team, architecting OAuth-based authentication (zanox connect) for 100+ B2B/B2C integrations

C++ Java Apache Maven Jenkins JBoss 4 MSSQL Tomcat Apache CXF Hibernate

Technical Skills

Cloud & Infrastructure

Google Cloud Platform (GCP), Amazon Web Services (AWS), Microsoft Azure, OpenStack, Cloud-Native Architecture

Kubernetes & Containers

Kubernetes (GKE, EKS, AKS), Docker, Helm, Argo CD, Kustomize

Infrastructure as Code & CI/CD

HashiCorp Terraform, GCP Config Connector, Ansible, Packer, GitHub Actions, CircleCI, Jenkins

Programming Languages

Go (Golang), Zig, Python, Bash, Java, TypeScript / JavaScript

LLM & AI Infrastructure

LLM Provider Integration (Anthropic, OpenAI, Gemini, Vertex), SSE / HTTP Streaming, Token Compression, Agent Orchestration, Container Sandboxing, Multi-Agent Systems

Observability & Reliability

Prometheus, Grafana, Datadog, OpenTelemetry, SLO / Error Budget Management, Incident Response & On-Call

Platform Engineering & Leadership

Internal Developer Platform (IDP), Self-Service Infrastructure, Developer Experience (DX), Temporal, ISO 27001, HashiCorp Vault, Team Leadership & Mentorship

Education

Diploma, Applied Computer Sciene

Fachhochschule für Technik und Wirtschaft Berlin HTW (formerly FHTW)

2001 - 2006

Working Principles

After reading the book 'Principles' by Ray Dalio. I was inspired to put together my personal working principles.

Meeting

Don't take the time and attention of work colleagues for granted.

Change

The power of change and adaptation starts with yourself.

Efficiency

Just work smart — break repeating cycles, focus on the ratio of time and outcome.

Tools / People

Tools have to follow people, not the people the tools.

Leadership

It's not about you — focus has to be on the people to lead, help them to grow.
It's all about you — be authentic and lead by example.

Interests

beach volleyball science and technology books leadership