EC2/systemd → EKS
45 services · 250+ instances
Migrating 45 services from EC2/systemd to EKS — and cutting deploys 4.5×
A field report from leading a platform migration of 45 services and 250+ instances at a multi-tenant SaaS company.
// WRITING
Field notes on distributed systems, data platforms, and large-scale migrations — written after the dust settles.
EC2/systemd → EKS
45 services · 250+ instances
A field report from leading a platform migration of 45 services and 250+ instances at a multi-tenant SaaS company.
Mongo → StarRocks
ms-latency dashboards
2026-04-28 · 7 min · DATA PLATFORMS
What broke when analytical queries shared a tenant cluster with OLTP — and what moving to a real OLAP engine actually gave us.
Java 8 → 17
35 services · 100+ CVEs closed
2026-04-10 · 6 min · MIGRATIONS
End-of-support runtimes don't just block CVE fixes — they block every transitive update that depends on them. A field report from a 35-service upgrade.
Mongo → Kafka via Debezium
oplog CDC at multi-tenant scale
2026-03-20 · 11 min · DISTRIBUTED SYSTEMS
Lessons from tailing a multi-tenant Mongo oplog into Kafka without putting the source database under more load than it already had.
GPS normalization
8–10k vehicles · 40–80M events/day
2026-03-05 · 8 min · DISTRIBUTED SYSTEMS
Polished APIs, flaky devices, customer-internal feeds — what cross-source GPS normalization actually looks like in production.
Iceberg lakehouse
bronze / silver / gold · 10–15 TB
2026-02-18 · 10 min · DATA PLATFORMS
Why the three-tier lakehouse pattern works for replacing OLAP-on-OLTP, and what each tier actually does in practice.
OTel + Grafana stack
trace / log / metric on one backbone
2026-02-02 · 9 min · DISTRIBUTED SYSTEMS
What it actually costs to consolidate observability onto OTel + Grafana + Loki + Tempo + VictoriaMetrics, and what you get back.