Confluent Kafka data-product platform
A major UK telecom operator. Client abstracted for confidentiality; technical scope as delivered.
Context
The organisation needed to turn streaming data across 30+ source systems into reliable, reusable data products that analytics teams could depend on — with clear ownership and schema evolution that wouldn't break consumers downstream.
What I built
A Confluent Kafka data-product platform:
- 20+ productised data streams, each domain-owned rather than centrally bottlenecked.
- Schema governance via Schema Registry, so producers and consumers evolve independently.
- Hive LLAP and Spark 3 for query acceleration over the streamed data.
- A cross-team ownership model so domains, not a central queue, owned their products.
Impact
- 20+ data products in operation across the business.
- 30+ source systems integrated under one governed model.
- Domain-oriented ownership that scaled without a central gatekeeper.
Role & stack
Data engineer and technology architect (Accenture CTA group) — designed the platform and guided cross-border delivery teams.
Stack: Apache Kafka (Confluent Cloud), Schema Registry, Hive LLAP, Spark 3, NiFi, Hadoop, Elastic Stack.
→ See also Event-driven & streaming and Data & lakehouse.