
Arti
I keep systems alive at scale so your team can sleep at night
Skills

Bekijk mijn diensten

Werkervaring
SRE
Fintech • Fulltime
Apr 2015 - Present • 11 yrs 2 mos
Led SRE teams managing distributed infrastructure (Aerospike, Elasticsearch, RabbitMQ, MariaDB, ZooKeeper) serving hundreds of millions of users at peak load Architected high-availability and disaster recovery systems across 45+ critical services with fully automated quarterly DR drills Built an AI-powered on-call automation tool using Golang, NATS, and OpenAI — enabling automated RCA generation, Jira ticketing, and intelligent failure detection Migrated 35 ZooKeeper clusters and Elasticsearch replication across 8 production clusters using full Ansible automation, delivered ahead of schedule Designed and automated financial services infrastructure on Azure including secure on-premise connectivity via WireGuard Owned compliance readiness across PCI DSS, ICoFR, PA-DSS, RBI, and SEBI audits for 4+ consecutive years with zero licence delays Reduced weekly sprint planning from 2 hours to 10 minutes using GitHub Copilot and OpenAI automation Migrated legacy enterprise applications to Kubernetes for Fortune 500 clients including Micron, IBM, and Autodesk Automated large-scale data centre migrations on AWS using Ansible, Docker, and Terraform Mentored 30+ engineers and interns; coached senior SREs into Engineering Manager roles Drove eBPF-based performance debugging to identify kernel-level bottlenecks and network anomalies across large-scale distributed services Established automation-first SRE culture — Ansible workflows, alert noise reduction, on-call runbooks, and knowledge-sharing programs across teams