Senior Platform Engineer · DevOps · Kubernetes · SRE

Building Reliable Cloud Platforms at Scale

Senior Platform Engineer with 4.5+ years of experience building Kubernetes-native developer platforms, GitOps workflows, observability systems, CI/CD automation, and hybrid cloud infrastructure for enterprise-scale environments.

$ kubectl get pods -A
✔ 67+ applications migrated
$ terraform apply
✔ Infrastructure standardized
$ helm upgrade platform
✔ Zero downtime deployments
$ argocd app sync prod
✔ GitOps delivery enabled
$ prometheus status
✔ MTTR reduced by 42%

67+ Applications Migrated
20+ Engineering Teams Supported
$500K Monitoring Cost Savings
Deployment Velocity Increase

Core Expertise

Platform engineering, Kubernetes infrastructure, GitOps delivery, observability, CI/CD automation, and hybrid-cloud operations across enterprise-scale production environments.

Cloud & Infrastructure

Production Infrastructure Engineering · 4+ Years
AWS · EKS · ECR · EC2 · IAM · VPC · S3 · CloudWatch · Terraform · Ansible · Linux · Networking · IaC · Hybrid Cloud

Containers & Orchestration

Kubernetes Platform Operations · 4+ Years
Kubernetes · Helm · ArgoCD · FluxCD · Docker · cert-manager · Vault

CI/CD & GitOps

Release Engineering & Automation · 4+ Years
GitHub Actions · CI/CD · GitOps · Jenkins · OPA · SAST · Snyk · Blue-Green

Observability & SRE

Reliability Engineering & Monitoring · 3+ Years
Prometheus · Grafana · ELK Stack · Datadog · OpenTelemetry · SLI/SLO · Error Budgets · AlertManager · Incident Management · Distributed Tracing

AI & Platform Engineering

Internal Developer Platforms & GenAI
RAG · OpenAI · Embeddings · Vector Search · Platform Engineering · Internal Developer Platform · IDP

Programming & Data

Automation & Backend Engineering · 4+ Years
Python · Bash · YAML · Node.js · Kafka · Apache Airflow · Neo4j · GraphQL · REST APIs · PowerShell · PostgreSQL · Shell Scripting

Professional Experience

Enterprise platform engineering, Kubernetes infrastructure, observability modernization, GitOps delivery, hybrid-cloud automation, and developer enablement across production-scale environments.

Senior DevOps Engineer (SDE-3)
Morgan Stanley · Bangalore, India
December 2023 — Present
  • Architected and deployed an internal hybrid-cloud developer platform using Kubernetes, Helm, ArgoCD, Terraform, and AWS-integrated infrastructure services, accelerating application onboarding by 33% and supporting 20+ engineering teams within 9 months.
  • Designed and managed Infrastructure-as-Code (IaC) workflows using Terraform for Kubernetes platform provisioning, environment standardization, secrets integration, and scalable infrastructure lifecycle management, reducing deployment errors by 35%.
  • Spearheaded integration of Jenkins, FluxCD, and Helm into advanced CI/CD pipelines, streamlining deployments across diverse environments and enabling rollback, approval workflows, and GitOps-based release automation for 20+ product teams.
  • Architected a centralized observability strategy using Prometheus, Grafana, and Loki, implementing automated monitoring and alerting systems that reduced incident response time by 42% and saved 250+ operational hours.
  • Developed a fully automated TLS/SSL certificate lifecycle management system for Kubernetes using Vault and cert-manager, reducing certificate-related production incidents by 95%.
  • Implemented Neo4j backup and recovery pipelines that run automatically after each deployment and data synchronization, enabling zero-data-loss recovery readiness across 100% of production environments.
  • Integrated SAST scanning into CI/CD pipelines using SonarQube and Snyk, automating vulnerability detection across 20+ microservices and reducing remediation time by 35%.
  • Engineered PolicyGPT, a GenAI-powered RAG assistant leveraging OpenAI embeddings and Neo4j vector search, enabling semantic policy discovery across 1,000+ compliance documents.
  • Partnered with infrastructure teams to strengthen AWS and on-prem Kubernetes environments through networking improvements, secrets management, infrastructure governance, and deployment standardization initiatives.

DevOps Engineer (SDE)
Wiley India · Bangalore, India
July 2021 — December 2023
  • Built and owned a hybrid-cloud governance platform using Python Flask, IBM DB2, AWS services, and Terraform-driven infrastructure automation, improving platform usability by 27% and streamlining developer workflows.
  • Automated ServiceNow operational workflows using Python and Ansible, reducing manual effort and operational overhead by approximately 30%.
  • Managed Kubernetes deployments using Helm across hybrid infrastructure environments, supporting scalable container orchestration, deployment standardization, and improved release reliability.
  • Spearheaded migration of 67 applications from legacy monitoring systems to a centralized observability stack using Prometheus, Grafana, Loki, and AWS-integrated monitoring services, implementing FinOps practices that reduced annual monitoring costs by $500K.
  • Developed centralized logging, alerting, and custom Prometheus exporters including SNMP, Blackbox, and Python-based exporters, improving observability, infrastructure reliability, and operational efficiency across distributed systems.
  • Engineered and maintained Kafka Connect integrations for 30+ microservices using Confluent Platform tools deployed on containerized infrastructure, reducing data pipeline latency by 42%.

Platform Engineering Highlights

Enterprise-scale Kubernetes platforms, GitOps infrastructure, observability modernization, internal developer platforms, and AI-powered governance systems.

Observability Modernization

Enterprise-wide migration from fragmented monitoring systems to a centralized Prometheus, Grafana, and Loki observability stack supporting 67+ applications.

Prometheus · Grafana · Loki · AlertManager · Fluent Bit

PolicyGPT

GenAI-powered Retrieval-Augmented Generation assistant enabling semantic policy search using OpenAI embeddings and Neo4j vector indexing across enterprise governance systems.

OpenAI · Neo4j · RAG · Vector Search · Python

Engineering Principles

Core principles driving platform reliability, operational excellence, automation, and scalable infrastructure delivery.

GitOps First

Infrastructure and deployments managed declaratively through version-controlled Git workflows.

Infrastructure as Code

Repeatable, scalable infrastructure provisioning automated with Terraform and supporting tooling.

Observability by Default

Metrics, logging, tracing, and proactive alerting integrated into every production platform.

Secure Platform Design

Automated certificate management, policy enforcement, and zero-trust engineering practices.