Vijay Mourya
Senior DevOps & Infrastructure Reliability Engineer

Professional Experience

6+ years building reliable cloud infrastructure and DevOps platforms across AWS, GCP, Kubernetes, and production-grade CI/CD pipelines.

Key Achievements

🎤 Live Presentation

Presented service release to 500+ team members at Roche

🏆 Recognition Awards

3 internal recognitions/Applause for technical excellence 2024-2025

📐 Architecture Design

Service architecture design and comprehensive documentation

📚 Documentation

Extensive technical documentation and SOPs

Conference Attendance

🌐
KubeCon + CloudNativeCon
India 2024 • Hyderabad
☁️
AWS re:Invent
2024 • Las Vegas, USA

Career Timeline

Detailed work experience in chronological order (most recent first)

Roche Information Solutions India

Pune, India
Jul 2023 – Present
2.5+ years
Senior DevOps Engineer • Service Owner
🎯 Agile Delivery Coordinator (Acting Team Lead)
  • introduced AI tools for Daily Standups and Sprint Planning, as well implemting MCP servers for Jira automation, resulting in 20% reduction in meeting times
  • Directly mentoring 2 new hires along with onboarding, managing their tasks and providing technical guidance
  • Managing Jira board, sprint planning in absence of PM/PO, ensuring smooth team operations and timely delivery of sprint goals of 8 member team
  • Unblocking team members by providing technical support, facilitating cross-team collaboration, and ensuring alignment with project objectives
  • PI planning and backlog grooming sessions with PM/PO to prioritize tasks and plan releases effectively
🎯 [Service Owner] Kubernetes-based GitLab CI/CD Infrastructure (EKS/GKE)
  • Designed & deployed fully automated, scalable GitLab runner infrastructure across AWS and GCP in kubernetes environments (EKS/GKE) to support CI/CD pipelines for multiple teams
  • Supporting 5+ teams and 2,500+ jobs per release cycle, reducing manual efforts significantly. enabling 30% faster release cycles and reliability improvements
  • Developed on-demand deployment pipelines using AWS CodePipeline and GCP Cloud Build
  • Automated GitLab runner provisioning via Helm-based deployments with integrated IAM configuration
  • Packaged solution as reusable Terraform module for consistent, rapid deployments
  • Authored comprehensive architecture and implementation documentation
  • Standardized deployment process through IaC, achieving 30% faster delivery, reduced error rates, and cost optimization
⚡ [Service Owner] Event-driven multi Account AWS Patch Management System
  • Designed and deployed fully automated, event-driven patch management solution for AWS
  • Solution leverages AWS Serverless to patch multiple accounts efficiently, controlling the patch order between dev, staging, and production environments and promoting only tested patches to production
  • Patching 500+ EC2 instances monthly, improving security and operational efficiency
  • Developed metrics framework (Athena and S3) for actionable insights and BDD enablement
  • Presented solution to ~500 attendees at domain-wide event with video tutorial content
  • Authored technical documentation, troubleshooting SOPs, and system architecture diagrams (Lucid)
  • Implemented CI/CD pipeline for serverless deployments with TDD practices (unit & integration tests)
  • Recognized for optimizing patch management, reducing manual intervention, operational costs, and improving infrastructure security
🚀 Other Key Contributions
  • proficient in using AI tools like copilot, custom AI agents, and custom scripts to automate routine tasks, enhance documentation, and optimize workflows
  • Supported product launch initiatives through technical and business presentations
  • Implemented AWS cost optimization automations, saving $10,000+/month by cleaning unused resources example: unattached EBS volumes and orphaned snapshots
  • Deployed SSM document-based automations for configuration compliance and EC2 lifecycle management
  • Enhanced hardened machine image (AMI/GCP image) pipelines integrating Ansible playbooks and Packer
  • Engineered dynamic, parameterized GitLab CI pipelines using Jinja2, Python, YAML for one-click provisioning
Tech Stack:
AWS (EKS, Lambda, EventBridge, Systems Manager, CodePipeline, Athena, S3) • GCP (GKE, Cloud Build) • Kubernetes • Terraform • GitLab CI/CD • Helm • Packer • Ansible • Docker • Python • Jinja2 • Bash • Lucid/Draw.io

Amazon Development Centre India

Hyderabad, India
Apr 2022 – Jun 2023
1 year 3 months
DevOps Engineer • On-Call PoC
⚙️ Serverless Ops Engine
  • Developed serverless automation tools on AWS for organizational ticket tracking and process automation, improving operational efficiency and scalability by processing ~500 tickets monthly
  • Designed and implemented high-throughput AWS Lambda workflows to process large-scale CSV/JSON data from S3
  • Leveraged asynchronous invocation, file indexing, and function chaining to resolve Lambda timeout constraints and optimize data ingestion into DynamoDB
  • Created solution design diagrams with Draw.io and documented implementation with SOP guide books
🚨 On-Call & Incident Management
  • As Service PoC on call, authored detailed technical documentation, SOPs, and mitigation playbooks, significantly reducing recurring incidents and expediting team onboarding
  • Managed high-severity incidents as primary on-call engineer, implementing mitigation strategies, root cause analyses, and proactive monitoring to minimize customer impact and prevent recurrence
  • Contributed to KPI-driven monitoring and alerting efforts where within team I played key role in identifying KPIs to monitor & alert teams on any anomalies
🔄 HTTP(S) VIP Migration Project
  • Contributed in HTTP(s) VIP migration project, transitioning infrastructure from NetScaler to AWS native load balancers (ALB/NLB)
  • Developed comprehensive migration, rollback, and automation strategies, including IAM role design, prerequisite validation, and migration timeline planning
  • Collaborated with globally distributed teams, contributing to system design reviews and code quality assurance to ensure robust, production-ready deliverables
Tech Stack:
AWS (Lambda, S3, DynamoDB, ALB/NLB, CloudWatch, SNS, SQS) • Python • Draw.io • On-Call Operations • Root Cause Analysis

Tata Consultancy Services Limited

Nagpur, India
Jul 2019 – Mar 2022
2 years 9 months
DevOps Engineer
☁️ On-Prem to Cloud Migration
  • Supported cloud migration project by provisioning AWS Compute and Storage resources using Terraform and CloudFormation
  • Automating AMI creation with Packer & Ansible to bake custom tools into servers
  • Implemented hybrid infrastructure by integrating on-premises servers with AWS for file sharing (NAS) and establishing secure connectivity
🔄 Disaster Recovery & CI/CD
  • Engineered "point in time" disaster recovery solutions, automating failover with Jenkins jobs triggered by SQS and developing Lambda-based monitoring and recovery actions
  • Managed & Supported robust CI/CD pipelines in Jenkins for microservices and MFEs, leveraging shared libraries, multibranch pipelines, and integration with AWS CLI, SSM, and Lambda triggers
📊 Infrastructure & Operations
  • Automated infrastructure monitoring and alerting using CloudWatch, SNS, and Lambda, ensuring high availability and incident response
  • Deployed and managed containerized applications with Docker and Kubernetes, supporting microservices and troubleshooting pod deployments
  • Collaborated with clients and internal teams to provide project updates, conduct infrastructure knowledge transfer, and deliver disaster recovery planning and documentation
  • Contributed to documentation and SOPs to streamline operations, improve onboarding, and standardize change management processes
Tech Stack:
AWS (EC2, S3, CloudWatch, SNS, SQS, Lambda) • Terraform • CloudFormation • Jenkins • Packer • Ansible • Docker • Kubernetes • Python • Bash

Complete Technical Skillset

☁️ Cloud Platforms

  • AWS: EC2, EKS, Lambda, S3, DynamoDB, EventBridge, Systems Manager, CloudFormation, CloudWatch
  • GCP: GKE, Compute Engine, Cloud Functions, Cloud Storage
  • Multi-cloud architecture & cost optimization

🏗️ Infrastructure as Code

  • Terraform (modules, remote state, workspaces)
  • Ansible (playbooks, roles, vault)
  • Packer (AMI/image building)
  • Helm (chart development & management)
  • CloudFormation

🚀 CI/CD & DevOps

  • GitLab CI/CD (runners, pipelines, GitOps)
  • Jenkins (pipelines, plugins, distributed builds)
  • GitHub Actions
  • ArgoCD / GitOps practices
  • Security scanning (Trivy, SonarQube)

🐳 Containers & Orchestration

  • Kubernetes (EKS, GKE, on-prem)
  • Docker (multi-stage builds, optimization)
  • Container security & best practices
  • Service mesh (exploring Istio)

📊 Observability & Monitoring

  • Prometheus & Grafana
  • CloudWatch (metrics, logs, alarms)
  • ELK Stack (Elasticsearch, Logstash, Kibana)
  • Distributed tracing & APM
  • SRE practices & SLA/SLO management

💻 Programming & Scripting

  • Python (automation, serverless, data processing)
  • Bash/Shell scripting
  • Go (learning for tooling)
  • SQL (Athena, DynamoDB queries)
  • YAML, JSON, HCL

Career Highlights

6+
Years of Experience
3
Major Organizations
2,500+
Jobs per Release Cycle
$10K+
Monthly Cost Savings