Open to opportunities

Kshitij Ambre

Infrastructure Engineer · Incident Manager · Cloud Operations

Results-driven Infrastructure Engineer with a strong track record in multi-cloud environments spanning Azure and AWS. Skilled in platform engineering, incident management, cloud capacity management, and championing AI adoption across cross-functional teams.


Impact at a Glance

~40%
Reduction in Mean Time to Resolution (MTTR) across all customer-facing services after deploying the FireHydrant incident management platform.
99.9%
SLA adherence maintained as the full-time incident manager for production incidents, driving post-incident reviews that yield actionable reliability improvements.
AI-First
Led AI adoption initiatives integrating Claude (Anthropic) and Microsoft Copilot Studio into operational workflows to automate toil and accelerate engineering productivity.
~60%
Reduction in manual patching effort through Python-based automation scripts, ensuring consistent, auditable patch cycles across hybrid infrastructure.

Where I've Worked

Building resilient infrastructure and driving operational excellence across multi-cloud environments.

May 2025 — Present
Modulr Finance
Mumbai / Pune, India

Infrastructure Engineer

  • Manage day-to-day cloud administration and operations across Azure and AWS, including resource provisioning, access governance, networking, and cost monitoring for production and non-production environments.
  • Own platform engineering initiatives that standardize deployment pipelines, infrastructure-as-code templates (Terraform), and environment parity, reducing configuration drift and accelerating release velocity.
  • Built and operationalized FireHydrant as the company-wide incident management platform, defining severity taxonomies, escalation policies, runbooks, and automated communications that streamlined the full incident lifecycle.
  • Serve as the dedicated incident manager for all customer-facing production incidents, coordinating cross-functional response teams, maintaining real-time stakeholder communication, and tracking service delivery metrics including MTTR, MTTA, and SLA adherence.
  • Execute cloud capacity management at granular service levels, performing demand forecasting, right-sizing compute and storage resources, and producing capacity reports that inform infrastructure investment decisions.
  • Implemented Linux process monitoring using custom health checks, systemd watchdogs, and integration with observability tooling (Prometheus, Grafana) to proactively detect degradation before user impact.
  • Champion AI adoption as part of a cross-functional team, deploying Claude (Anthropic) for intelligent documentation and knowledge retrieval and Microsoft Copilot Studio for workflow automation across engineering and operations.
Nov 2022 — May 2025
Blenheim Chalcot India
Mumbai, India

Associate — Platforms

  • Administered Azure cloud environments for a portfolio of high-growth ventures, managing identity and access (Azure AD/Entra ID), virtual networks, storage accounts, and policy compliance across multiple subscriptions.
  • Designed and deployed Microsoft Defender for Cloud across the estate, achieving a measurable improvement in Secure Score and establishing continuous security posture monitoring with automated remediation recommendations.
  • Managed a fleet of Linux servers (Ubuntu, RHEL), performing OS hardening, patch management, and performance tuning to maintain security compliance and uptime targets.
  • Developed Python-based automation scripts for patch orchestration, reducing manual patching effort by approximately 60% and ensuring consistent, auditable patch cycles across hybrid infrastructure.
  • Administered endpoint management using Microsoft Intune (Windows) and Addigy (macOS), enforcing device compliance policies, application deployment, and zero-trust conditional access controls.
  • Contributed to IT security initiatives including vulnerability management, firewall rule reviews, incident triage, and security awareness programmes, strengthening the organisation's overall risk posture.
  • Provided infrastructure support across networking, storage, and compute, collaborating with development teams to troubleshoot performance bottlenecks and ensure reliable service delivery.

Tools & Technologies

A comprehensive toolkit built across years of hands-on infrastructure and cloud engineering.

Cloud Platforms
Azure AD / Entra ID · Azure VNets · Azure NSGs · Azure Monitor · Azure Defender · AWS EC2 · AWS S3 · AWS IAM · CloudWatch · AWS VPC · AWS RDS
Infrastructure & Automation
Terraform · Ansible · IaC · CI/CD Pipelines · Git · Linux (Ubuntu, RHEL) · Bash Scripting
Incident Management
FireHydrant · PagerDuty · On-Call Management · Runbooks · Post-Incident Review · MTTR / MTTA / SLA
Observability & Monitoring
Prometheus · Grafana · Azure Monitor · CloudWatch · Log Analytics · Alerting Pipelines
Security
MS Defender for Cloud · Vulnerability Mgmt · Endpoint Protection · Zero Trust · Conditional Access · Intune · Addigy
Programming & Scripting
Python · Bash · PowerShell · YAML · JSON
DevOps & Platform Engineering
Docker · Kubernetes · Helm · GitHub Actions · Azure DevOps · GitOps
AI & Productivity Tools
Claude (Anthropic) · Copilot Studio · AI Workflow Automation

What I've Built

Major initiatives and platforms delivered across my career.

Incident Management

FireHydrant Platform Deployment

Built and operationalized a company-wide incident management platform from scratch, establishing structured incident lifecycles.

FireHydrant · SRE · Runbooks · SLA
Cloud & Platform

Cloud Capacity Management

Delivered granular service-level capacity management across Azure and AWS, optimizing costs while maintaining headroom for peak demand.

Azure · AWS · Terraform · Cost Optimization
Automation & AI

AI Adoption & Workflow Automation

Led cross-functional AI adoption, integrating Claude and Microsoft Copilot Studio into operational workflows to automate toil.

Claude · Copilot Studio · Python
Security

Microsoft Defender for Cloud Rollout

Designed and deployed Defender for Cloud across the entire estate, improving Secure Score and establishing continuous security posture monitoring.

Defender · Azure · Zero Trust
Cloud & Platform

Platform Engineering & IaC

Standardized deployment pipelines, Terraform templates, and environment parity, reducing config drift and accelerating release velocity.

Terraform · CI/CD · GitOps · Docker
Automation & AI

Automated Patch Orchestration

Built Python-based patch automation reducing manual effort by ~60%, with consistent, auditable cycles across hybrid infrastructure.

Python · Bash · Linux · Automation

Academic Background

Bachelor's Degree in Information Technology

Vidyalankar School of Information Technology, University of Mumbai

Graduated 2021

CGPA: 9.12
Microsoft Certified: Azure Fundamentals (AZ-900)
Continuous professional development in SRE practices, cloud-native architecture, and platform engineering

Let's Connect

Interested in working together or have a question? I'd love to hear from you.

I'm always open to discussing infrastructure challenges, cloud architecture, incident management strategies, or opportunities to collaborate. Feel free to reach out through the form or directly via email or phone.

Based in
Mumbai / Pune, India