Results-driven Site Reliability Engineer with 10+ years of experience in Infrastructure Operations and SRE practices.
Results-oriented individual with a passion for continuous learning and innovation. Known for leveraging analytical thinking and creativity to solve problems and deliver high-impact solutions in fast-paced environments.
Overview
10
10
years of professional experience
1
1
Certification
Work History
Senior Service Operations Engineer
Providence Global Business Center
Hyderabad
02.2022 - Current
Led implementation of Ansible automation for configuration management, reducing deployment time by 40% and ensuring infrastructure consistency across environments
Developed Python scripts for automated health checks and self-healing capabilities, reducing manual intervention by 35%
Designed and implemented SRE best practices including SLIs, SLOs, and error budgets, improving service reliability by 25%
Managed AKS clusters for high-availability applications, maintaining 99.9% uptime
Established comprehensive monitoring using Azure Monitor and Application Insights with custom Python dashboards
Implemented GitOps workflows with deployment pipelines to automate application releases, increasing deployment frequency by 50%
Conducted incident response and post-mortem analysis, reducing critical incidents by 30%
Led cross-functional teams during major system outages, coordinating troubleshooting efforts and communication
System Specialist
IBM India Pvt Ltd
Hyderabad
10.2019 - 02.2022
Improved infrastructure availability by 18% through strategic planning, rigorous testing, and efficient implementation
Managed Linux installation, support, configuration, performance tuning, troubleshooting, and maintenance
Led initiatives on KPI and CPI factors as Squad Leader, preventing SLA breaches and improving performance by 15%
Conducted hardware replacements and software patching change requests with zero downtime
Utilized remote login methods including DRAC, ILOM, and XSCF for efficient system management
Administered virtual servers using VMware, optimizing resource utilization by 20%
Collaborated with cross-functional teams for on-call support, reducing resolution time by 25%
Documented issues and solutions using Confluence Wiki and Jira, improving knowledge sharing by 40%
Led improvements in IT governance processes, achieving a 15% reduction in alerts
Technical Business Analyst
AT&T Global Business Services India Pvt Ltd
Hyderabad
07.2017 - 09.2019
Automated server alert ticket data analysis, implementing a tangible alert reduction plan that decreased alerts by 23%
Developed schemas, data models, and visualization dashboards using PowerBI based on business requirements
Created Ops Manager Dashboard in PowerBI for outage and incident management data from Oracle DB
Designed and documented PowerBI Proof of Concept architecture for enterprise-wide implementation
Integrated data from multiple sources including MySQL, Oracle databases, SharePoint, and SAP BO into PowerBI
Developed workspaces and content packs for business users, optimizing report publishing and sharing
Collaborated with global teams on the Continuous Service Improvement Program (CSIP) to identify alerting patterns
Tested StackStorm build scripts, providing performance analysis that improved script efficiency by 30%
Event Management Operations Centre Analyst
AT&T Global Business Services India Pvt Ltd
Hyderabad
03.2015 - 07.2017
Provided technical support across Linux, Solaris, AIX, HP-UX, and Windows platforms, resolving issues within SLA by 95%
Optimized system operations and resource utilization, improving system performance by 20%
Documented incidents per SLAs and SOPs, enabling efficient problem diagnosis and resolution
Performed OS patching, hardware replacements, and Veritas cluster upgrades on Solaris servers
Monitored server status using BMC Patrol Central Console, reducing system downtime by 15%
Managed AT&T Integrated Cloud (AIC) 2.0 Infrastructure server operations
Administered Windows server patching and access request resolutions with 100% compliance
Developed an internal wiki with updated technical documentation, improving team efficiency by 25%
Education
Master of Technology - Communication Systems
GITAM school of Technology
01.2015
Bachelor of Technology - Electronics and Communications Engineering
GITAM School of Technology
01.2014
Skills
Azure SRE
Linux RHEL, UBUNTU
Kubernetes
Docker
VMware
Git and GitHub Actions
ITIL
PowerShell
Bash
Ansible
Python
Windows
AIX
HP-UX
Solaris
MySQL
ServiceNow
Jira
Confluence
PagerDuty
Certification
Azure Beginner (Az-900)
Azure Associate Administrator (Az-104)
ITIL v4
Timeline
Senior Service Operations Engineer
Providence Global Business Center
02.2022 - Current
System Specialist
IBM India Pvt Ltd
10.2019 - 02.2022
Technical Business Analyst
AT&T Global Business Services India Pvt Ltd
07.2017 - 09.2019
Event Management Operations Centre Analyst
AT&T Global Business Services India Pvt Ltd
03.2015 - 07.2017
Master of Technology - Communication Systems
GITAM school of Technology
Bachelor of Technology - Electronics and Communications Engineering