Shankar Nandala

Team Lead, Cloud SaaS
Hyderabad

Summary

Cloud & DevOps Team Lead with 14+ years of experience architecting, managing, and optimizing cloud-based SaaS infrastructure. Proven track record in building scalable environments, automating operations, ensuring uptime, and leading global teams. Skilled in AWS, monitoring, CI/CD pipelines, and cross-functional leadership.

Overview

14 years of professional experience
3 years of post-secondary education

Work History

Team Lead – Cloud SaaS Operations

Infor
09.2023 - Current
  • Designed, deployed, and managed highly available and scalable infrastructure across AWS multi-tenant and single-tenant environments, with emphasis on security, performance, and operational resilience.
  • Implemented and tested disaster recovery strategies, ensuring data integrity and system continuity through backup automation and cross-region replication.
  • Built and managed CI/CD pipelines using GitHub Actions and GitLab CI to enable rapid, automated deployments across development, staging, and production environments.
  • Automated provisioning and operational tasks using Python, PowerShell, and AWS CLI/SSM to eliminate manual interventions.
  • Developed advanced Grafana dashboards and TICKscripts for end-to-end observability, with proactive alerting integrated via PagerDuty, MS Teams, and email notifications.
  • Configured and maintained Sumo Logic dashboards, log pipelines, and scheduled searches to support real-time monitoring, compliance, and RCA efforts.
  • Led P1 incident response and investigation, conducted detailed root cause analysis, and coordinated post-incident remediation to ensure service continuity and adherence to operational SLAs.
  • Created and maintained operational runbooks, SOPs, and checklists to improve first-response accuracy and reduce MTTR.
  • Enforced cloud security best practices including IAM role separation, least-privilege access, encryption standards, and automated compliance audits.
  • Managed tagging strategies and cost controls using AWS Cost Explorer, budgets, and rightsizing recommendations to optimize cloud spend and resource utilization.
  • Guided and mentored CloudOps/DevOps engineers on infrastructure best practices, automation patterns, and monitoring strategies.
  • Facilitated Agile ceremonies including sprint planning, daily standups, and retrospectives, ensuring team deliverables were aligned with business goals and priorities.
  • Collaborated with development, QA, and product teams to ensure reliable deployments and rapid feedback loops.
  • Managed and maintained multiple environments (dev, staging, pre-prod, production), ensuring deployment readiness and environmental consistency.
  • Coordinated CMW and application module deployments, supporting both manual and automated rollout strategies.
  • Enabled zero-downtime deployments through implementation of blue-green and canary deployment methods.
  • Maintained and updated internal documentation, SOPs, and onboarding materials to promote operational consistency and reduce ramp-up time.
  • Encouraged a knowledge-sharing culture through internal tech talks, peer code reviews, and skill development sessions.

Sr. Development Operations Engineer

Infor
12.2020 - 10.2023
  • Managed deployment and maintenance of single-tenant and multi-tenant SaaS environments, including applications such as dEPM and Optiva, using AWS CloudFormation to ensure scalable and maintainable infrastructure.
  • Designed highly available, cost-effective, and fault-tolerant systems utilizing EC2 instances, Auto Scaling Groups, and Elastic Load Balancers, optimizing performance and resource utilization.
  • Implemented disaster recovery policies for production environments, improving system reliability by reducing Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO).
  • Automated routine tasks and patching processes via AWS Systems Manager (SSM), improving operational efficiency and reducing manual intervention.
  • Enhanced Grafana dashboards with dynamic templating and proactive alerting based on threshold-driven TICKscripts, facilitating real-time monitoring and issue detection.
  • Refined Sumo Logic dashboards and queries to provide near real-time insights, reducing incident triage time and improving response effectiveness.
  • Developed and maintained a centralized monitoring stack using Sumo Logic, Monocle, and CloudWatch, ensuring consistent alerting and observability across multiple tenants.
  • Led major incident responses and provided root cause analyses (RCA) for critical P1 issues to stakeholders, implementing remediation plans to prevent recurrence.
  • Integrated AWS cost monitoring and rightsizing recommendations using Cost Explorer and Trusted Advisor, contributing to cost optimization efforts.
  • Participated in quarterly Availability Zone (AZ) failure testing in staging environments to validate the high availability of multi-tenant applications.
  • Provided 24x7 on-call support for production and non-production environments, promptly addressing incidents and maintaining system reliability.
  • Coordinated with cross-functional teams, including Development, Customer Support, QA, and Customer Success, to plan and execute production releases and change orders, ensuring seamless deployments.
  • Developed training modules to assist in onboarding new employees, enhancing team knowledge and operational efficiency.
  • Managed cloud infrastructure on AWS, including EC2, S3, ECS, and RDS, handling backups, patches, and scaling operations to ensure optimal performance.
  • Conducted monthly maintenance windows for multi-tenant and single-tenant applications, ensuring timely upgrades to the latest code versions and minimizing downtime.
  • Addressed day-to-day customer-reported technical incidents in collaboration with the customer support team, providing timely fixes and maintaining customer satisfaction.

Cloud Suite Administrator / Sr. Cloud Suite Admin

Infor
02.2017 - 11.2020
  • Deployed and maintained AWS infrastructure, including EC2, S3, RDS, Lambda, and VPC configurations, ensuring robust and scalable cloud environments.
  • Automated routine tasks using Python scripts and AWS Lambda, improving operational efficiency and streamlining processes.
  • Supported multi-tenant SaaS infrastructure by managing IAM, SSO, and role-based policies, enabling secure access across tenants.
  • Provided customer support for cases logged through ServiceNow, ensuring quick resolution of issues and maintaining high customer satisfaction.
  • Monitored alerts and applied fixes as needed to maintain system stability, leveraging centralized monitoring tools such as Sumo Logic and Grafana for consistent alerting.
  • Drove the alert standardization initiative by implementing severity tagging and noise-reduction filters, improving the quality of actionable insights.
  • Maintained consistent service health reporting through PowerShell scripts and Sumo dashboards to proactively detect system issues.
  • Created runbooks and knowledge base articles to enhance L1/L2 team efficiency and improve troubleshooting workflows.
  • Actively participated in weekend release deployments, environment cloning, and application patching, ensuring minimal downtime.
  • Worked closely with product and QA teams to deliver environment refreshes and perform load test scenarios.
  • Provided 24x7 on-call support for both production and non-production environments, ensuring high availability and quick incident resolution.

ICS Consultant

Infor
11.2014 - 01.2017
  • Provisioned single-tenant environments on AWS, including infrastructure setup, application installation, and configuration tailored to customer needs.
  • Managed end-to-end customer onboarding, covering environment readiness, access provisioning, and initial validation checks.
  • Used AWS CloudFormation to deploy infrastructure templates and performed manual installation of application suites when required.
  • Configured VPN tunnels, SFTP, and SMTP for secure communication between customer systems and hosted environments.
  • Implemented SSO integrations using Active Directory, AD-to-AD trust, and ADFS, enabling seamless and secure authentication.
  • Maintained a centralized monitoring stack using Sumo Logic and Grafana, ensuring consistent alerting and visibility across environments.
  • Led an alert standardization initiative using severity tagging and noise-reduction filters to enhance actionable insights.
  • Created dashboards and proactive health alerts to detect system issues before they impacted customers.
  • Streamlined service health reporting through PowerShell scripts and integrated dashboards.
  • Enabled secure access to environments using IAM policies, SSO, and role-based access controls in a multi-tenant SaaS setup.
  • Built runbooks and knowledge base articles to improve L1/L2 support efficiency and reduce issue resolution time.
  • Regularly participated in weekend release deployments, environment cloning, and application patching activities.
  • Collaborated with product and QA teams to deliver environment refreshes and support load testing scenarios.
  • Managed monthly maintenance windows for updates, patching, and stability checks with minimal customer disruption.
  • Resolved L3-level technical issues, performing deep troubleshooting across infrastructure, network, and application layers.

Associate Consultant

Infor
Hyderabad
02.2011 - 10.2014
  • Delivered application support for enterprise environments, ensuring stability and uptime of key business applications.
  • Provided Windows Server support, including user account management, system patching, and performance monitoring.
  • Administered IIS web servers, managing deployments, configuration updates, and SSL certificate renewals.
  • Performed basic SQL Server activities, such as executing queries, data extraction, running stored procedures, and assisting with backups/restores.
  • Handled daily incident tickets related to infrastructure and applications, ensuring timely resolution and accurate updates in the ticketing system.
  • Completed daily shift transitions, including handover documentation and briefings to ensure seamless 24/7 support coverage.
  • Participated in ITIL-based incident, change, and problem management processes.
  • Assisted in release and deployment activities, working with dev and QA teams to ensure smooth rollouts.
  • Maintained and updated standard operating procedures (SOPs) and created knowledge base articles for common issues and resolutions.

Education

Master of Computer Applications (MCA)

Geethanjali College of Engineering And Technology
Hyderabad, India
06.2007 - 07.2010

Skills

  • AWS (EC2, RDS, S3, IAM, SSM, Route53, EKS, CloudWatch, Cost Explorer)

  • Python

  • PowerShell

  • GitLab CI

  • AWS CLI

  • Grafana

  • TICKscripts

  • Sumo Logic

  • CloudWatch

  • Monocle

  • PagerDuty

  • Prometheus

  • ServiceNow

  • Windows Server

  • IIS

  • ITIL

  • Agile/Scrum (Jira)

  • Runbook Development

  • RCA

  • Stakeholder Communication

  • Multi-Tenancy Management

Certifications and Learning

  • AWS Certified Solutions Architect Associate
  • Azure Fundamentals
  • Scrum Master
  • ITIL Foundation Certificate

Core Competencies

AWS (EC2, RDS, S3, IAM, SSM, Route53, EKS, CloudWatch, Cost Explorer), Python, PowerShell, GitLab CI, AWS CLI, Grafana, TICKscripts, Sumo Logic, CloudWatch, Monocle, PagerDuty, Prometheus, ServiceNow, Windows Server, IIS, ITIL, Agile/Scrum (Jira), Runbook Development, RCA, Stakeholder Communication, Skilled in managing and optimizing multi-tenant environments for improved scalability and resource utilization
