MetLife Insurance- Japan, Cloud SRE Engineer, 04/01/23, Present, DR & PROD Release Support: Provided 24/7 support for Disaster Recovery (DR) and Production (PROD) releases, ensuring seamless deployment and minimal downtime for critical applications., Cost Optimization & Implementation of Spot Instances: Implemented cost-saving strategies, including the use of spot instances in non-production environments across multiple projects, achieving a 40% reduction in monthly cloud expenses., Scheduled Maintenance Activities in Production: Planned and executed scheduled maintenance activities in production environments, including patching, upgrades, and system health checks to ensure high availability and compliance., Non-Production Environment Support: Managed and supported non-production environments, including the setup, configuration, and troubleshooting of development and staging environments to ensure readiness for production deployment., AKS, Istio, and Nginx Version Upgrades: Performed version upgrades for Azure Kubernetes Service (AKS), Istio service mesh, and Nginx ingress controllers, ensuring compatibility and leveraging new features for improved performance and security., Nodepool Size Upgrades: Coordinated with the Active Directory (AD) team to upgrade node pool sizes based on application and workload requirements, optimizing resource allocation and performance., Alert Creation in Production: Developed and implemented monitoring and alerting strategies using Prometheus and Azure Monitor for production environments, enabling proactive identification and resolution of potential issues.