Summary
Overview
Work History
Education
Skills
Websites
Certification
profile snapshot
growth path
Generic
Sandeep Kumar BN

Sandeep Kumar BN

Bengaluru

Summary

DevOps Engineer with 14years of experience in infrastructure automation, CI/CD, configuration management, and cloud platforms. Skilled in tools like Kubernetes, Terraform, Elasticsearch, Prometheus, Ansible, Chef, and Git. Proficient in deploying and upgrading applications and resolving issues. Knowledgeable in HashiCorp Vault and Consul and cloud migrations (AWS,Azure,OCI). Strong troubleshooting and mentoring abilities. Successfully implemented automation solutions and CD pipelines. Proven track record of driving efficiency and collaboration within cross-functional teams. Seeking to involve with an organization to showcase my potential and contribute my knowledge towards organizational goals.

Overview

14
14
years of professional experience

Work History

Senior DevOps Engineer

Synchronoss Technologies India Pvt Ltd.
09.2013 - Current
  • Prometheus and Thanos , Grafana (PTG) architecture designed and implemented on EKS and OKE clusters.
  • PTG Helm files prepared based on project requirements.
  • IPA Server configured auto sync with AD servers as HA model.
  • EKS node autoscaler kaprpenter implemented for SIP projects.
  • CSI and remaining addons automated through terraform.
  • Aqua container policy enforcer tool implemented/designed server client architecture.
  • OSSEC Server Installation for Compliance: Setting up and configuring the OSSEC server for compliance purposes
  • Ensuring proper integration of OSSEC with the existing infrastructure
  • Monitoring and managing security events and logs.
  • Nessus Installation for Vulnerability Scanner: Installing and configuring Nessus, a vulnerability scanner tool
  • Conducting regular vulnerability assessments and scans
  • Analyzing scan results and providing recommendations for remediation.
  • Chef Server Installation and Upgradation in AWS: Installing and upgrading Chef Server in an AWS environment
  • Configuring Chef Server for infrastructure automation and management
  • Collaborating with development teams to define and maintain Chef cookbooks and recipes.
  • Migration of Nodes Chef Bootstrap from Rackspace to AWS Chef Server: Migrating existing nodes from Rackspace to AWS Chef Server
  • Ensuring a smooth transition of configuration management for the migrated nodes
  • Troubleshooting and resolving any issues during the migration process.
  • Migration of Chef Cookbooks to Support New Requirements: Analyzing new requirements and modifying existing Chef cookbooks accordingly.Migrating and adapting Chef cookbooks to support changes in infrastructure and applications.Testing and validating the migrated cookbooks to ensure functionality and compatibility.
  • Application Migrations from Rackspace to Hyperv: Planning and executing the migration of applications from Rackspace to Hyperv
  • Coordinating with stakeholders to ensure minimal downtime and smooth migration
  • Monitoring and troubleshooting any issues that arise during the migration process.
  • Jira and GitLab Installation in K8: Installing and configuring Jira and GitLab in a Kubernetes (K8) environment
  • Setting up integration between Jira and GitLab for seamless issue tracking and project management
  • Providing support and troubleshooting for Jira and GitLab-related issues.
  • MinIO Setup for Object Storage: Setting up and configuring MinIO, an open-source object storage solution
  • Integrating MinIO with existing infrastructure and applications
  • Ensuring high availability, data redundancy, and security of the object storage system.
  • GestioIP Installation for IP Management: Installing and configuring GestioIP, an IP address management (IPAM) tool
  • Managing IP address assignments, DNS records, and subnet allocations
  • Providing accurate and up-to-date IP address information to support network operations
  • Monitoriing and Logging Implementation: Monitoring tools migration from Logicmon to Orion, Alertsite Configuration of ELK in Hyper-V.
  • Knowledge on Azure AKS and DevOps: Having good knowledge and experience in Azure AKS (Azure Kubernetes Service).Understanding the principles and practices of DevOps in an Azure environment.Leveraging Azure DevOps tools and services for CI/CD, automation, and infrastructure management
  • Kubernetes Upgradation on Eks: Kubernetes Cluster upgrade from 1.21 to 1.24 and migrated data from old cluster to new
  • EKS Cluster Creation using Terraform: Utilizing Terraform to provision and configure EKS clusters
  • Defining infrastructure as code for the EKS cluster setup
  • Ensuring the proper configuration and networking of the EKS cluster.
  • Deployment and Upgradation of Applications using Terraform in EKS Clusters: Leveraging Terraform to deploy and upgrade applications in EKS clusters
  • Creating reusable deployment templates for different application components
  • Automating the deployment process to ensure consistency and reliability.
  • Development of YAML Scripts to Deploy Micro Applications in Kubernetes: Creating YAML scripts for the deployment of microservices in Kubernetes
  • Defining the desired state of applications using Kubernetes manifest files
  • Managing deployment configurations and scaling options for microservices.
  • Upgrading EKS Clusters from 1.17 and onwards: Planning and executing the upgrade process for EKS clusters from version 1.17 to newer versions
  • Conducting compatibility tests and ensuring smooth transition during upgrades
  • Addressing any issues or challenges that arise during the upgrade process.
  • Implementation of HashiCorp Vault in Kubernetes to Pull Secrets: Integrating HashiCorp Vault with Kubernetes for secure storage and retrieval of secrets
  • Configuring access controls and policies for secrets management
  • Ensuring proper authentication and authorization mechanisms for accessing secrets.
  • Working on Horizontal Pod Autoscaling (HPA) in Kubernetes: Implementing Horizontal Pod Autoscaling in Kubernetes clusters
  • Defining metrics and thresholds for automatic scaling of pods based on workload demand
  • Monitoring and fine-tuning the HPA configuration to optimize application performance.
  • Troubleshooting Issues in Kubernetes: Identifying and resolving issues related to Kubernetes infrastructure and applications
  • Debugging networking, pod scheduling, and containerization issues
  • Collaborating with cross-functional teams to resolve complex problems.
  • Working with Spinnaker CD for Microservice Deployments in Kubernetes: Leveraging Spinnaker Continuous Deployment (CD) tool for deploying microservices in Kubernetes
  • Creating deployment pipelines and release strategies in Spinnaker
  • Ensuring smooth and controlled rollouts of microservices.
  • Mentoring and Guiding Team Members: Providing guidance and mentorship to junior team members
  • Sharing knowledge and best practices to enhance the team's skills and capabilities
  • Assisting team members with technical challenges and problem-solving.
  • Helping the Team in Resolving Issues: Collaborating with team members to troubleshoot and resolve technical issues
  • Providing support and guidance during critical incidents
  • Participating in on-call rotations and incident response processes.

Linux System Administrator

Unisys
09.2011 - 09.2013
  • Installation and Administration of Rundeck Application:Installing and configuring Rundeck, an open-source automation tool.Managing the Rundeck application, including user management and access control.Monitoring and maintaining the availability and performance of the Rundeck environment.
  • Middleware Application Deployments Automation using Rundeck:Automating the deployment process of middleware applications using Rundeck.Defining deployment jobs and workflows in Rundeck for consistent and reliable deployments.Monitoring and troubleshooting deployment issues to ensure successful application rollouts.
  • Removal of Old Backups on Application Nodes using Shell Script in Rundeck:Developing shell scripts to remove outdated backups on application nodes.Integrating the shell scripts into Rundeck for scheduled and automated execution.Ensuring proper backup management and storage optimization.
  • Bootstrap of Prod and Non-Prod Nodes to Chef Organizations:Bootstrapping production and non-production nodes to Chef organizations.Configuring nodes to communicate with the Chef server for configuration management.Managing node registration and maintaining the desired state of infrastructure.
  • Creation of Environments in Chef Repo for Different Environments: Creating and managing environments in the Chef repository.Defining environment-specific attributes, recipes, and configurations.Ensuring consistency and proper configuration across different environments
  • Experience in integrating code quality tools (Sonarqube, image scanner) : Extensive knowledge on Load balancers, Firewalls, TCP/IP, VLAN, VPN, HTTP(S).
  • Adding Runlist to Nodes, Running Chef Client, and Fixing Issues: Defining and assigning run lists to nodes in Chef
  • Executing the Chef client on nodes to apply configurations and recipes.Troubleshooting and resolving configuration-related issues on nodes
  • Good Knowledge on networking: Extensive knowledge on Load balancers, Firewalls, TCP/IP, VLAN, VPN, and HTTP(S).
  • Creation of Continuous Deployment (CD) Pipelines using Bamboo:Developing CD pipelines in Bamboo for automated deployments in non- production environments.Defining stages, triggers, and approval processes for the deployment pipeline.Monitoring and managing the CD pipeline for efficient and reliable application releases.
  • Creation of Plans for Chef to Push Cookbooks from Git to Chef Servers using Bamboo:Creating plans in Bamboo to automate the process of pushing cookbooks from Git to Chef servers.Configuring triggers and schedules for cookbook updates and deployments.Ensuring version control and proper cookbook management.
  • Good Knowledge on Git:Proficient in using Git for version control and source code management.Familiarity with Git workflows, branching strategies, and collaboration features.Troubleshooting Git-related issues and supporting team members in using Git effectively.
  • Worked on Bitbucket Products (Stash, Git, Confluence): Experience in working with Bitbucket products, including Stash, Git, and Confluence.Collaborating with development teams using Git repositories for source code management.Utilizing Confluence for documentation and knowledge sharing within the team.

Linux Administrator

Locuz Enterprise Solutions India Ltd.
02.2011 - 09.2011
  • DNS master and slave mode implemented for vodafone project.
  • Forwarders configured for DNS based on domains.
  • SMTP relay server implemented, and SPF records configured on DNS on the same, ECHO HELLO implemented and configured ACLs on the same.
  • DKIM configured on smtp server based on costumer request.
  • NFS v4 implemented for connect directory IBM application.
  • Application nodes are configured through chef roles and environments.
  • Linux system administration and scripts prepare based on projects requirements.
  • OS migration scripts prepared for centos migration.

Education

Bachelor of Science - CSE

SJBIT Collage
Bengaluru
03.2007

Skills

  • Excellent communication skill
  • Research skill
  • Analytical
  • Adaptability
  • Problem-solving and Troubleshooting
  • Decision-making
  • Attention-to-detail
  • Collaboration and Teamwork
  • Negotiation skill
  • Project Management
  • Infrastructure Automation Continuous
  • Linux satellite servers package management
  • Visualization Technologies
  • Linux
  • Helm
  • DevOps Practices
  • Security and Compliance
  • Cloud Computing (AWS, Azure, GCP, OCI)
  • Agile Methodologies
  • Networking and DNS Management
  • Scripting and Automation (Python, Bash, PowerShell)
  • Infrastructure as Code (IaC)
  • Incident Management
  • DevOps Toolchains
  • Performance Optimization
  • Containerization (Kubernetes, Docker)
  • Continuous Integration and Deployment (CI/CD)
  • Virtualization Technologies
  • Scripting and Infrastructure Automation
  • Hashicorp Terraform and Vault , Consul
  • Monitoring and Logging

Certification

  • HashiCorp Certified: Terraform Associate (003)
  • Calico Certified Operator AWS.
  • CKA: Certified Kubernetes Administrator.
  • Red Hat Certified Engineer (RHCE).
  • AWS Certified Developer - Associate certification
  • IT Information Library Foundations Certification (ITIL)

profile snapshot

❑ Experienced professional and visionary leader, with 14 years of experience in DevOps Engineering, Site Reliability Engineering, Unix System Administration.
❑ Proficient in the configuration and administration of Red Hat Linux systems, including handling critical tasks such as OS patching, both in production and non-production environments. Experienced in executing OS upgrade projects to ensure system reliability and security. Skilled in the installation and administration of essential components such as NTP, DNS, postfix, and NFS, ensuring seamless system functionality and efficient communication.
❑ Hands on experience in creating and managing essential database objects, including tables and tablespaces, ensuring efficient data organization and storage. Experienced in performing startup and shutdown procedures for individual nodes in RAC (Real Application Clusters), ensuring smooth operation and availability. Skilled in executing export and import operations at the schema level, facilitating data migration and synchronization. Proficient in managing ASM (Automatic Storage Management) disk groups using asmcmd, ensuring optimal utilization and maintenance of storage resources.
❑ Extensive experience in configuring, administering, and providing support for various middleware applications, including WebLogic and JBoss. Proficient in managing WebLogic Server versions 10g, 11g, and 12c, demonstrating expertise in handling critical tasks such as domain management, cluster configuration, and resource administration, including JDBC Connection Pools, JDBC Data sources, and security settings. Skilled in creating and configuring MQ (Message Queue) objects, including Queue Managers, Remote Queues, Local Queues, and Queue Aliases, ensuring seamless and efficient message-based communication within the application ecosystem.
❑ K8 Upgrades from 1.21 to 1.24 and Migration of Data: Planning and executing the upgrade of Kubernetes (K8) cluster from version
1.21 to 1.24. Ensuring compatibility and readiness of applications and workloads for the new version. Migrating data and configurations to the upgraded K8 cluster without data loss or service disruptions.
❑ Implementation of Cert Manager in K8 Cluster for Certificate Auto-Renewal: Installing and configuring Cert Manager in the Kubernetes (K8) cluster. Setting up certificate issuers and managing SSL/TLS certificates. Enabling automated certificate renewal and managing certificate lifecycles.
❑ Proficient in Writing chef cookbooks for Middleware installations and new environment or application configurations while data center migrations and for new customer setup.
❑ Expertise in collaborating with cross-functional teams, automate processes, monitor infrastructure and applications, troubleshoot issues, optimize system performance, ensure security and compliance.
❑ Extensive experience in data center migrations from private data center to Rackspace, oracle cloud and azure.
❑ Expertise in monitoring tools like ELK, Prometheus, Orion, Alert Site etc.
❑ Proficient in writing terraform to build infrastructure in aws, Rackspace and Kubernetes deployments.

growth path

  • Senior DevOps Engineer (09/2018 - Till Date) Synchronoss Technologies
  • DevOps Engineer (06/2016 - 09/2018) Synchronoss Technologies
  • Linux Administrator (08/2013 - 06/2016) Synchronoss Technologies
Sandeep Kumar BN