Close this
Close this

Jyotshna Garlapati

Development
Indiana, United States

Skills

DevOps

About

Jyotshna Garlapati's skills align with Programmers (Information and Communication Technology). Jyotshna also has skills associated with Consultants and Specialists (Information and Communication Technology). Jyotshna Garlapati has 11 years of work experience, with 2 years of management experience, including a low-level position.
View more

Work Experience

Lead DevOps Engineer/ SRE Engineer

Wellsfargo, NJ
October 2023 - Present
  • Responsibilities: * Created AMIs and used AWS EC2 to build application and database servers. * To maintain the integrity of programmers, work was done on the GIT branching for apps by establishing Release and Development branches. * The automation of AWS services, such as Web servers, ELB, Cloud front distribution, databases, security groups, and application setup, was done using Groovy scripts and Python Lambda functions. * Terraform and Python were used to automate the deployment of the cloud. * Worked as a Chef for unattended AWS boot-up. * Used Datadog tool for monitoring project metrics * Used Jenkins pipelines to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, Created Pods and managing using Kubernetes. * Using Salesforce deployments for application to support * Good experience in handling AEM deployments and servers * Created the Python Lambda code for IAM's policy management. * Lambda function was developed to automate the AWS infrastructure. * Ansible modules have been created to start, stop, terminate, and create AWS EC2 instances. * Using Continuous Integration and Continuous Deployment with Jenkins and Ansible. * Designing, developing, and managing Kafka-based data pipelines * Administering, maintaining, and configuring Kafka clusters for optimal performance and reliability * CloudWatch was used to monitor the services, and the logs were saved in Simple Storage Service (S3). S3's lifecycle rules were then utilized to update the bucket's versioning depending on how old the log files are. * For highly available multi-AZ applications, an AWS Virtual Private Cloud (VPC) was created. EC2 instances were then established across several Subnets, and access to these resources was granted to users and groups using IAM roles and policies. * Good Experience in mangoDB * Developing and implementing MongoDB queries * Collaborating with cross-functional teams to identify and solve complex problems in mongoDB. * Installed Chef habitat builder on-prem and developed Chef Cookbooks, and recipes to configure, deploy and maintain software components of the existing infrastructure. Managed Chef Client nodes and uploaded cookbooks to Chef-server using the workstation. * Implemented Ansible automation, and created playbooks and plays to update Test/Production servers' settings. * The user experience and latency were significantly improved by setting up GCP Firewall rules to allow or prohibit traffic to and from the VM's instances depending on the given configuration and using GCP cloud CDN (content delivery network) to distribute content from GCP cache locations. * Red Hat and SuSE systems were managed using VMware Virtualization. * All the Docker containers within the pods were managed by a Kubernetes minikube cluster, which was set with auto scaling in the Replication controller. * Several Ansible playbooks and Terraform scripts were written to supply the infrastructure with the application's requirements and deploy the application onto cloud instances. * AWS Route53 was set up to register domain names, direct public traffic to domains, and continually verify the resources' status using Cloud Watch. * Docker setup that includes a hub for managing Docker images and deploying applications inside of software containers has been installed and configured. * Managed Puppet master to install and configure Puppet on computer machines to automate the creation of node setting and wrote manifest files and modules in Puppet to configure the AWS instances. * Kubernetes was set up and configured, and Docker containers were orchestrated using Kubernetes nodes. * Worked on Creating the automation for few applications using python AWS lambda and fixing we tasks based on errors and restating applications and retrieving logs as per the issues. * Puppet, Jenkins, GitHub, Maven, and Ant are some of the Devops tools I've installed, setup, and managed. * Managed weekly builds, tests, and deployments using Jenkins, and weekly releases using GIT using the Dev/Test/Prod Branching Model. * System management tasks for more than 14,000 Linux Red Hat & CentOS servers using Token Sessions and Intune MDM. This comprises regular, pro-active monitoring of the stability/availability and health of server-based assets daily. * Placed your playbooks into GIT and worked on Python scripts to automate the health check procedure for Dynamic Inventory in Ansible Managed Playbooks and Playbook directories. * For Windows/Linux support, communication engagement through emails, IMs, Slack, Jabber, etc. for Verizon Data Services - India (VDSI), first level support is available. * Maintaining servers running Red Hat Linux AS 5 and SUSE Enterprises Linux 8 in test and production environments, as well as proactively monitoring and managing servers in both environments. * Worked with services offered by the Google Cloud Platform (GCP), including compute engine, cloud load balancing, cloud storage, cloud SQL, stack driver monitoring, and cloud deployment manager. * Managed configurations, deployed applications, provisioned clouds, completed ad hoc tasks, configured, and maintained Tomcat servers, and orchestrated multiple nodes using Ansible. * Written Ansible playbooks for restarting apps and deploying WAR, JAR, and EAR files across all environments. * Ansible's file module was utilized in custom Ansible modules to copy and delete files from distant systems as well as gather information and perform actions to stop and un pause alerts during deployments. * Utilize Python to automate infrastructure provisioning, deployment, and configuration management. * Use Python for log analysis, troubleshooting, and root cause analysis during incidents. * Develop automated incident response systems and tools to mitigate and prevent outages * Maintain high level of production environment availability using automation and monitoring tools capable of completing both front-end and back-end work using a variety of on-prem and in-cloud technology stacks based on the requirements of the project. * Good hands on experience on GKE with Terraform

Lead Devops Engineer

ALDI, Remote (Accenture,India)
November 2021 - December 2022
  • Responsibilities: * Hands on experience in Retail domain * Performed continuous deployments for SAP hybris and salesforce Applications. * Hands on Experience on Splunk for application monitoring, alerts for volumes and usage. * worked on git lab configurations, design, and review pipeline plans, preparing new pipelines according to customer requirements. * Performing application upgrades containers restart in graceful/Rolling manner if needed. * Installed SAP and Salesforce application deployments, upgrades and patches updates. * Analyzing the data on Kibana, Dynatrace, Grafana and Prometheus and deriving the performance of the products. Setup alerting using Prometheus and integrate Nagios and Prometheus with pager duty to notify the respective teams based on the escalation policy. * Set up AWS CloudWatch alarms for server performance monitoring, including CPU utilization and disk usage, ensuring proactive identification of potential issues. * Resolved Jenkins deployments issues for build failures. * Performed capacity assessment for new requests of servers i.e., calculating CPU and Memory for new servers according to the current/future applications running on the system. * Performed Application Monitoring using tools such as New Relic and Dynatrace to consistently check the application performance. * provide on-call production support on a rotational basis. * Introducing and implementing DevOps solutions for a range of Agile projects, creating automated build, deployment, monitoring, testing, and reporting pipelines and solutions * Building and maintaining tools, solutions and microservices associated with deployment and our operations platform, ensuring that all meet our customer service standards and reduce errors. * Actively troubleshoot any issues that arise during testing and production, catching and solving issues before launch. * skilled in defining target performance metrics and reliability goals for critical systems through the establishment of Service Level Objectives (SLOs) * Skilled in negotiating and defining Service Level Agreements (SLAs) with stakeholders, ensuring alignment with business objectives and customer expectations. * Implemented efficient incident management techniques, leading to a significant decrease in both Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) for incidents. * Conducted thorough root cause analyses and post-incident reviews (PIRs), identifying areas for improvement and implementing preventive measures to enhance service stability. * Played a key role in incident management, responding to and resolving incidents promptly to minimize service downtime and ensure high availability. * Possessing extensive expertise in incident response planning and conducting regular drills, leading to notable enhancements in team preparedness and response efficiency. * Experienced in implementing Service Level Indicators (SLIs) to measure and monitor system performance and reliability. * Resolved critical issues promptly, leading to reduced downtime and improved service reliability. * Automate our operational processes as needed, with accuracy and in compliance with our security requirements. * Specifying, documenting, and developing new product features, and writing automating scripts so that others may understand. * Established DevOps Engineer team best practices by managing code deployments, fixes, updates, and related processes. * Create automated build triggers like Bitbucket webhooks and scripts that activate when new code is merged into the software repository to begin an automated build process. * Setup Nagios based monitoring tool to provide on-going monitoring, status, and notifications for host and service events. * Performing continuous integration and delivery of new microservices, on-demand trouble shooting of large-scale deployment issues on Linux systems. * Successfully managed and maintained Kubernetes clusters, ensuring high availability and scalability of microservices-based applications. * Implemented robust monitoring and alerting solutions for Kubernetes environments, proactively detecting and resolving potential issues before they impacted services. * Utilized Terraform for provisioning AWS cloud infrastructure, contributing to the creation of AWS Batch policies to enable Lambda functions for various tasks and leveraging the full potential of AWS Batch features. * Worked on EC2, S3, ELB, Autoscaling Groups, Elastic Beanstalk, Cloud Formation, CloudFront, RDS, VPC, Route 53, CloudWatch, IAM & Roles, * Worked on creating a Maven based build environment, testing import and export components of the Jibe framework, integration with Kafka services, monitoring data synchronization between Oracle and Mongo databases. * Identify and correct the root cause of various system alarms. Recommend changes to avoid their recurrence. * Integrate CICD tools like Jenkin, GIT with Nagios, Splunk, cloud watch to monitor and maintain logs. * Design AWS Cloud Formation templates to create custom sized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates. * Implemented SonarQube scanning to find the source code vulnerabilities. * Deployed microservices in EKS clusters using helm charts, docker Images and Artifacts produced in Jenkins using shared libraries and accessing the Secrets using Vault as sidecar. * Utilized Elastic Block Store (EBS) for persistent storage and IAM for access management, ensuring secure and controlled access to AWS resources. * Worked with Developers and Test engineers to collect information for automating the CI/CD process and provide manual judgment steps to deploy to production. * Automated CI/CD tools with Grafana using Prometheus and ELK as data sources to track the microservices onboarding and resource usage. * Demonstrated proficiency in versioning, branching, and managing Jenkins pipelines to achieve continuous integration and deployment objectives. * Automated infrastructure provisioning and configuration using Infrastructure as Code (IaC) tools, resulting in increased environment consistency, and reducing manual errors. * Administrate Bitbucket, JIRA/WIKI and Migrated JIRA/WIKI from server to Atlassian cloud. * Implemented Self-service CICD process to setup GITOPS process using shared libraries and Jenkins file in Jenkins to support multiple services and environments. * Utilized Terraform and AWS Cloud information as an Infrastructure as code tool to define, provision, and manage infrastructure resources. * Managed builds and deployments across multiple environments such as Development, Testing, Pre-production & Production environments, implementing high availability architecture with Elastic Load Balancing (ELB) for load balancing across multiple availability zones.

Sr Devops Engineer

Optum ,india
March 2016 - November 2021
  • Responsibilities: * Hands on experience in healthcare domain * Migrated applications from Openshift to K8s. * Experience with setting up builds maintaining servers in Windows and Linux * Written supported Yaml files to support different functionalities in application to support Billing to customer when customer didn't pay bills properly. * Coordinate with the Development, Database Administration, QA, and IT Operations teams to ensure there are no resource conflicts. * Prepared centos server that requires application to work on Docker, Kubectl instances. * Worked with development and QA to support and enhance the existing SCM processes and toolsets. * Manage multiple build components, maintain, and track build dependencies. * Creating and managing SCM branches and helping with merge and integration across branches. * Supported small to large sized projects and established branching and merging Strategies in GIT Knowledge of concepts in code lines, branching, merging, integration, versioning, Tagging, pushing Manifesto files with build tags etc. * Helped in automating any manual parts of the release process. * Perform K8s Deployments using Blue-Green Deployment methodology. * Track Open-Source Software usage and help with the legal compliance of Open-Source Software license. * Implemented and maintained Jenkins pipelines, streamlining the CI/CD process, and ensuring efficient software delivery. * Wrote and maintained Jenkins files, defining stages, steps, and integrating with a variety of tools for building, testing, and deploying applications. * Configured Jenkins agents with the Kubernetes plugin to provision dynamically on an Amazon EKS cluster. * Created and maintained K8S deployment objects, including deployment configurations and replica sets, to enable efficient rolling updates, ensuring zero-downtime deployments and seamless application availability. * Implemented load balancing by creating service configuration files using YAML scripting. * Performed Infrastructure Monitoring using tools, such as Prometheus and Splunk, to gain insights into clusters and address issues when needed. * Leveraged readiness probes to orchestrate the update process, ensuring that new versions of the application are deployed only when they are ready to serve traffic, enhancing overall reliability and user experience. * Execute Proof of Concepts on behalf of configuration management, CI/CD (continuous integration / continuous deployment) practice, assessing new products and methods, * developing, and implementing appropriate practices across multiple development environments * Continuously evaluate and recommend improvement to CI/CD processes. * Introducing pipeline and automation best practices, putting together an introduction to Docker and Kubernetes on AWS using EKS * Created Docker images specific to project environment to support Development, automation testing, Static analysis, and deployment by spinning up Docker containers using Docker images in Jenkins's pipeline process. * Automating environment build and provisioning using virtualization, containerization, and orchestration tools with Docker, VMWare, Kubernetes. * Create clusters in AWS and manage the clusters using Kubernetes(k8s). Using Jenkins to deploy code to AWS Cloud, create new namespaces, create Docker images and push them to container registry of AWS. * Developed Docker based microservices, deployment modules with Jenkins, Kubernetes and Ansible based pipelines/frameworks. Experimented with Docker, by using Docker-compose. * Hands-on experience in using tools like Docker Compose, Kubernetes etc., for orchestrating, linking, and deploying the services related to the containers. Created Docker images using a Docker file, worked on Docker container Snapshots, removing images, and managing Docker Volumes * Used Kubernetes for container operation in AWS and used Kubernetes clusters as a network and load balancer, and chosen Kubernetes is also good at running web applications in a cluster way, also used in multiple services by creating images and reduced space. Automate NGINX/MySQL Setup and Monitor * Used scripts to replicate production build environments on a local dev box using Docker images. * Support Local System Administrators to troubleshoot Configuration Management and Network issues, IP Networking (Firewall, DNS, and Load Balancers). * Developed, maintained, and distributed release notes for each scheduled release. * Assemble Release Kit and complete paperwork, including implementation/back out plans * Combines software engineering and site reliability engineering best practices to build and run large-scale, distributed, faulttolerant systems with a focus on optimizing systems and reducing toil through automation.

Senior Devops Engineer

United Health Group,india
August 2015 - March 2016
  • Responsibilities: * Hands on experience in healthcare domain * Architected and implemented scalable cloud infrastructure on AWS and Azure following industry best practices. * Developed and executed automated infrastructure provisioning and configuration management using Terraform and Ansible, improving efficiency and reducing manual errors. * Implemented end-to-end CI/CD pipelines using Jenkins, Git, and Docker, accelerating application delivery, and enhancing release management processes. * Leveraged monitoring and optimization tools such as AWS CloudWatch, Azure Monitor, and Stack driver to proactively improve resource utilization and cost efficiency. * Coordinate with the Development, Database Administration, QA, and IT Operations teams to ensure there are no resource conflicts. * Participate in architectural decisions, system design, and technical solutions. * Write clean. Maintainable, and efficient code in python * Debug and troubleshoot applications, fixing errors and optimizing performance. * Collaborated closely with cross-functional teams to gather requirements, provide technical guidance, and troubleshoot cloud infrastructure issues. * Conducted regular security assessments, implemented robust security controls, and ensured compliance with industry standards to protect cloud environments against threats. * Contributed to the development of disaster recovery and business continuity strategies, ensuring the resilience of cloud-based systems. * Creating and managing SCM branches and help with merge and integration across branches. * Managed highly available monitoring infrastructure, leveraging Nagios, New Relic and Dynatrace to monitor diverse application servers and their components. * Helped in automating any manual parts of the release process. * Acted as the primary point of contact for on-call support, effectively resolving incidents and minimizing service disruptions. * Implemented and maintained Jenkins pipelines, streamlining the CI/CD process, and ensuring efficient software delivery. * Expertise in creating and managing user accounts, groups, and disk spaces within a Linux environment. * Writes code, completes programming, reviews peers' code, writes documentation, writes tests, performs testing, and debugs code in lower and production environments.

Cloud Engineer

Seagate
June 2014 - August 2015
  • Responsibilities: * Architected and implemented scalable cloud infrastructure on AWS, and Azure following industry best practices. * Orchestrated the management of virtual machines, storage, networking, and security, ensuring high availability and optimal performance. * Developed and executed automated infrastructure provisioning and configuration management using Terraform and Ansible, improving efficiency and reducing manual errors. * Implemented end-to-end CI/CD pipelines using Jenkins, Git, and Docker, accelerating application delivery, and enhancing release management processes. * Leveraged monitoring and optimization tools such as AWS CloudWatch, Azure Monitor, and Stackdriver to proactively improve resource utilization and cost efficiency. * Collaborated closely with cross-functional teams to gather requirements, provide technical guidance, and troubleshoot cloud infrastructure issues. * Conducted regular security assessments, implemented robust security controls, and ensured compliance with industry standards to protect cloud environments against threats. * Contributed to the development of disaster recovery and business continuity strategies, ensuring the resilience of cloud-based systems. * Managed highly available monitoring infrastructure, leveraging Nagios, New Relic and Dynatrace to monitor diverse application servers and their components. * Acted as the primary point of contact for on-call support, effectively resolving incidents and minimizing service disruptions. * Expertise in creating and managing user accounts, groups, and disk spaces within a Linux environment. * Offered Linux training and consulting, guiding teams to build scalable, supportable, and cost-effective solutions. Mentored technical staff to enhance their skills and professional development. * Implemented and managed centralized log management and analysis tools like Splunk and ELK Stack for comprehensive log monitoring, analysis, and troubleshooting. * Integrated automated testing frameworks like Selenium with CI/CD pipelines to ensure continuous testing and quality assurance of cloud-based applications.

Linux Administrator

Solon International LLC
December 2012 - May 2014
  • Responsibilities: * Performed routine system administration activities, including installing the operating system using Kickstart from Standard Build, patching, installing applications and upgrading hardware, as well as troubleshooting and problem-solving. * Setup and maintained Nagios for monitoring servers and application. * Setup yum repository in the server and upgrade by installing necessary software packages and patches. * Installed and maintained servers for the domain name system. * Used swap and logical volume manager to build, modify, and destroy file systems. * Improved system and application performance by employing tools and techniques including shutdown, reboot, start and stop to find performance problems and fix them. * Performed routine Linux administration duties including file system sharing, user account management, directory services, and permissions. * Loaded and unloaded modules in accordance with the Kernel's specifications and made them persistent during boot by performing the necessary adjustments. Page

Education

JNTU

M. Tech

JNTU

B. Tech
January 2006 - January 2010