Siva Karthick Patakam
Development
Georgia , United States
Skills
DevOps
About
Siva Karthick Patakam's skills align with Consultants and Specialists (Information and Communication Technology). Siva also has skills associated with Programmers (Information and Communication Technology). Siva Karthick Patakam has 8 years of work experience.
View more
Work Experience
Sr Cloud DevOps Specialist
Apple
July 2022 - January 2024
- Responsibilities: Responsible for Implementation, design, architecture, and support of cloud-based solutions across multiple platforms Infrastructure (ensuring system availability, performances, capacity, and continuity through proper response to incidents, events, and problems). Play a key role in supporting deployments in nomad and maintain infrastructure in legacy environment (private Cloud) • Troubleshoot network related issues, LB Issues and fixing the solutions in Legacy Environment. • Maintaining the Orchestrators for Nomad, Consul, HAProxy and Compute Hosts. Create and maintain highly scalable and fault tolerant multi-tier AWS environments spanning across multiple availability zones using Terraform and CloudFormation. • Maintain infrastructure for Prod, QA and Dev as per the business needs. • Worked closely with DC, Dev, Ops and App Teams for maintaining the Application development and operational functions. • Services discovery via Consul and performing health checks. • Planning the resource constraints for any future Requirements. Written Templates for AWS Infrastructure as code using Terraform to build staging and production environments. Integrated Splunk for monitoring the log files, store them and track metrics and used Terraform as a tool, Managed different infrastructure resources Cloud, VMware, and Docker containers. Automating the infra provisioning on AWS using the Terraform code. Well versed in writing the terraform modules and maintaining the terraform state files in line with the existing Architecture. Worked on Kubernetes for creating new Projects, Services for load balancing and adding them to Routes to be accessible from outside, Creation of Pods through new application and control the scaling of pods, troubleshooting pods through ssh and logs, writing/modification of Build configs, templates, Image streams etc. Managed Kubernetes charts using Helm, Created reproducible builds of the Kubernetes applications, managed Kubernetes manifest files and Managed releases of Helm packages. Developed and Implemented Kubernetes manifests, helm charts for deployment of microservices into k8s clusters. Day-Day application support on production and technical documentation for critical production issues. • Involved in multi data center architectural designing. • Cost optimization activities by analysis the application load and requirement. • Fixing the Nomad deployment failure issues and troubleshooting the HAProxy Load Balancer application health check issues. • Monitoring the resources hardware, kernel, CPU, Mem, Storage, networks and application traffic with help of Grafana, EPIC, Splunk, Datadog and Overload Tools. • Helping out new joiners with process and tools KT's. Built Python scripts for installing the security complainer Agents in 100's of servers. • Creating Docs for every new issue and its fixes and Involved in Management Project discussions. • Build Shell and python Automations for Adhoc requests. Supported in on calls for project needs. Created Datadog Dashboard for infrastructure Resource monitoring and Visualization and configured Alerts for failure of any requests in HAProxy logs, Resource high Utilization of CPU/Memory. Created and managed DNS for Orchestrators via Shield and Akamai. Worked on Automations through building python scripts for downloading the certificates from whisper before the application get started during deployment. Troubleshoot the Rio Pipelines failure and help App teams to deploy applications. Worked in GitHub for Code Commit and branching strategy. Created an custom application for testing the infrastructure performance and services working condition. Setup of infrastructure core service like docker, nomad, consul and HAProxy are managed by puppet. Infrastructure segregation has been made via Range feature for all (Data Centers) DC's. Writing Python, Shell scripts for daily tasks, documenting the changes that happen in the environment, and each server, analyzing the error logs, user logs. Replaced the use of CloudFormation with Terraform to not be fully locked into a single cloud provider. Provide Tier 3 support: low-level and systemic troubleshoot and debug of single and distributed systems. Automated each manual task by python Scripts. Migrated the Application servers from on primes to Cloud. Build Terraform modules for the migrated instance to provision in multiple AWS Cloud Organization based on the application Environment: AWS, Private Cloud, GIT, Unix/Linux, Rio, Jfrog Artifactory, Docker, Kubernetes, Nomad, Prometheus, Grafana, puppet, Shell, Ansible, Python, Terraform, EKS.
System Administrator
Cloud SRE
November 2019 - July 2022
- From: DXC Technology, Client: HPI Responsibilities: Infrastructure supporting Amazon Web Services (AWS) and Azure Cloud. • Applying patches for the Windows servers through SCCM. • Worked on WAF Configurations to set the application firewall and analyses the blockages. • Troubleshooting the Domain join issues for computers which are needed to join to AD Domain. • Troubleshoot network related issues, LB Issues and fixing the solutions in AWS Environment. • Worked on Workspaces which will help to connect to the client network securely. • Worked on EFS (shared folder) for Linux servers and integrated with Data sync for replicating the date from primary to secondary region. • Involved in migration activity. (Migrating servers from on-primes to AWS cloud with the help of Cloud Endure) • Hands on experience on Linux (Redhat, Amazon Linux) Platforms and windows (windows2012 R2, windows 2016, windows 2019) Created reproducible builds of the Kubernetes applications, managed Kubernetes manifest files and managed releases of Helm packages. Scheduled, deployed and managed Container replicas onto a node cluster using Kubernetes. Used Ansible and Ansible Tower as a Configuration Management tool, to automate repetitive tasks, quickly deploy critical applications, and automate the build/Configuration of new Servers. Writing Python, Shell scripts for daily tasks, documenting the changes that happen in the environment, and each server, analyzing the error logs, user logs. Drive and execute biweekly releases to different environments using Jenkins/Azure DevOps deployment pipelines. • Involved in Cost Optimization activity and written python scripts for analyzing the resources (EC2/Snapshot/Volumes) which do not contain cost tags and placed the script in Lambda to automate it. • Worked on Automations to install the software in hundreds of servers in different environments and Accounts. Written Templates for AWS Infrastructure as code using Terraform to build staging and production environments. Integrated Splunk for monitoring the log files, store them and track metrics and used Terraform as a tool, Managed different infrastructure resources Cloud, VMware, and Docker containers. Automating the infra provisioning on AWS using the Terraform code. Well versed in writing the terraform modules and maintaining the terraform state files in line with the existing Architecture. • Involved in Disaster recovery implementations for high availability of applications. • Written a volume backup script which can help to take backup On Demand based. • Developed a python Script to notify app team for Log4j Vulnerability. • Experience in handling Logical volumes in Linux servers. Performed Rescue Operation when Boot failure issues. • Worked on Service now ticketing tool. Achieved TTO and TTF SLA's without missing and Addressed issues on time. • Trained people who joined the team with clear Process understanding and technical concepts. • Patching servers for Both AWS and Azure cloud with help of System Manager service from AWS. • Worked on Athena for checking the blockages in WAF which were stored in S3 bucket. • Attended management standup calls and represent the Team achievements • Participation in project meetings with other technical staff and business owners and subject matter experts. • Identification of root causes of critical problems throughout the platform, incident reports and the organization of the engineering team members in steps to resolve issues. Monitoring the Cloud Resource via Cloud watch and configure the Alerts based on the threshold values. Visualizing the Cloud Resources via PowerBI Dashboards. Implemented the step function for installation of CloudWatch agent via System Manager. Provide Tier 3 support: low-level and systemic troubleshoot and debug of single and distributed systems. Day-Day application support on production and technical documentation for critical production issues. Used agile methodology and have experience in Creating and Maintaining user/tasks in JIRA to measure feature progression in PI commitment. Build Terraform modules for the migrated instance to provision in multiple AWS Cloud Organization based on the application. Replaced the use of CloudFormation with Terraform to not be fully locked into a single cloud provider. Worked on step function to schedule the job in sequence, Configure Athena with data source s3 where WAF Logs have been stored. Written Python scripts to update the application team about the server compliance status via email. Environment: AWS, AZURE, Cloud Endure, Active Directory, SCCM, Ansible, Python, PowerShell, Cloud Formation, ARM templates, DFS, shell script, terraform, Kubernetes.
Associate System Administrator
November 2017 - November 2019
- From SourceOne Responsibilities: Worked as a DevOps Engineer for a team that involves different development teams and multiple simultaneous software releases. Develop build and release pipelines for various projects which include branch cut, build, and deployment. Configure and upgrade various tools used in the CI/CD process such as Maven, Subversion, Jenkins, Artifactory, Nexus, Sonar, Fortify, Bitbucket server. Work with Release Management and Change Management to contribute and review implementation plan template. Developed dashboards to monitor the health check of the servers. Build different types of servers using AWS: Importing volumes, launching EC2, creating security groups, Auto-Scaling, Elastic Load Balancers in the defined Virtual Private Cloud. Designed AWS Cloud Formation Templates to automatically provision AWS resources such as VPC, Subnets, EC2, S3, IAM, EBS, Security Group, Auto Scaling, and RDS to ensure successful deployment of Web applications and database templates. Implementing deployment strategies Blue-Green Deployment and Rolling Deployment. Configuring Vault and Consul for Secrets management, encrypting the certificates, username, and passwords of an application by assigning token made by API call. Started as a newly built startup size 4-member team. To continue seamless development and deployment of applications within no time, as a go to solution utilized Bank's prebuilt Jenkins Docker Image Deployed health monitor tools such as consul and check the status of other nodes. Develop, deploy and manage the Docker containers using Kubernetes container application platform to automate the provisioning, management, and scaling the applications. Incident resolution working closely with Development teams, analyzing logs, performance, and supporting the application team or tuning the platform. Troubleshooting OpenShift router operation, analyzing stats with different projects to determine the bottleneck. Utilized Configuration Management Tool Chef and created Chef Cookbooks using recipes to automate system operations. Created Docker images using Docker file from scratch, worked on Docker container snapshots, removing images, and managing Docker volumes customized and modified base images from existing environment configurations and maintaining the image repo for development teams. Experience using Kubernetes for container orchestration that runs Docker containerized applications in a cluster of EC2 instances in Linux Environment. Created and automated the Jenkins pipeline using a pipeline groovy script for the applications. Deployed application using Jenkins server and troubleshoot build and release job failures, resolve, work with engineers on the resolution. Wrote Shell and Python scripts for automating the build and deployment process. Worked on User administration setup, maintaining an account, and monitoring system performance using ELK and Prometheus. Environment: AWS, GIT, SVN, Chef, Ansible, Unix/Linux, Docker, Kubernetes, Jenkins, Maven, Tomcat, Prometheus, ELK, Vault, Consul, Terraform, CloudFormation, Nexus, Vagrant.
System Engineer
January 2016 - November 2017
- From SourceOne Responsibilities: Migrated an existing legacy infrastructure and recreated the entire environment within Amazon's Web Services (AWS). Migrated LINUX Logical Volumes (LVM) from one server to another server using AWS snapshots. Created the AWS Infrastructure using VPC, EC2, S3, Route 53, EBS, Security Group, Auto Scaling, and RDS in Cloud Formation. Managed multiple AWS accounts with multiple VPC's for both production and non-prod where primary objectives included automation build-out, integration, and cost control. Automated the code deployment process in each environment (Development, QA, Staging, Production, and DR). Creating fully automated CI/CD build and deployment infrastructure and processes for multiple projects using Jenkins, Docker, Ansible, and GitHub. Monitored developed applications and fixed bugs. Prepared release documents and SCM plans as part of the process. Co-ordinate with all teams DEV, QA, Change Management, Architects, Stake Holders as part of Releases. Created and wrote shell scripts and Python for setting up baselines, branching, merging, and automation processes across the environments using SCM tools like GIT, Subversion (SVN), Stash, and TFS on Linux and Windows. Environment: AWS (EC2, VPC, ELB, S3, RDS, Cloud Trail and Route 53), VDI, RedHat Linux, Git version Control, VPC, Jenkins, Python, Nexus, Maven, AWS EC2, S3, Route53, EBS, IAM, ELB, Cloud watch, VMware, Cloud Formation, AWS CLI, AWS Auto Scaling.