

Hello! I am Andrii Lytvynenko
Senior DevOps & Cloud Engineer
AWS & Kubernetes Certified | B2B Contractor
About Me
I'm a Senior DevOps & Cloud Engineer with 5+ years of experience designing, implementing, and optimizing cloud infrastructure for enterprise clients. I specialize in building scalable, secure, and cost-efficient solutions using AWS, Kubernetes, and Infrastructure as Code practices.
My experience spans working with Fortune 500 companies like JP Morgan Chase and Mercedes-Benz, where I've led critical infrastructure migrations, reduced cloud costs by up to 50%, and architected high-availability systems handling hundreds of thousands of requests per second.
I'm passionate about automation, continuous improvement, and mentoring teams. Whether it's a complex Terraform migration or implementing GenAI-powered monitoring solutions, I focus on delivering measurable business value.
Key Highlights
Trust & Compliance
Technical Skills
Cloud Platforms
Infrastructure as Code
Containers & Orchestration
CI/CD & Automation
Monitoring & Observability
Programming & Scripting
Certifications


AWS Certified AI Practitioner
Verify
AWS Cloud Practitioner
Verify
Certified Kubernetes Administrator
Verify
AWS Solutions Architect Associate
Verify
AWS Certified SysOps Administrator - Associate
Target: 2026
Languages
Experience
Senior DevOps Engineer
2025 November - Present
- Focused on web projects using AWS services including ECS, EC2, Lambda, API Gateway, RDS, ElastiCache, CloudFront, Route53, SSM Parameter Store, and Secrets Manager, managing multiple AWS accounts using Terraform code.
- Optimized cloud costs using AWS Cost Explorer and implementing savings plans to reduce infrastructure expenses.
- Created monitoring infrastructure from scratch using Amazon Managed Grafana and Amazon Managed Prometheus integrated with CloudWatch.
- Implemented custom monitoring solutions using node_exporter, windows_exporter, and sql_exporter with OpenTelemetry for Linux and Windows instances.
- Deployed monitoring for MSSQL instances, ensuring comprehensive database observability and performance tracking.
Senior/Lead DevOps Engineer
2025 August - 2025 October (Contract successfully completed)
- Delivered and maintained cloud infrastructure using Terraform in multiple dev environments, pre-prod, and prod environments with automatic scaling and serverless services, featuring multi-AZ deployments and multi-region S3 and RDS replications (active-passive) using AWS services (ECS, MWAA, and Lambda).
- Implemented monitoring using Dynatrace, Datadog, and CloudWatch, including investigation of Lambda errors and resource utilization.
- Developed CI/CD pipelines with Jules (a custom Jenkins) and Spinnaker integrated with Terraform Enterprise, covering build, test, and deploy stages involving Java (Vanilla Java, Spring Boot) and Docker.
- Contributed to cloud architecture planning and continuous improvement of platform design.
- Collaborated with cross-functional international teams including DevOps, QA, Developers, Architects, and Analysts.
DevOps Engineer
2023 February - 2025 September
- Led urgent and time-limited Terraform Enterprise migration from Terragrunt, completing it in 30% of the planned time (1 month instead of 3 months) with half the team size (3-5 instead of 10 engineers); provided daily reports to stakeholders on development status, performance tracking, and management via team calls, Jira comments, and Confluence pages.
- Led network architecture redesign to eliminate Nginx network bottlenecks, applying high availability and reliability principles using AWS services (ALB, NLB, EC2, Route53, EKS, CloudFront, VPC, EventBridge), reworking legacy code, reducing incident frequency by 70%, and enabling the handling of high traffic peaks with hundreds of thousands of requests per second.
- Led documentation initiatives to improve transparency for clients and newly onboarded staff, creating complete infrastructure and testing documentation and knowledge-sharing procedures, resulting in improved team efficiency and engagement.
- Led urgent production Kubernetes cluster restoration during weekends following an incident, restoring from automated backups, implementing countermeasures, and documenting the event.
- Regularly mentored a team of 10 members, providing one-to-one support sessions.
- Implemented and maintained IAM policies, secrets management, and conducted regular security audits within the AWS environment.
- Designed and implemented Gerrit High Availability Cluster architecture using Corosync, Pacemaker, and distributed storage (GFS2, DLM, LVM, lvmlockd) with automation, eliminating downtime during maintenance operations.
- Optimized Gerrit infrastructure by migrating from a single large instance to a 3-node cluster, reducing EC2 costs by 40% (saving approximately $3,000 per month) and improving reliability.
- Designed and implemented event streaming between multiple Gerrit instances using Kinesis, Kafka, and Zookeeper to handle peak traffic loads.
- Delivered a comprehensive monitoring solution for Gerrit HA components (instances, metrics, ALB/NLB, EBS, RDS) using Prometheus and Grafana.
- Implemented automated disaster recovery and backup/restoration strategies for Gerrit HA using EBS snapshots.
- Delivered client-facing technical and architectural presentations to audiences ranging from 10 to 100 stakeholders.
- Facilitated team-level technical discussions and solution brainstorming sessions across international teams, leading to performance optimizations and preemptive resolution of critical issues.
- Conducted FinOps initiatives and analyses, reducing test environment costs by 50% and production costs by 20% without performance degradation, saving over $5,000 monthly using AWS Cost Explorer and Cost Optimizer.
- Maintained and implemented Jira automation jobs, streamlining release procedures with Python.
- Implemented build, test, and release pipelines using Java, Groovy, Python, AWS SDK, and Jenkins.
- Developed monitoring and observability solutions: created dashboards with Grafana visualizations and alerts, prepared database queries in SQL and Flux for InfluxDB, and implemented Kubernetes and EC2 monitoring with Prometheus; automated incident ticket creation using Grafana, AlertManager, and Jira Alert.
- Performed regular upgrades and maintenance for Jenkins, Gerrit, Kafka, Zookeeper, Terragrunt, Terraform, Terraform Enterprise, InfluxDB, Grafana, Prometheus, Kubernetes, Ubuntu, HAProxy, and Nginx.
- Developed Single Sign-On (SSO) plugins for Gerrit in Java, enabling unified authentication across enterprise systems.
- Implemented a GenAI-powered log analyzer using AWS Bedrock (OpenAI GPT-4.0), reducing incident investigation time by 60% through automated pattern recognition and root cause analysis.
DevOps Engineer
2021 January - 2023 January
Full-Stack Web Developer
2020 April - 2021 June
System Administrator
2019 January - 2020 February
Education
Computer Engineering: Computer Networks and Systems
Graduated2021 September - 2025 August
Cloud & DevOps Preparation Program
Graduated2022 June - 2023 March