Acerca de

DevOps Engineer

Maintain and improve CI/CD automation processes to enable faster turnaround time for developers.
Maintain and improve network topology for performance and for cost.
Maintain and improve observability infrastructure to provide optimization windows and bug breakdowns for the engineers.
Handle on-call duties to conduct disaster recovery and write post-mortem reports.

Minimum 3 years of experience in software development or DevOps, with a focus on infrastructure management and automation.
Proficiency in Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE) for managing cloud infrastructure and containerized workloads.
Strong expertise in building and maintaining CI/CD pipelines using GitHub Actions to accelerate software delivery.
Hands-on experience with Docker and Kubernetes for container orchestration and automation.
Ability to collaborate with developers to understand their needs and implement solutions that enhance productivity and delivery speed.
Proactive problem-solving skills, with a balance of independent work and timely collaboration, seeking help when needed without waiting for issues to escalate.
Experience managing and troubleshooting infrastructure components, such as Nginx, to support application delivery.
Experience designing and maintaining fault-tolerant infrastructure to minimize downtime during failures.
Ability to implement and test disaster recovery plans, including backup and restore processes, to ensure business continuity.
Proficiency in troubleshooting and resolving production incidents under time pressure to restore services quickly.
Ability to design and implement stable on-call infrastructure, including alerting systems and escalation protocols, to ensure rapid incident response.

Familiarity with infrastructure-as-code tools like Terraform.
Prior contributions to observability initiatives, such as setting up monitoring and logging solutions.
Experience optimizing system performance or reducing downtime in high-traffic environments.
Experience with GitOps tools such as ArgoCD.
Strong communication skills to mentor or bridge gaps between development and operations teams.
Experience conducting post-incident reviews to identify root causes and implement preventive measures.
Familiarity with chaos engineering practices to proactively test system resilience.
Experience integrating observability tools (e.g., Prometheus, Grafana) with on-call systems to enhance incident detection and resolution.

NTD 1,100,000 – 2,000,000 per year

Please attach your resume to the email and include your name, contact number, and the best time for us to contact you.Thank you for your application!