Job brief
We are seeking a highly skilled Operations Engineer to join our team. The ideal candidate should have a deep understanding of operations, automation, and system administration.
They should also be able to collaborate effectively with cross-functional teams, communicate effectively, and have a passion for continuous improvement.
Responsibilities
- Design, implement, and maintain efficient and scalable systems for production and test environments.
- Collaborate with cross-functional teams to identify and solve operational issues.
- Troubleshoot and resolve complex infrastructure and application issues.
- Participate in the creation and implementation of operational policies and procedures.
- Manage and maintain monitoring, logging, and alerting systems.
- Implement automation and tooling to increase efficiency and reduce manual processes.
- Participate in on-call rotation for 24/7 support.
Requirements
- Bachelor's degree in Computer Science or related field
- At least 3 years of experience in Operations Engineering, Site Reliability Engineering, or similar roles
- Solid understanding of Linux systems administration and networking
- Proficiency with at least one programming language (e.g., Python, Ruby, Go)
- Experience with cloud computing platforms (e.g., AWS, Azure, GCP)
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
- Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)
- Strong analytical and problem-solving skills
- Strong communication and collaboration skills
- Comfortable working in a fast-paced, dynamic environment
- Willingness to participate in on-call rotation for 24/7 support