Senior DevOps Engineer
Our client is a leading player in IoT solutions with a decade-long presence in the telematics domain. They offer a diverse range of cutting-edge solutions tailored to optimize operations. Trusted by over 2000 entities in Saudi Arabia. With plans to further grow their Tunis office and become a top software company in the region, our client prioritizes competence, customer-centricity, and innovation in delivering their solutions.
Responsibilities
• Infrastructure Management: Design, build, and maintain scalable, highly available, and reliable infrastructure using Kubernetes, Postgres, and Cassandra.
• CI/CD Pipelines: Develop, implement, and maintain continuous integration and continuous delivery pipelines for efficient software deployment.
• Kubernetes Orchestration: Manage Kubernetes clusters, troubleshoot issues, and ensure optimal performance and security across environments.
• Database Management: Optimize, manage, and maintain Postgres and Cassandra databases, ensuring high availability, performance tuning, and backup/restore processes.
• Monitoring & Logging: Set up monitoring, logging, and alerting systems to ensure the health and performance of the infrastructure using tools like Prometheus, Grafana, or the ELK Stack.
• Automation: Automate infrastructure provisioning, configuration, and deployments using tools like Ansible, Terraform, or Helm.
• Collaboration: Work closely with software engineers, QA teams, and product managers to ensure smooth development and release cycles.
• Security: Implement security best practices and ensure compliance across all services and infrastructure.
Qualifications
• 5-8 years of experience in a DevOps or Site Reliability Engineering (SRE) role.
• Strong experience in Kubernetes orchestration, including deployment, scaling, and monitoring of applications.
• Proven expertise in setting up and managing Cassandra clusters, with knowledge of high availability and performance optimization.
• Experience with cloud platforms like Alibaba Cloud and DEEM Cloud is a plus.
• Experience with infrastructure-as-code tools like Ansible or Helm.
• Familiarity with CI/CD tools such as Jenkins or GitLab CI.
• Proficiency with Git and branching strategies.
• Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
• Strong troubleshooting and problem-solving skills.