Overview
ibex. is seeking a Multi-Cloud DevOps Engineer to join our growing team. This pivotal role offers the chance to leverage cutting-edge technologies, champion automation, and significantly enhance infrastructure resilience across diverse cloud environments. If you're a proactive problem-solver with a passion for optimizing complex systems and deploying robust solutions across multiple cloud providers, we'd love to connect with you!
Responsibilities
- Continuous Integration & Deployment (CI/CD): Design, implement, and continuously improve CI/CD pipelines tailored for multi-cloud and Kubernetes-centric platforms. This includes building and maintaining Docker container images and managing all related dependencies.
- Multi-Cloud Environment Management: Configure, automate, and maintain critical environments across various cloud infrastructures, including AWS, Azure, Google Cloud Platform (GCP), on-premises Kubernetes clusters, VMware, and containerized setups. A strong focus on process automation is key to boosting efficiency and consistency.
- Distributed System Management: Deploy and manage large-scale Kubernetes clusters (e.g., Azure AKS, AWS EKS, Google GKE), MySQL clusters, storage solutions (e.g., OpenEBS), ELK stacks, and Redis, ensuring high availability and resilience across cloud boundaries.
- Automation & Configuration: Utilize powerful tools like Terraform, Ansible, and Puppet to automate environment provisioning and streamline configuration management across our diverse multi-cloud infrastructure.
- Application Modernization: Collaborate closely with development teams to facilitate the smooth transition of legacy monolithic applications to microservices architectures, leveraging both on-premises and public cloud platforms (Azure, AWS, GCP).
- Monitoring & Observability: Develop comprehensive dashboards and integrate various components to enhance visibility into multi-cloud infrastructure and application performance.
- Operational Efficiency: Implement automated solutions for repetitive operational tasks using APIs, CLI tools, and custom scripting, thereby improving efficiency and minimizing manual effort across all platforms.
Qualifications
- This role requires 5 to 7 years of relevant experience in DevOps, Site Reliability Engineering, or a similar capacity with a significant focus on multi-cloud solutions.
- A bachelor's degree in a computer-related field or similar experience is preferred.
- Cloud Platforms: Extensive practical experience with production workloads on at least two of the leading public cloud providers: Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP).
- Container Orchestration: Expertise with Kubernetes (on-premises or managed services like Azure AKS, AWS EKS, Google GKE) and Docker.
- Configuration Management & IaC: Skilled in using tools such as Ansible and particularly Terraform for provisioning and managing infrastructure as code across multi-cloud environments.
- Monitoring & Logging: Experience with industry-standard monitoring and logging tools including Grafana, Prometheus, and the ELK stack.
- CI/CD Tools: Hands-on experience building container images (e.g., writing Dockerfiles) and working with leading CI/CD tools such as GitLab CI/CD or Jenkins.
- Source Control: Proficient in Git for version control.
- Scripting: Strong proficiency in scripting languages including Bash and Python.
- Operating Systems: Demonstrated proficiency in Unix-like environments (e.g., Red Hat, Ubuntu).
- Virtualization & Containerization: Practical experience with VMware and Docker containers.
- API Interaction: Proven ability to interact with REST APIs for automation and integration.
Nice-to-Have Skills
- Service Mesh: Experience deploying and operating service meshes (e.g., Istio, App Mesh, Anthos) in multi-cloud Kubernetes environments.
- Cloud Cost Management: Understanding of cloud spending optimization across multiple providers (e.g., billing models, reserved instances, basic FinOps concepts).
- Advanced Observability: Experience with distributed tracing tools (e.g., Jaeger, OpenTelemetry) or more advanced logging aggregation strategies.
- Cloud Networking Fundamentals: Basic understanding of multi-cloud networking principles (e.g., VPNs, peering, network security groups).
- Cloud Security Fundamentals: Familiarity with basic security best practices across cloud platforms (e.g., IAM, security groups).
- Distributed Systems Components: Experience with MySQL clusters, Redis, or similar distributed storage solutions.
- PowerShell Scripting: Additional proficiency in PowerShell.
- VMware CLI/API: Experience with VMware CLI or API for automation.