a leading AI deep-tech company, is looking for a hands-on IT Infrastructure Manager to lead, build, and maintain the infrastructure powering our cutting-edge AI systems.
This is a working manager role: you'll actively shape and operate both our IT and infrastructure environments while owning the strategic direction and execution of infrastructure projects. Youll collaborate with R&D, security, and product teams to ensure stable, secure, and scalable systems across our on-premise and cloud-based environments.
This is a full-time, on-site position based in Tel Aviv. Presence at the office is required five days a week.
Why Join Us?
This role combines the challenge of building high-performance environments with the opportunity to influence infrastructure strategy in a fast-paced, AI-driven company. Youll work alongside some of the brightest minds in the industry, tackling critical technical challenges in real-time.
Responsibilities
Lead and execute IT infrastructure initiatives across on-prem and cloud environments.
Design, deploy, and maintain scalable systems to support AI training, inference, and data workflows.
Develop and manage CI/CD pipelines (Jenkins, GitHub Actions).
Orchestrate data workflows and ML operations using tools such as Airflow and ClearML.
Administered and troubleshooted Linux-based environments (Ubuntu, RedHat).
Implement containerized infrastructure using Docker and Kubernetes.
Own infrastructure as code (Terraform, Ansible, Helm) and drive automation efforts.
Ensure robust system monitoring and logging using Prometheus, Grafana, Graylog, or similar.
Enforce cybersecurity and privacy standards (ISO27001 or equivalent).
Collaborate cross-functionally with R&D, Product, Support, and Security teams.
Manage vendors, procurement, and IT asset documentation as needed.
Requirements: 6+ years of experience in IT, infrastructure, or DevOps-related roles.
Proven hands-on experience with:
Linux administration (Ubuntu preferred), scripting (Python, Bash).
CI/CD tools: Jenkins, GitHub CI.
Kubernetes, Docker, and container orchestration.
Infrastructure as Code: Terraform, Ansible, Helm.
Networking: TCP/IP, DNS, DHCP, VPN, SSH.
Strong understanding of security principles and familiarity with compliance frameworks (ISO27001).
Experience with monitoring, logging, and troubleshooting production environments.
Excellent organizational and communication skills, with a proactive, ownership-driven mindset.
This position is open to all candidates.