1,067 Infrastructure jobs in South Africa
Infrastructure Engineer – Cloud Infrastructure
Posted today
Job Viewed
Job Description
Position Summary
Execute tasks related to MagicOrange's IT infrastructure, assist in the operational layer of our SaaS platform estate, and ensure security governance. Provide operational resilience, secure cloud usage, and consistent IT support across a globally distributed, hybrid workforce.
Key Responsibilities- Cloud Infrastructure & SaaS Operations
Oversee and optimize the suite of SaaS platforms and operational tools used across the business to ensure seamless day-to-day functionality, efficiency, and support for all users. - End-user Computing & Security Operations
Maintain end-user computing, network operations, and endpoint protection to ensure operational continuity and user satisfaction. - Automation & Remote Support
Implement tooling, automation, and remote support models for globally distributed and hybrid workforces. - Identity & Access Management Administration
Administer Microsoft Entra ID, including provisioning, monitoring, and performance optimization.
- Azure Governance & Compliance
Configure and maintain Azure Policy, Blueprints, and Compliance Manager to align with ISO 27001, GDPR, POPIA, SOC II, and other regulatory frameworks. - Auditing & Reporting
Create dashboards and evidence packs for auditors, executives, and clients, automating compliance reporting and remediation tasks.
- ISMS & Security Framework Ownership
Collaborate with the CISO Office on maintaining and enhancing the ISO 27001 ISMS, including contributing technical input for the Statement of Applicability and participating in internal/external audits as needed. - Compliance Alignment
Ensure IT infrastructure and cloud tooling align with the compliance goals set for ISO 27001, SOC 2, POPIA, and GDPR. - Risk Assessments
Support the CISO in conducting risk assessments related to infrastructure and SaaS operations and contribute to the IT-specific entries in the centralized risk register.
- Identity & Access Management
Implement and maintain identity services and access controls (e.g., RBAC, MFA, PIM) in accordance with IAM policies and governance. - Least-Privilege & Compliance
Enforce least-privilege access via automated policy enforcement and periodic reviews.
- SaaS Licensing & Software Asset Management
Manage Microsoft EA/CSP and other enterprise SaaS subscriptions. Develop and maintain a licensing compliance framework and optimize cost forecasting.
- Incident Response & Resilience
Execute incident response procedures under the direction of the CISO Office, coordinate with partners on infrastructure-related aspects, and implement technical corrective actions post-incident. Conduct post-incident investigations and implement corrective actions.
- People & Vendor Management
Collaborate with MSPs, security partners, and SaaS vendors. Ensure cost-efficiency across infrastructure and tooling.
- Cross-Functional Collaboration
Work with Product, Finance, HR, and Engineering teams to align IT operations with strategic business goals. Present technical KPIs, risks, and compliance status to non-technical executives and client stakeholders.
- IT Operations & Support
Handle issues with screens, docking stations, laptops, and printers, and provide necessary cords. Set up workstations, perform hardware upgrades (RAM, batteries), and manage cables. Manage the employee onboarding process, conduct access audits, and update general user information in systems like Microsoft Teams and email.
- 6–8 years in IT infrastructure, SaaS operations, or IT security.
- Diploma/Degree in IT.
- 3+ years working with Microsoft Azure environments.
- Proven experience with distributed teams, SaaS platforms, and remote support operations.
- Demonstrated involvement in ISO 27001, SOC 2, or GDPR/POPIA compliance programs.
- Strong knowledge of firewall/router technologies and network security concepts.
- Proficiency with administering Windows OS and Servers.
- Experience with Active Directory (Entra ID), DNS/DHCP.
- Microsoft Exchange, Office365, Microsoft Intune, Microsoft Defender365 experience is desired.
- Hands-on experience with monitoring and diagnostic tools.
- Experience supporting on-prem and Azure-hosted environments.
- Ability to troubleshoot and resolve complex infrastructure issues independently.
- Relevant IT certifications (e.g., CompTIA, Microsoft, Cisco).
- Familiarity with ITSM and ISMS platforms.
- Multi-cloud awareness (Azure, AWS, GCP).
Join us at MagicOrange and help shape the future of IT Financial Management and FinOps Software by ensuring our customers achieve the highest levels of satisfaction and success.
MagicOrange is an equal opportunity employer, committed to promoting diversity and inclusion in the workplace. We value and appreciate the diverse contributions and perspectives of all our employees.
#J-18808-LjbffrInfrastructure Engineer – Cloud Infrastructure
Posted today
Job Viewed
Job Description
MagicOrange is a globally recognized leader in the IT Financial Management Software market, as acknowledged by Gartner. With customers and a strong presence on four continents, we are a Software as a Service (SaaS) provider in a high-growth phase. Our mission is to empower individuals and organizations, enhancing their value through our innovative software solutions
Location: Durban - KwaZulu Natal, South Africa
Position Summary:
Execute tasks related to MagicOrange's IT infrastructure, assist in the operational layer of our SaaS platform estate, and ensure security governance. Provide operational resilience, secure cloud usage, and consistent IT support across a globally distributed, hybrid workforce.
Key Responsibilities
Cloud Infrastructure & SaaS Operations
- Oversee and optimize the suite of SaaS platforms and operational tools used across the business to ensure seamless day-to-day functionality, efficiency, and support for all users.
- Maintain end-user computing, network operations, and endpoint protection to ensure operational continuity and user satisfaction.
- Implement tooling, automation, and remote support models for globally distributed and hybrid workforces.
- Administer Microsoft Entra ID, including provisioning, monitoring, and performance optimization.
Azure Governance & Compliance
- Configure and maintain Azure Policy, Blueprints, and Compliance Manager to align with ISO 27001, GDPR, POPIA, SOC II, and other regulatory frameworks.
- Create dashboards and evidence packs for auditors, executives, and clients, automating compliance reporting and remediation tasks.
ISMS & Security Framework Ownership
- Collaborate with the CISO Office on maintaining and enhancing the ISO 27001 ISMS, including contributing technical input for the Statement of Applicability and participating in internal/external audits as needed.
- Ensure IT infrastructure and cloud tooling align with the compliance goals set for ISO 27001, SOC 2, POPIA, and GDPR.
- Support the CISO in conducting risk assessments related to infrastructure and SaaS operations and contribute to the IT-specific entries in the centralized risk register.
Identity & Access Management
- Implement and maintain identity services and access controls (e.g. RBAC, MFA, PIM) in accordance with IAM policies and governance.
- Enforce least-privilege access via automated policy enforcement and periodic reviews.
SaaS Licensing & Software Asset Management
- Manage Microsoft EA/CSP and other enterprise SaaS subscriptions.
- Develop and maintain a licensing compliance framework and optimize cost forecasting.
Incident Response & Resilience
- Execute incident response procedures under the direction of the CISO Office, coordinate with partners on infrastructure-related aspects, and implement technical corrective actions post-incident.
- Conduct post-incident investigations and implement corrective actions.
People & Vendor Management
- Collaborate with MSPs, security partners, and SaaS vendors.
- Ensure cost-efficiency across infrastructure and tooling.
Cross-Functional Collaboration
- Work with Product, Finance, HR, and Engineering teams to align IT operations with strategic business goals.
- Present technical KPIs, risks, and compliance status to non-technical executives and client stakeholders.
IT Operations & Support Responsibilities
- Handle issues with screens, docking stations, laptops, and printers, as well as provide necessary cords.
- Set up workstations, perform hardware upgrades (RAM, batteries), and manage cables.
- Manage the employee onboarding process, conduct access audits, and update general user information in systems like Microsoft Teams and email.
Required Skills & Experience
- 6–8 years in IT infrastructure, SaaS operations, or IT security.
- Diploma/Degree in IT.
- 3+ years working with Microsoft Azure environments.
- Proven experience with distributed teams, SaaS platforms, and remote support operations.
- Demonstrated involvement in ISO 27001, SOC 2, or GDPR/POPIA compliance programs.
- Strong knowledge of firewall/router technologies and network security concepts
- Proficiency with administrating Windows OS and Servers.
- Experience with Active Directory (Entra ID), DNS/DHCP
- Microsoft Exchange, Office365, Microsoft Intune, Microsoft Defender365 experience is desired.
- Hands-on experience with monitoring and diagnostic tools
- Experience supporting on-prem and Azure-hosted environments
- Ability to troubleshoot and resolve complex infrastructure issues independently
Preferred Qualifications
- Relevant IT certifications (e.g., CompTIA, Microsoft, Cisco).
- Familiarity with ITSM and ISMS platforms.
- Multi-cloud awareness (Azure, AWS, GCP).
Join us at MagicOrange and help shape the future of IT Financial Management and FinOps Software by ensuring our customers achieve the highest levels of satisfaction and success.
MagicOrange is an equal opportunity employer, committed to promoting diversity and inclusion in the workplace. We value and appreciate the diverse contributions and perspectives of all our employees.
Infrastructure Engineer – Cloud Infrastructure
Posted today
Job Viewed
Job Description
MagicOrange is a globally recognized leader in the IT Financial Management Software market, as acknowledged by Gartner. With customers and a strong presence on four continents, we are a Software as a Service (SaaS) provider in a high-growth phase. Our mission is to empower individuals and organizations, enhancing their value through our innovative software solutions
Location: Durban - KwaZulu Natal, South Africa
Position Summary:
Execute tasks related to MagicOrange's IT infrastructure, assist in the operational layer of our SaaS platform estate, and ensure security governance. Provide operational resilience, secure cloud usage, and consistent IT support across a globally distributed, hybrid workforce.
Key Responsibilities
Cloud Infrastructure & SaaS Operations
- Oversee and optimize the suite of SaaS platforms and operational tools used across the business to ensure seamless day-to-day functionality, efficiency, and support for all users.
- Maintain end-user computing, network operations, and endpoint protection to ensure operational continuity and user satisfaction.
- Implement tooling, automation, and remote support models for globally distributed and hybrid workforces.
- Administer Microsoft Entra ID, including provisioning, monitoring, and performance optimization.
Azure Governance & Compliance
- Configure and maintain Azure Policy, Blueprints, and Compliance Manager to align with ISO 27001, GDPR, POPIA, SOC II, and other regulatory frameworks.
- Create dashboards and evidence packs for auditors, executives, and clients, automating compliance reporting and remediation tasks.
ISMS & Security Framework Ownership
- Collaborate with the CISO Office on maintaining and enhancing the ISO 27001 ISMS, including contributing technical input for the Statement of Applicability and participating in internal/external audits as needed.
- Ensure IT infrastructure and cloud tooling align with the compliance goals set for ISO 27001, SOC 2, POPIA, and GDPR.
- Support the CISO in conducting risk assessments related to infrastructure and SaaS operations and contribute to the IT-specific entries in the centralized risk register.
Identity & Access Management
- Implement and maintain identity services and access controls (e.g. RBAC, MFA, PIM) in accordance with IAM policies and governance.
- Enforce least-privilege access via automated policy enforcement and periodic reviews.
SaaS Licensing & Software Asset Management
- Manage Microsoft EA/CSP and other enterprise SaaS subscriptions.
- Develop and maintain a licensing compliance framework and optimize cost forecasting.
Incident Response & Resilience
- Execute incident response procedures under the direction of the CISO Office, coordinate with partners on infrastructure-related aspects, and implement technical corrective actions post-incident.
- Conduct post-incident investigations and implement corrective actions.
People & Vendor Management
- Collaborate with MSPs, security partners, and SaaS vendors.
- Ensure cost-efficiency across infrastructure and tooling.
Cross-Functional Collaboration
- Work with Product, Finance, HR, and Engineering teams to align IT operations with strategic business goals.
- Present technical KPIs, risks, and compliance status to non-technical executives and client stakeholders.
IT Operations & Support Responsibilities
- Handle issues with screens, docking stations, laptops, and printers, as well as provide necessary cords.
- Set up workstations, perform hardware upgrades (RAM, batteries), and manage cables.
- Manage the employee onboarding process, conduct access audits, and update general user information in systems like Microsoft Teams and email.
Required Skills & Experience
- 6–8 years in IT infrastructure, SaaS operations, or IT security.
- Diploma/Degree in IT.
- 3+ years working with Microsoft Azure environments.
- Proven experience with distributed teams, SaaS platforms, and remote support operations.
- Demonstrated involvement in ISO 27001, SOC 2, or GDPR/POPIA compliance programs.
- Strong knowledge of firewall/router technologies and network security concepts
- Proficiency with administrating Windows OS and Servers.
- Experience with Active Directory (Entra ID), DNS/DHCP
- Microsoft Exchange, Office365, Microsoft Intune, Microsoft Defender365 experience is desired.
- Hands-on experience with monitoring and diagnostic tools
- Experience supporting on-prem and Azure-hosted environments
- Ability to troubleshoot and resolve complex infrastructure issues independently
Preferred Qualifications
- Relevant IT certifications (e.g., CompTIA, Microsoft, Cisco).
- Familiarity with ITSM and ISMS platforms.
- Multi-cloud awareness (Azure, AWS, GCP).
Join us at MagicOrange and help shape the future of IT Financial Management and FinOps Software by ensuring our customers achieve the highest levels of satisfaction and success.
MagicOrange is an equal opportunity employer, committed to promoting diversity and inclusion in the workplace. We value and appreciate the diverse contributions and perspectives of all our employees.
Infrastructure Engineer
Posted today
Job Viewed
Job Description
Our client is a global investment advisory firm focusing on long-term value creation through investment strategies. They work with a diverse group of institutional partners and pride themselves on a collaborative, sustainable, inclusive culture and performance.
What you will be doing- Manage a global, hybrid infrastructure on a daily basis to ensure optimal performance, security, and uptime.
- Maintain and run IT security systems and processes to protect sensitive data and minimize the risk of breaches.
- Continuously identify and implement best practices and technical improvements to optimize the infrastructure and enhance its security posture.
- Provide high-level (3rd/4th line) technical support for complex issues to both the Technology team and the wider business.
- Actively participate in critical technical processes like disaster recovery and penetration testing, addressing any issues that arise.
- Research, recommend, and implement new technologies to improve overall infrastructure and security.
- A relevant tertiary qualification would be beneficial
- Relevant IT certifications would also be ideal (e.g., Microsoft, CompTIA, ISC, etc.)
- 5+ years of hands-on experience in a system administration or infrastructure management role, preferably in the financial services industry.
- Possesses a broad and deep understanding of IT fundamentals.
- Hands-on experience in system implementation and providing high-level (3rd/4th line) technical support.
- Experience with the full Microsoft stack, including both on-premise and cloud environments like Azure and Office 365.
- Strong experience in multi-site networking, including firewalls, switching, TCP/IP, DNS, and DHCP.
- Experience managing both physical and virtual data center environments.
- A solid understanding of technical and data security concepts, including Microsoft Defender and identity management.
- Proven ability to effectively troubleshoot issues and propose solutions, working proactively and independently.
- Proficient in scripting for infrastructure administration, including languages like PowerShell, KQL, and Terraform.
- Willing to work outside of regular hours for maintenance or incidents, and possesses strong interpersonal and communication skills.
J
Seniority levelMid-Senior level
Employment typeFull-time
Job functionInformation Technology
IndustriesIT Services and IT Consulting
#J-18808-LjbffrInfrastructure Engineer
Posted 1 day ago
Job Viewed
Job Description
Overview
Direct message the job poster from The Career Network SA. If you’re passionate about robotics, automation, and building systems that stretch the limits of what’s possible, please read on. Join the newly built team in Cape Town! (company has been in existence 20+ years and currently HQ in the US)
What you’ll do- Design, deploy, and manage Kubernetes clusters in private cloud (Proxmox).
- Automate infrastructure with OpenTofu + GitLab CI/CD.
- Manage secrets securely with OpenBao.
- Monitor and troubleshoot with Prometheus/Grafana.
- Collaborate with Dev, DevOps, and Security teams, documenting best practices.
- 5+ years in infrastructure engineering, 3+ with Kubernetes.
- Strong skills with IaC (eg OpenTofu/Terraform) and GitLab CI/CD.
- Experience with secrets management (OpenBao) in production.
- Knowledge of Helm, ArgoCD/Flux, GitOps, and networking (Antrea, Contour).
- Bonus: Kubernetes certifications (CKA, CKAD).
A chance to work on cutting-edge global projects while rooted in South Africa, shaping infrastructure that keeps advanced robotics and automation running at scale.
#J-18808-LjbffrInfrastructure Engineer
Posted 13 days ago
Job Viewed
Job Description
Overview
The Infrastructure Engineer is responsible to help build out maintain and troubleshoot LabourNet's rapidly expanding infrastructure. You will be part of a talented team of engineers that demonstrate superb technical competency delivering mission-critical infrastructure and ensuring the highest levels of availability performance and security within the business.
Experience- Experience working in a complex infrastructure environment.
- Windows Deployment experience
- Experience of working in a best practice environment (ITIL) under strict change management processes
- Ability to understand log files and identify problems.
- Able to work under pressure and to challenging timescales.
- Able to isolate take ownership of and resolve performance problems.
- Ability to be self-sufficient and make independent decisions on problem resolution that align to departmental and functional strategy.
- Matric
- Diploma in IT related domain
- MCSE
- Networking certificate
- Azure Certification
- Security certification (Optional)
- Propensity to Own : The habit of taking ownership of a task. When faced with a challenge the individual steps up to such a challenge by working towards a positive outcome as a result of self-working on the activities.
- To Simplify : A strong tendency in breaking complex scenarios down to linear challenges that can easily be resolved. It can also be described as the habit of taking the easy route towards solving complex challenges.
- Responsiveness : The habit of acting immediately and quickly is important for success in this job.
- Frustration Handling : Frustration occurs when the individual is obstructed from reaching his / her goal. A strong habit in dealing with obstructing sources / interferences in such a way that positive actions towards successful results are taken is required.
- Routine : A well-defined habit towards structure and repetition sometimes even mundane activities is required. Strong behaviour in harmony with an environment of repetition and patterns of sameness needs to be present.
- Proficiency with network hardware and technologies.
- Proficiency with shared storage technologies.
- Proficiency with Microsoft operating systems.
- Precise attention to detail.
- Ability to prioritize tasks.
- Advanced organizational skills.
- Analytical skills.
Reports to Infrastructure Manager
Direct Reports : N / A
Works closely with : The Development Team, Employees
Core responsibilities- Managing configuring and monitoring all installed systems and infrastructure
- TomCat and VM application server management.
- Installing configuring testing and maintaining operating systems application software and system management tools
- Ensuring the highest levels of systems and infrastructure availability
- Ensure robust application infrastructure and network security with tight control over access management and effective tracking of changelogs.
- Assisting in incident resolution and root cause analysis
- Cloud management and monitoring (Azure beneficial).
- DevOps insights on best practices and process gaps.
- Maintaining the infrastructure
- Advise and specify preventive maintenance and administration tasks to monitor and alert patch and maintain and backup and recover key business systems.
- Supporting the infrastructure
- A proportion of time is spent providing 3rd line technical support through management of escalated technical issues through strong working relationships within Digital Services and with third party providers.
- Ad-hoc duties as required.
In return we offer you
- Culture : We pay attention to output rather than time spent and we offer a flexible working environment.
- Employer of choice : Our managers believe in putting their people first and are devoted to their growth and development hence we practice servant leadership (inverted pyramid).
- Flexibility : You must look after yourself; you cannot pour from an empty cup and we prefer work-life integration to work-life balance.
- Passion : Bring a sense of purpose to work and depart with a sense of success.
Gone are the days when you had to sit in traffic to go to work! Our hybrid working approach enables employees and their supervisors to agree on a working environment that promotes productivity while also allowing them to balance their professional and personal lives. We can now create an office environment wherever we are while sustaining virtual cooperation between our employees and teams thanks to technology.
Why should you join LabournetIts more than just a job with Labournet. Its a mission to make our clients compliant by doing meaningful work that focuses on cutting-edge customer-centric services and technology solutions. Every year you can help us by facilitating HR best practice by securing our companys ultimate compliance across the whole employee life cycle! In the end youve created a profession that no one could have predicted.
Who are weYour Strategic Partner in Human Resource Solutions
Our dedicated team of highly trained and educated consultants are passionate about taking workplace issues and translating them into HR Solutions that are accessible flexible affordable comprehensive and motivated by the goals and objectives of your business. Labournet strives to become an extension of your HR department to fulfill your compliance needs allowing you to keep a relentless focus on core activities.
The trusted compliance partner to employers solving their evolving compliance needs.
What do we doWe provide leading compliance solutions through-out South Africa that implement best practices within the businesses we partner with.
Remote Work : Employment Type :
Full-time
Key SkillsJenkins,Ruby,Python,Active Directory,Cloud,PowerShell,Windows,AWS,Linux,SAN,Java,Troubleshoot,Backup,Puppet,hardware
Experienceyears
Vacancy1
#J-18808-LjbffrInfrastructure Engineer
Posted 27 days ago
Job Viewed
Job Description
Managing the day-to-day IT operations 24/7, ensuring Client SLA’s are met with regards to services and delivery, and ensuring that the IT Helpdesk provides excellent customer service in supporting the business. Keeping the fault/request queue below the set maximum target and being a role model while setting an example for the desired standards of conduct, leadership, integrity, and professionalism at all times.
Duties & Responsibilities- Plan and implement moves, adds, changes, and deletions to support the IT infrastructure.
- Responsible for maintaining information on purchases for the assets registry.
- Implement network security at levels set by corporate standards.
- Anticipate difficulties or problems relating to IT and ensure that contingency plans are in place.
- Work with the IT team to implement and support internal IT systems and proactively address IT change requests from stakeholders.
- Interact constructively with internal clients at all levels to help resolve IT-related issues and provide timely answers through effective implementation of helpdesk and application software.
- Administer and maintain the company’s IT infrastructure and manage all telephone changes.
- Manage day-to-day internal and external client interactions.
- Oversee all helpdesk activities for the location.
- Respond to and resolve escalated Helpdesk issues.
- Manage day-to-day activities of the IT Team.
- Ensure that company IT assets are maintained according to company standards.
- Manage the administration and maintenance of computer stations and software for the company’s training programs and training facilities and provide proactive support.
- Manage troubleshooting, system backups, archiving, disaster recovery, and provide expert support while identifying opportunities for improvements.
- Provide the financial department with IT financial information and manage the purchasing of all software, hardware, and other IT supplies within budget constraints.
- Facilitate the motivation and development of the team by aligning project tasks with team members' career interests while attaining goals and giving feedback on performance.
- Manage IT suppliers and SLAs.
- Perform any other duties consistent with the position of ICT Team Leader.
- Identify opportunities for improvement and make constructive suggestions for change.
- Maintain information on purchases for the assets registry.
- Manage key supplier relationships, costing, and SLAs.
- Communicate with the Management Team and all other levels within the business.
- Provide regular updates to senior management during IT downtime.
- Play a vital role with clients, always being well-briefed and informative.
- Update as appropriate and maintain casual contacts for the good of the business.
- Attend and contribute to regular team meetings to discuss and resolve departmental issues and challenges.
- Remain at the forefront of emerging industry practices and continually investigate IT technologies.
- Identify opportunities for improvement and make constructive suggestions for change.
- Degree in a relevant discipline or equivalent relevant work experience.
- 5 years of experience in a senior/team lead IT role (preferably in the contact service centre industry: in-house centre within airline industry or international customer care outsourcing provider) advantageous.
- MCSE, CCNA certificate, and ITIL qualification required.
- Advanced knowledge of the following Systems: Windows 2003 and 2008 Server, Cisco Systems, VMware, Exchange 2003, Soft Grid 4.1 or later, System Centre Configuration Manager (SCCM).
- Advanced knowledge of Networking concepts: Switches, Routers, Hubs, Servers, Cables, Racks, Firewalls, LAN, WAN, TCP/IP, DNS, UDP, Latency, VoIP, QoS, MPLS.
- Web Technologies would be advantageous.
- Well-developed knowledge of Telephony applications and concepts.
- Well-developed knowledge of the Genesys Suite of products is an advantage.
Proven track record of organisational and planning skills is essential.
#J-18808-LjbffrBe The First To Know
About the latest Infrastructure Jobs in South Africa !
Infrastructure Engineer
Posted today
Job Viewed
Job Description
If you're passionate about robotics, automation, and building systems that stretch the limits of what's possible, please read on. Join the newly built team in Cape Town (company has been in existence 20+ years and currently HQ in the US)
What you'll do
- Design, deploy, and manage Kubernetes clusters in private cloud (Proxmox).
- Automate infrastructure with OpenTofu + GitLab CI/CD.
- Manage secrets securely with OpenBao.
- Monitor and troubleshoot with Prometheus/Grafana.
- Collaborate with Dev, DevOps, and Security teams, documenting best practices.
What you bring
- 5+ years in infrastructure engineering, 3+ with Kubernetes.
- Proven private cloud Kubernetes expertise.
- Strong skills with IaC (eg OpenTofu/Terraform) and GitLab CI/CD.
- Experience with secrets management (OpenBao) in production.
- Knowledge of Helm, ArgoCD/Flux, GitOps, and networking (Antrea, Contour).
- Bonus: Kubernetes certifications (CKA, CKAD).
What's in it for you
A chance to work on cutting-edge global projects while rooted in South Africa, shaping infrastructure that keeps advanced robotics and automation running at scale.
Infrastructure Engineer
Posted today
Job Viewed
Job Description
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from Deepmind, Google, Microsoft, Meta & Snap, have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M+ from world-class investors including General Catalyst, Bessemer, Khosla Ventures & YCombinator – and we're just getting started.
Job Summary
We're hiring an Infrastructure Engineer to design and maintain the systems that power Moonvalley's generative AI research and product development. You'll be joining at a pivotal moment, helping to define the foundations of our infrastructure as we train and deploy cutting-edge video foundation models.
In this role, you'll work closely with researchers, engineers, and cross-functional partners to ensure our infrastructure is scalable, reliable, and efficient. From managing GPU clusters to optimizing ETL pipelines, you'll be instrumental in ensuring the technical performance and productivity of our entire AI platform.
What you'll do
Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi
Maintain and optimize ETL pipelines using Spark, Ray, or Airflow
Operate and improve our telemetry and monitoring stack (Datadog, Grafana, Weights & Biases)
Manage CI/CD pipelines and development tooling (GitHub, PyTorch, Python)
Track and optimize datasets, checkpoints, compute utilization, and related assets
Automate repetitive tasks to improve efficiency and reduce friction across engineering workflows
Participate in an on-call rotation to resolve infrastructure issues and ensure uptime
Provide tooling, documentation, and support to accelerate internal engineering productivity
What we're looking for
Strong generalist with experience managing large-scale, high-performance infrastructure
Skilled in designing scalable systems for compute, data, and developer tooling
Comfortable in high-urgency environments with the ability to prioritize for impact
Familiar with infrastructure stacks for AI model training and experimentation
Experienced with Kubernetes, Terraform/Pulumi, Spark/Ray, and observability tools
Pragmatic problem-solver who favors automation and simplicity over complexity
Open to using and contributing to open-source tooling when appropriate
Bonus: experience as a Cluster Engineer, Data Engineer, or Developer Advocate in AI/ML environments
What we offer (compensation & benefits)
Competitive salary and equity
Private health coverage
Pension contribution
Unlimited paid vacation
Fully-distributed, async-first culture
Hardware setup of your choice
Stipends for phone, internet, and meals
In our team, we approach our work with the dedication similar to Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.
Infrastructure Engineer
Posted today
Job Viewed
Job Description
About our client:
Our client is a global investment advisory firm focusing on long-term value creation through investment strategies. They work with a diverse group of institutional partners and pride themselves on their collaborative, sustainable, inclusive culture and performance.
What you will be doing:
- Manage a global, hybrid infrastructure on a daily basis to ensure optimal performance, security, and uptime.
- Maintain and run IT security systems and processes to protect sensitive data and minimize the risk of breaches.
- Continuously identify and implement best practices and technical improvements to optimize the infrastructure and enhance its security posture.
- Provide high-level (3rd/4th line) technical support for complex issues to both the Technology team and the wider business.
- Actively participate in critical technical processes like disaster recovery and penetration testing, addressing any issues that arise.
- Research, recommend, and implement new technologies to improve overall infrastructure and security.
What our client is looking for:
- A relevant tertiary qualification would be beneficial
- Relevant IT certifications would also be ideal (e.g., Microsoft, CompTIA, ISC, etc.)
- 5+ years of hands-on experience in a system administration or infrastructure management role, preferably in the financial services industry.
- Possesses a broad and deep understanding of IT fundamentals.
- Hands-on experience in system implementation and providing high-level (3rd/4th line) technical support.
- Experience with the full Microsoft stack, including both on-premise and cloud environments like Azure and Office 365.
- Strong experience in multi-site networking, including firewalls, switching, TCP/IP, DNS, and DHCP.
- Experience managing both physical and virtual data center environments.
- A solid understanding of technical and data security concepts, including Microsoft Defender and identity management.
- Proven ability to effectively troubleshoot issues and propose solutions, working proactively and independently.
- Proficient in scripting for infrastructure administration, including languages like PowerShell, KQL, and Terraform.
- Willing to work outside of regular hours for maintenance or incidents, and possesses strong interpersonal and communication skills.
Job ID:
- J
For a more comprehensive list of opportunities that we have on offer, do visit our website -
Requirements
Infrastructure Engineer, IT Security, System Administration, Financial Services, Microsoft Stack, Azure, Office 365, Networking, Data Center, PowerShell, KQL, Terraform