Machine Learning Operations Engineer

Job Details

Sandton, Gauteng, South Africa
Siri Technologies
08.05.2024

Full Job Description

Description

A client based in Sandton is hiring! We are in search of a Senior Machine Learning Operations Engineer (MLOps Engineer) to join the Private Bank Technical Business Intelligence Team. The successful candidate will be responsible for deploying, maintaining, and monitoring machine learning models. We are looking for someone with a background in cloud infrastructure, Kubernetes, and deployment pipelines, and a deep understanding of machine learning.

Responsibilities and Skills
  1. Delivering strategic goals and business objectives
  2. Maintaining platform stability
  3. Designing and building solutions focused on efficiency
  4. Strong team dynamics, people skills, and relationship/network building
  5. Ensuring that the strategy and the team work within the principles and practices of MLOps and engineering, as defined by group engineering and best practices
  6. Solid grasp of DevOps/SRE methodologies and practices
  7. Providing technical guidance and support throughout the release process, including strong troubleshooting abilities across the platform and channel
  8. Strong design and solutioning experience across multiple technologies, and an understanding of Cloud DevOps services and hosting
  9. Git and CI/CD understanding
  10. Cloud native, hybrid cloud, and on-prem design principle understanding
  11. Developing and maintaining deployment pipelines for machine learning models on Microsoft Azure
  12. Monitoring and optimizing the performance of machine learning models in production
  13. Collaborating with data scientists for seamless deployment of models
  14. Ensuring high availability and reliability of the machine learning infrastructure on Microsoft Azure
  15. Providing technical support for machine learning models in production
  16. Conducting regular security assessments and ensuring compliance with industry standards and best practices
  17. Keeping up to date with new Azure ML offerings and technologies to continuously improve our MLOps processes

Requirements

Minimum BSc Computer Science, Engineering, or related field

At least 5 years of experience in ML Operations or a similar role

Extensive experience with Microsoft Azure, Azure pipelines, Functions, and ML offerings

Knowledge of containerization and orchestration technologies (Docker, Kubernetes, Rancher)

Strong programming skills in Python and SQL, with experience using FastAPI and Redis

Strong understanding of Software Engineering concepts

Strong experience writing unit tests

Knowledge of machine learning frameworks such as TensorFlow and PyTorch

Experience with monitoring and logging tools (e.g. Grafana, Kibana)

Excellent problem-solving skills and attention to detail

Knowledge of design patterns
