※ Submission of an English resume or CV is mandatory.
Job Summary
We are seeking a highly skilled and motivated Machine Learning Engineer to join our innovative and fast-paced team. In this role, you will be instrumental in building and maintaining the foundational infrastructure that empowers our large-scale machine learning research and development efforts. You will have a unique opportunity to shape the future of our ML capabilities by designing robust systems, optimizing compute resources, and accelerating the pace of experimentation.
Responsibilities
- Design, build, and maintain distributed systems that effectively support large-scale machine learning research, ensuring scalability, reliability, and performance.
- Develop robust and intuitive tooling to significantly accelerate the experimentation lifecycle and streamline machine learning model training processes.
- Manage and optimize our compute infrastructure to ensure efficient resource utilization, cost-effectiveness, and seamless operation for ML workloads.
- Collaborate closely with ML researchers and data scientists to understand their needs and translate them into technical solutions.
- Implement and maintain CI/CD pipelines for ML models and infrastructure, promoting automation and best practices.
- Monitor system performance, troubleshoot issues, and implement proactive solutions to ensure high availability and stability.
- Stay up-to-date with the latest advancements in distributed systems, cloud technologies, and machine learning infrastructure.
Minimum Qualifications
- 5+ years of relevant experience with a bachelor’s degree in computer science or a related technical discipline.
- Strong background in distributed systems, cloud platforms (AWS, Azure, GCP), and infrastructure at scale.
- Proficiency in Python and extensive experience with core machine learning/AI tooling (e.g., PyTorch, TensorFlow, JAX, or similar frameworks).
- Familiarity with containerization technologies (Docker) and orchestration platforms (Kubernetes).
- Experience with implementing and managing CI/CD pipelines for software development and machine learning workflows.
- Proficiency in English.
- Proven experience supporting high-performance or research-oriented computing environments.
- Ability to thrive in a fast-moving, exploratory setting with a high degree of ownership and autonomy.
- Excellent communication and collaboration abilities.
- Strong attention to detail and a commitment to delivering high-quality software.
- Ability to work independently and lead technical discussions.
Recruitment Process

Notifications
- Applicants must have no restrictions on overseas travel (military service must be completed or exempted).
- Persons with disabilities and persons eligible for national protection will be given preferential treatment in accordance with relevant laws when applying.
- All positions have a probationary period of three months, and the probationary contract may be extended or terminated depending on the evaluation.
- If any false information is found in the submitted documents or during the recruitment process, acceptance may be cancelled.
- The process may change depending on circumstances; applicants will be notified separately of any change.
- Recruitment is conducted on a rolling basis, so the announcement may close early once a successful applicant is selected.
- If the final assessment does not meet internal standards, recruitment may be cancelled.
- Personal information, including your resume, will be kept for up to two years from the date of notification of the final result of the recruitment process. However, the information can be deleted at the applicant's request.
- Employee benefits may be subject to change based on the company's budget and headquarters' policies.