Machine Learning Engineer (MLE) - Data Solutions C...

Job description


Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

About The Team

The success of data business model hinges on the supply of a large volume of high quality labeled data that will grow exponentially as our business scales up. However, the current cost of data labeling is excessively high. The Data Solutions team is built to understand data strategically at scale for all Global Business Solution (GBS) business needs. Data Solutions Team uses quantitative and qualitative data to guide and uncover insights, turning our findings into real products to power exponential growth. Data Solutions Team responsibility includes infrastructure construction, recognition capabilities management, global labeling delivery management.

About The Role

We are looking for a highly capable machine learning engineer to deploy and optimise our machine learning systems. You will be evaluating existing machine learning (ML) lifecycle, understanding and productionizing the model data pipeline, and enhancing and maintaining the performance of our AI model's predictive automation capabilities.


What you will do

  • Model optimisation: Collaborate with data scientists to improve existing machine learning model training and evaluation pipelines, updating/finetuning the models with different training resources such as GPU or distributed training
  • Model Deployment: Build continuous integration, testing, and scalable deployment pipelines in cloud computing environments for machine learning services
  • Data pipeline productionisation: Work with data scientists and data engineers to design and implement the data pipelines for machine learning models that will support the current and future needs of our business
  • Maintenance: Build scalable and reliable infrastructure that supports feature engineering, model training, deployment, inferencing, performance monitoring
  • Tracking: Build logging, tracking, analyzing, monitoring and reporting pipelines for both data and model tracking in cloud computing environments to ensure correct model output and stable model performance

What you will need

  • Ability to understand the business use case to optimise and implement scalable solution
  • Knowledge of machine learning concepts and fundamentals
  • Deep learning proficiency in at least one of CV and NLP, with solid experience in model finetuning and optimization
  • Solid programming skills with experience writing and maintaining high-quality production code
  • Experience in ML pipeline, model training orchestration; large-scale/distributed training experience is desirable
  • Ability to work independently and complete projects from beginning to end and in a timely manner
  • Great communication skills, both written and oral; comfortable presenting findings and recommendations to non-technical audiences


  • BS or above in Computer Science, Software Engineering, or a related field
  • 5+ years of industry experience building ML infrastructure at scale
  • At least 2 years of experience in developing and deploying large-scale systems, version control, scaling and monitoring
  • Experience in machine learning frameworks (scikit-learn, Tensorflow, Pytorch), big data frameworks (e.g., Spark/Hadoop/Flink) and experience in resource management and task scheduling for large scale distributed systems.
  • Proficient in Python/SQL and one of C/C++/Go, with deep knowledge of Linux and CD tools (e.g. git); Experience with any microservice framework is highly desirable
  • Knowledge of machine learning concepts and fundamentals
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Please let the company know that you found this position on this Job Board as a way to support us, so we can keep posting cool jobs.