Clever Recruiting

Machine Learning Engineer - LLM

Job description

Our client, a visionary leader in the technology sector, is dedicated to pushing the boundaries of Large Language Models (LLMs). The client's dynamic team is at the forefront of LLM technology, focusing on areas including:


  • Foundational model training
  • Web-scale data collection
  • Efficient inference strategies
  • Model alignment techniques
  • Model evaluation methodologies


The client's mission is to craft cutting-edge language generation technology, both for internal innovation and customer-centric applications. By doing so, they're driving the evolution of the next wave of AI-driven products.


Tasks


The Opportunity


Our client is on the lookout for skilled Machine Learning (ML) Engineers who share their enthusiasm for large language models. Your expertise will play a pivotal role in shaping the state-of-the-art LLM stack.


In this position, responsibilities will span a range of tasks, including:


  • Architecting and developing the distributed training and data processing infrastructure
  • Applying advanced deep learning techniques to improve model quality
  • Exploring methods to improve the quality and quantity of training data
  • Optimizing training and inference infrastructure to maximize hardware performance


Requirements


What They're Looking For


To excel in this role, candidates should possess:


  • A robust background in deep learning, whether in industry or academia
  • A solid grasp of machine learning theory
  • Familiarity with a modern deep learning framework (such as TensorFlow, PyTorch, or JAX)
  • Proficiency in contemporary software engineering practices like CI/CD, version control, and unit testing
  • A genuine passion for large language models and generative AI
  • An unwavering commitment to precision and thoroughness in their work


Preferred Qualifications


Additional advantages include:


  • Experience with language models or similar NLP technologies
  • A track record of building and delivering products (not exclusively ML-related) in a dynamic startup-like environment
  • Strong engineering skills, including the development of large distributed systems or high-load web services
  • Notable open-source projects that showcase their engineering prowess


Benefits


  • Relocation (if needed)
  • Competitive salary
