Research Machine Learning Engineer Intern

Company:
Location: 75017 Paris

*** Mention DataYoshi when applying ***

Company Description


Dailymotion is the leading video discovery destination & technology that learns about your tastes over time, constantly surfacing the best, most relevant content on the web. Our mission is to provide the best video user experience for consumers on the market, connecting publishers and advertisers to engaged viewers who turn to Dailymotion for their daily fix of the most compelling music, entertainment, news, and sports content around.

Through partnerships with the world's leading publishers and content creators, France Télévisions, Le Parisien, CBS, Bein Sports, CNN, GQ, Universal Music Group, VICE and more, Dailymotion commands 3 billion monthly pageviews across its mobile app, desktop and connected TV experiences. Dailymotion is owned by Vivendi, one of the largest mass-media corporations in the world.

At Dailymotion, we‘re storytellers. We build the best place for people to enjoy the videos that matter. We do this through utilizing and developing cutting-edge technology and pushing the envelope to bring discoverable stories to life through premium content from the world’s best publishers. We do this by helping these publishers grow their audiences and monetize their content, their way.

Dailymotion is proud to be an equal employment opportunity and affirmative action employer. We value inclusion and we want you to help us thrive for a more diverse community.


Subject

Content characterisation: Use of multi modal (Textual, audio and/or visual) signals to characterize Dailymotion's content and contribute to a better user experience.


Job Description


Dailymotion is seeking a Research Machine Learning Scientist Engineer to join our Machine Learning team, and especially, the team responsible of our content categorization. Your work will have an impact throughout Dailymotion’s business and help make data-driven decisions on products and strategy. You will be part of a team made up of several Machine Learning Scientists/Engineers, Data Engineers, and Data Analysts that closely work together on several machine learning projects including recommender systems, semantic annotations, fraud detection, etc.

In order to better understand, organize and expose our video catalog to our millions of active users, Dailymotion's Machine Learning team developed several models to categorize our content. Well-categorized content helps to design a better user experience through better recommendations, contextual ads, etc. In order to keep up to the market, we have some tracks to add new/improved features. One of which is multi-modal learning, combining textual, audio, and visual information to improve our catalog categorization. It is key for many transverse businesses of Dailymotion and has a positive impact on buyers (placing commercials on the appropriate video), but also for publishers and active users of the platform. Thus, the benefits of this internship could be huge, with the industrialization of the solution when the performances are interesting.

As an Intern Research Machine Learning Engineer, you will

  • Try, evaluate and benchmark state-of-the-art multi-modal representation/classification methods.
  • Implement a scalable prototype in Python of a state-of-the-art algorithm to build a fully automated classification of the videos based on textual, audio, and/or visual signals.
  • Validate the algorithm performance according to our internal business KPIs on an evaluation dataset of videos from our catalog.
  • Finally, participate in the industrialization of the solution in the Dailymotion framework.

Qualifications
  • MS in Data Science / good knowledge of standard machine learning algorithms with a focus on: classification problems, computer vision, NLP, audio classification, Neural Networks (auto-encoders, LSTM, CNN, …).
  • Strong coding skills in Python (Scikit-learn, Numpy, Pandas, Tensorflow, Keras) and SQL.
  • Some knowledge of agile development methodologies (Git/Github, Docker, Jupyter notebook).
  • Intermediate level in English (spoken, written).
  • Experience with Google Cloud Platform is a plus (BigQuery, Cloud Storage, Compute Engine)

Additional Information


Bibliography

  • Less is More: CLIPBERT for Video-and-Language Learning via Sparse Sampling (https://arxiv.org/pdf/2102.06183.pdf)
  • Audio-Visual Instance Discrimination with Cross-Modal Agreement (https://arxiv.org/abs/2004.12943)
  • NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification (https://arxiv.org/abs/1811.05014)
  • How Deep Learning can boost Contextual Advertising Capabilities (https://medium.com/dailymotion/how-deep-learning-can-boost-contextual-advertising-capabilities-c9ca7c8fc4e9)
  • Bag-of-words representation for video channels’ semantic structuring (https://medium.com/dailymotion/bag-of-words-representation-for-video-channels-semantic-structuring-4f2777591e4a)
  • How we used Cross-Lingual Transfer Learning to categorize our content (https://medium.com/dailymotion/how-we-used-cross-lingual-transfer-learning-to-categorize-our-content-c8e0f9c1c6c3)


At Dailymotion, we empower candidates to take action. If this job sounds like a great opportunity for you, be confident in your skills, we are always happy to meet you! If needed, we can accommodate our recruitment process for your special abilities.

Location: Paris, France possibly some remote
Duration/Type of contract: 4 to 6 months Internship (full-time)
Start Date: September

Want to learn more about us:

  • Dailymotion.com
  • New-York office - BuiltIn
  • Offices in France - Welcome to the Jungle
  • Our articles
  • Remote Work Policy
    Saving Plan Vivendi
    Paternity leave or Coparental leave extended
    ️ Living Employee Culture (Events / Trainings / Partys / All hands / Dailymotion tradition…)
    Career development support (training / internal mobility / compensation cycle / 360 quarter feedback review …)
    High-end Health Insurance and Personal Services Vouchers (CESU)
    • ️ Paid Time off – RTT and Saving time plan (CET)
    ✅ Meal Vouchers – Public Transport and Bike refund
    European Economic and Social Committee (sport membership/cinemas vouchers/gift vouchers/discount)

    I'm interested

*** Mention DataYoshi when applying ***

Offers you may like...

  • National Youth Council

    Data Analyst (Pedagogy and Research) Trainee #SGUn...
    Singapore
  • Columbus Technologies

    Scientific Data Engineer
    Research Triangle Park, NC 27709
  • Feedzai

    Research Data Scientist
    Lisboa
  • SmartBLKTrade Limited (SBT)

    Director of AI Dept / Research Data Scientist / AI...
    Hong Kong
  • SmartBLKTrade Limited (SBT)

    Research Data Scientist / Big Data Engineer, AI De...
    Hong Kong