Data Scientist

Company: Papercup
Location: London

*** Mention DataYoshi when applying ***

Your mission
Papercup is a machine learning start-up transforming the media industry. We noticed a huge problem: 99.9% of video content is shackled to a single language. We’d like to fix this by making the world's content watchable in any language. Using machine learning, we’re translating videos by generating voices that sound like the original speaker, capturing not only the characteristics of the speaker's voice but also the way they speak. Backed by major venture capitalists and media giants, we’ve already raised millions in funding. We’re now looking for a Data Scientist to join our elite squad of engineers.
Your profile
About the role

As a Data Scientist/Engineer at Papercup, you will be part of a great team pushing the boundaries of neural text-to-speech and speech-to-speech translation systems. Working with the Product and Machine Learning teams, you will uncover insights through statistical models and analytics dashboards to drive the product and research vision.
You will be responsible for identifying future data needs and implementing dashboards and data pipelines. As a member of the Product Team, you will participate in team-wide workshops and present your work in cross-team and company-wide sessions.

Your Skills

We need your help with:
  • Extend and maintain our existing data systems
  • Create analyses and dashboards on our video production pipeline
  • Develop and support business model validation through hypothesis testing

Must haves:
  • Python coding skills, particularly in the areas of automation & integrations
  • Familiarity with SQL and relational databases
  • Knowledge of data cleaning, wrangling, visualization, and reporting
  • Attention to detail and the ability to QA your own and other team members' work
  • Knowledge of the statistical principles necessary to interpret data and apply models. For example, knowledge of errors and confidence intervals to understand whether a relationship seen in the data is spurious or significant.
  • Exposure to iterative/agile development methodologies such as Scrum

Nice to haves:
  • ETL/ELT knowledge, experience with DAGs to manage script dependencies
  • Knowledge of NoSQL databases
  • NLP: Misspellings, Suggestions, Word Embeddings, Cross-lingual Embeddings, Named Entity Recognition, Dependency Parsing, Part-of-speech tagging, Information Retrieval, Deep Information Retrieval, Query Expansion, Product Enrichment.
  • Experience working with cloud services - GCP, AWS, Azure (AWS preferred)
Why us?
Engineering at Papercup
  • We’re committed to expanding our diverse team of experts, and we’re interested in finding THE BEST candidate for the job. That person may be one who comes from a less traditional background, and that’s okay. We strongly encourage you to apply, even if you don’t believe you meet every one of the qualifications described.
  • We are not a typical web application: the bulk of our time is spent on building internal tooling for the team to produce videos. This means we have a very tight closed loop of feedback, quick release cycles, and don't have to support all browsers (did you hear that? Never again shall you hear the words "This doesn't work on IE11", because it doesn't have to).
  • We don't believe there is a single solution or methodology that fits all companies. At Papercup we study different solutions and adapt them to our product.
  • Everyone is a problem solver: if you have an idea for a better system or process, you have a voice. We all participate in discussions by proposing and commenting on RFCs.
  • We are all geeks: we love learning and trying out new tools and techniques.
  • We host regular events to foster a learning community.
  • No silos: everybody works with everybody. All teams work together to help improve each other's processes.
  • As an engineering team, we have a lot of trust and autonomy within the company to invest time into our infrastructure, code, and tooling.

Benefits Include:
  • Competitive salary - £50,000 - £70,000 (dependent on experience)
  • Monthly wellness benefit
  • Remote working options (2-3 days)
  • Unlimited holidays
  • Personal learning budget
  • Excellent parental leave options
Technologies we love (including but not limited to):
Languages: TypeScript, JavaScript, Python
Frameworks and Modules: ReactJS, NodeJS, Prisma, Next.js, Tachyons
Database: Postgres, MySQL
Tools: Notion, Clubhouse, Stack Overflow for Teams
Infrastructure: AWS (ECS Fargate, Lambda, S3), GCP (APIs and Kubernetes), GraphQL, Azure (Kubernetes), Sentry, Vercel, Auth0, RedisLabs

Please note we’re not looking for someone who ticks all the boxes. If you have some of the skills listed above and are willing to learn, you’re the person for Papercup.
About us
Papercup is a machine learning start-up transforming the media industry. 99.9% of video content is shackled to a single language. Our ambition is to make the world's content watchable in any language. We're translating videos by generating voices that sound like the original speaker, capturing not only the characteristics of the speaker's voice but also the way they speak.
