At Storytel we believe that powerful stories add an extra dimension to life. We offer hundreds of thousands of audiobooks and ebooks to customers in more than 20 markets, with several new markets launching in the coming year. As we continue to accelerate our development of speech technology and in particular Text-to-Speech, we are hiring three new key roles to our Speech team to help us build some of the best automatic narration technology in the world.
About the Team
The role is in the Speech team, a part of the larger Intelligence group which houses our machine learning and data science teams. In the Speech team we build services that enable Storytel to efficiently generate new, and understand existing content. In particular, our team owns the entire Text-to-Speech stack at Storytel, from data curation to modelling decisions, training and deployment infrastructure. In order to accelerate the development, and get the system in production, we are growing our team. Since the team is new, each position we're hiring for is considered essential. Our new team members will be expected to take on large responsibilities and will impact all aspects of our work. While each role's main responsibilities are different, we will all work closely together to achieve our big ambitions of highly automated and prosodically rich speech synthesis.
We are an international company with colleagues in the larger Intelligence team in Stockholm, Barcelona and Copenhagen. The Speech team is currently based in Stockholm, and while we hope to keep building the team in the Stockholm offices we are open to work with the right candidates to find a solution that is great for both parties.
About the Role
As a Machine Learning Engineer focusing on Deep Learning you will have an essential role in developing TTS and other speech technology applications. While we work together in the team on many parts of our services, you will have a large responsibility for model implementation, our training infrastructure and the way we package our artifacts for serving. We also expect you to help improve how we use and build datasets for TTS, e.g. setting up self-/semi-supervised learning for various models in our stack or working with active learning to improve the annotated part of our data. To do this well we believe that you will also need to stay up to date with developments in deep learning and interact with the research and open source communities.
You understand that in practice, deep learning requires more engineering work than advertised, and that a significant amount of the work is required in everything from data preparation, versioning and deployment. Your strong understanding of neural networks helps you in debugging networks and improving them and understanding various test time tradeoffs. In this rapidly evolving field you enjoy keeping up to date with new methods and might even have a few favorite applications that you follow closely
To be successful in this role we believe that you have
MSc degree, or similar, in Speech technology, Machine Learning, Computer Science, Mathematics, Physics, or a related field
Good knowledge of recent developments in at least one of: Self-/Semi-Supervised Learning, Active Learning, Distributed Training, Generative Models, Flow based models, GANs, Autoregressive models, Transformers, Audio, Speech
Strong Python development knowledge
Excellent at writing and speaking English
While not required, we would also love to hear about any of:
What we offer
Does this sound like you? If you feel like Storytel is a place where you could thrive, let us know and we will contact you as soon as possible.