Department: Technical
Employment Type: Full Time
Location: USA, East Coast (Home based)
Compensation: $110,000 - $150,000 / year
Description
About us
We are Digital Science and we are advancing the research ecosystem.
We are a pioneering technology company, and our vision is of a future where a trusted and collaborative research ecosystem drives progress for all. We believe in better, open, collaborative and inclusive research. In creating the next generation of tools and working in partnership with the community we tackle some of the biggest challenges to research. In order to achieve our vision, we need innovative, inspiring and dynamic people to join our team. Want to join us?
Dimensions, part of the Digital Science family, is the world's largest linked research information dataset, covering millions of research publications and connected by more than 1.3 billion citations. We are shaping the future of research and are looking for a Data Scientist to join the team.
Your new role
As part of a global and dynamic team, you will play a key role in the delivery of data and analytical insights through the development and implementation of advanced analytic solutions, including interactive dashboards, data pipelines and other tools to support our team of data scientists. These solutions allow our customers to engage with data and analysis results to support their decision-making processes. You will help our customers, including the largest funding and research organizations in the U.S. Federal government and beyond, to manage their research portfolios more effectively and achieve their missions. You will leverage our data and platforms, including Dimensions and the rest of Digital Science's portfolio, to support research assessment, portfolio management/analysis, strategic planning and more.
The role will touch all aspects of data analysis & delivery, from managing specialised analytic infrastructure resources in secure environments to data collection/wrangling, visualization, and the development & delivery of interactive dashboards and other applications. You will work closely with team members from a diversity of intellectual and professional backgrounds to harness our unique data and product capabilities to address our customers' critical needs.
What You’ll Be Doing
- Conduct large-scale, quantitative data analysis (millions of records) including data collection and data wrangling, using tools such as Python, APIs, and databases
- Plan, design, enhance and document data pipelines, internal-use utilities, tools and software packages
- Deploy, manage, and monitor cloud-based resources to support our team of data scientists
- Leverage LLMs and other AI technologies to address customer analytical needs, incorporating these tools into data processing pipelines and customer-facing applications
- Create and deploy visualizations and interactive web-based dashboards, using tools such as Plotly and Dash
- Build machine-learning models that operate on large collections of text-based documents (tens to hundreds of millions of documents), including document clustering and topic modeling
- Directly interface with customer-facing data scientists and key customer contacts in a team environment
- Monitor new technologies and methods and incorporate them into workflows where appropriate
What You’ll Bring To The Role
- You will have a good understanding of the science and technology (S&T) ecosystem: funders, research organizations and scientific publishing
- You will have experience in Python, including relevant Python libraries and modules such as pandas, scikit-learn, gensim, transformers, PyTorch and Dash
- You will have experience working with modern AI models like GPT, Bard or PaLM and LLM support toolkits such as LangChain, Guidance, and Haystack
- You will be experienced in Natural Language Processing (using tools such as gensim, NLTK, sklearn) and machine learning methods
- You will have broad experience of working with relational database management systems (RDBMS, e.g., PostgreSQL, Google BigQuery)
- You will have experience with data visualization tools (Plotly, D3, matplotlib, etc.)
- You will be experienced in managing cloud resources (e.g., AWS EC2/ECS, S3, ALB) and the cloud-based deployment of web-based applications for both internal and external use (e.g., Linux, Docker, web proxies/load balancers and CI/CD pipelines)
- You will thrive in an environment where you can work independently and remotely
- You will have previous experience of working globally and across multiple teams.
- You will be a strong communicator and able to communicate your findings to a varied audience through written and verbal presentation
- You will be comfortable working in a fast-paced, changing environment and utilise this to empower your career with us
- You will have 3-5 years of experience delivering customer solutions.
Not sure you meet all qualifications? Let us decide! Research shows that women and members of other under-represented groups tend to not apply to jobs when they think they may not meet every qualification, when in fact, they often do! We are committed to creating a diverse and inclusive environment and strongly encourage you to apply.
Additional Information
Current US Public Trust clearance preferred, as applicants will be subject to a security investigation and will need to meet eligibility requirements for access to sensitive information.
Living our Values
We invest in, nurture and support innovative businesses and technologies that make all parts of the research process more open, efficient and effective.
The talent we secure is fundamental to us achieving our vision and our growth plans. The values we live by are:
We are brave in the pursuit of better
We are collaborative and inclusive
We are always open-minded
We are from and for the community
We're an equal opportunity employer. All applicants will be considered for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.