Software Engineering, Data Engineer

Location: Vancouver, WA 98660

*** Mention DataYoshi when applying ***

Software Engineering, Data Engineer

AbSci is a leading synthetic biology company that translates ideas into drugs with a platform technology that reinvents the biopharmaceutical drug discovery process. We custom engineer E. coli to create novel complex biologics in their full length format while simultaneously developing production cell lines. We are deploying cutting edge deep learning artificial intelligence to inform our designs, and every day we succeed in achieving things others have dismissed as impossible. With more than a dozen partnerships in place with top pharma and industry leaders, our collaborations include projects for drugs and drug candidates that range across multiple protein types and therapeutic functionality. We are continually innovating, expanding the scope of what we can do and the impact we can have as we help get better drugs to patients...sooner. We are based in Vancouver, Washington, ten minutes from Portland, Oregon and an hour from world class alpine terrain and rugged Pacific coastlines.

Job Description

As a software engineer at AbSci, you will play a critical role in how biological sequence data is processed, analyzed, visualized and ultimately utilized by scientists to make actionable decisions. Biological sequence data plays a critical role in the development of our core technology. You will learn a broad range of industry relevant skills, help shape AbSci's strategic development of core technology, and work closely with a very talented team with some of the deepest domain expertise in the industry. If you're excited about working on novel technology that empowers scientists to make the world a better place, we would love to hear from you.

Job Responsibilities:

  • Collaborate with our biological sequencing team to develop applications for storing, analyzing, and visualizing biological sequence data
  • Solve complex data storage and automation problems
  • Build REST APIs that provide access to biological sequence data and analysis results
  • Develop cloud-based applications and services for biological sequence data management
  • Develop new ETL pipelines for managing large volumes of biological sequence data


  • 2-5 years of experience with provisioning cloud services with GCP, AWS, or Azure
  • Strong proficiency in Python scripting
  • Experience architecting reliable infrastructure platforms including monitoring and alerting, load balancing, scalable services, and multi-region deployments
  • Experience with batch computing systems such as AWS Batch or SLURM
  • Experience designing and developing REST APIs
  • Familiarity with AWS Sagemaker, Google Kubernetes Engine, or comparable technology for managing large scale ML model deployments
  • Familiarity with designing and implementing data processing pipelines
  • Familiarity with database languages (e.g., SQL, No-SQL) and basic data model implementation
  • Proficiency in the Linux environment and experience with version control practices and tools (Git, Mercurial, etc.)
  • Effective interpersonal communication and team collaboration skills but also possess the ability to work independently
  • Enjoy learning new technologies
  • Enjoy empowering scientists with novel software solutions

Nice to Have Experience and Knowledge:

  • Experience working with NGS data
  • Experience working with biological data
  • Experience with defining infrastructure following regulatory compliance (GDPR, HIPAA, etc).
  • Experience working with medium-to-large size datasets
  • Experience with data processing pipelines like nextflow

We seek data engineering candidates who will dive all in to participate in our creative company culture that's collaborative, multidisciplinary, and committed to a big vision for positive impact. We are defying conventions and innovating without boundaries. We are disrupting an industry with bold ideas and passionate pursuit of new possibilities. We are looking for original thinkers, creative scientists, and data-devoted gurus. Successful candidates will be excited to work in a dynamic startup environment and contribute as a key member of a project team. If this sounds good to you, we invite you to join us in our quest to redefine possible.

AbSci offers highly competitive salaries and benefits, including medical and dental insurance, paid time off, and 401(k) with a generous company match. Legal authorization to work in the U.S. is required. We are not able to sponsor individuals for employment visas for this job. In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.

AbSci is headquartered in Vancouver, WA (just outside of Portland, OR) - AbSci may be open to remote-based workers for this role. Must be available to work standard business hours in Pacific Time. AbSci offers a dog-friendly work environment - bring your pup along for the ride.

*** Mention DataYoshi when applying ***

Offers you may like...

  • Anju Software

    Data Analyst
    2600 Berchem
  • Voi Technology

    Senior Data Analyst - Software & Data
    111 52 Stockholm
  • Zscaler

    Principal Software Engineer - Data Engineering
    New York, NY 10012
  • Sonalysts, Inc.

    Junior Software Engineer / Data Scientist
    Waterford, CT 06385
  • CVS Health

    Senior Software Data Engineer
    Denver, CO