Data Engineer

Company:
Location: Dallas, TX

*** Mention DataYoshi when applying ***

Job Description:

Role activities:

  • Be a fundamental part of the data science team, collaborating with data scientists, product owners, and project managers to build machine learning automation/pipeline solutions
  • Have a strong database background; SQL Server/Hadoop data domain expertise is a plus
  • Ability to take a model from development to production
  • Experience with cloud deployment and performance optimization of ML models; Azure ML and Databricks are a strong plus
  • Help architect automation framework scaffolding to leverage standardized processes
  • Ability to scale solutions to handle large amounts of data efficiently, with controls to ensure failsafe product delivery
  • Ability to understand the big picture: not just a single solution, but how hardware and software solutions align, and how to stay ahead of capacity needs
  • Implement CI/CD processes for the ML model development and deployment life cycle

Required qualifications:
  • 3-5 years of progressive Python development experience, including working with various packages and dependency resolution
  • 3-5 years of experience working with Hadoop, Hive, Spark, and other Big Data tools
  • Hands-on experience with custom UI on Elasticsearch and data visualization with Kibana
  • Data quality control, including data cleansing/wrangling, and data pipeline builds and management
  • Automation of batch jobs using AutoSys and other scheduling tools, including monitoring and control of batch jobs
  • Unix/Linux Administration experience
  • Agile experience required, with use of JIRA for workload management
  • Experience with multiple dev pipelines and CI/CD using Bitbucket, Ansible, Jenkins, and other tools
  • Understanding of model building architecture

Preferred qualifications:
  • NLP model development and testing, including predefined models such as BERT and RoBERTa
  • Elasticsearch API and web application development

Job Band:

H5

Shift:

1st shift (United States of America)

Hours Per Week:

40

Weekly Schedule:

Referral Bonus Amount:

0
