Project Data Engineer (AWS PySpark Developer)

Location: Wayne, PA

*** Mention DataYoshi when applying ***



  • Experience in building large scale batch and data pipelines with data processing frameworks in AWS cloud platform using PySpark (on EMR) & Glue ETL
  • Deep experience in developing data processing data manipulation tasks using PySpark such as reading data from external sources merge data perform data enrichment and load in to target data destinations.
  • Proficiency with Big Data processing technologies (Hadoop, Hive, or Databricks)
  • Experience in deployment and operationalizing the code using CI/CD tools Bitbucket and Bamboo
  • Experience with SQL and relational databases
  • Strong AWS cloud computing experience. Extensive experience in Lambda, S3, EMR, Redshift

The Capgemini Freelancer Gateway is enabled by a cutting-edge software platform that leads the contingent labor world for technology innovation. The software platform leverages Machine Learning and Artificial Intelligence to make sure the right people end up in the right job.

A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients’ opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50 year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of over 200,000 team members in more than 40 countries. The Group reported 2018 global revenues of EUR 13.2 billion.

*** Mention DataYoshi when applying ***

Offers you may like...

  • Project A Ventures

    Data analyst in Customer Relations (m/f/d)
    20095 Hamburg
  • CRIF S.p.A.

    Milano, Lombardia
  • Project A Ventures

    Senior Data Engineer (m/f/d)
    Koblach, V
  • Project A Services GmbH & Co. KG

    (Senior) Data Engineer (m/f/d)
  • Applied Research Associates, Inc

    Mid-Level Data Scientist & Project Manager
    Washington, DC 20001