Capgemini

Databricks Data Engineer

Job description

Department/project description,


Insights & Data practice delivers cutting-edge data centric solutions.

Most of our projects are Cloud & Big Data engineering. We develop solutions to process large, also unstructured, datasets, with dedicated Cloud Data services on AWS, Azure or GCP.


We are responsible for full SDLC of the solution: apart from using data processing tools (e.g., ETL), we code a lot in Python, Scala or Java and use DevOps tools and best practices. The data is either exposed to downstream systems via API, outbound interfaces or visualized on reports and dashboards.


Within our AI CoE we deliver Data Science and Machine Learning projects with focus on NLP, Anomaly Detection and Computer Vision.

Additionally, we are exploring the area of Quantum Computing, searching for practical growth opportunities for both us and our clients.


Currently, over 250 of our Data Architects, Engineers and Scientists work on exciting projects for over 30 clients from different sectors (Financial Services, Logistics, Automotive, Telco and others)


Come on Board!


Your daily tasks,


  • Designing and implementing solutions for processing large and unstructured datasets, including Data Lake Architecture and Streaming Architecture.
  • Implementing, optimizing, and testing modern DWH/Big Data solutions using Databricks Platform within a Continuous Delivery/Continuous Integration environment.
  • Improving data processing efficiency and managing migrations from on-premises to public cloud platforms.
  • Development of Data, AI, and ML applications, as well as Generative AI solutions


Frequently used technologies,


  • Databricks 5
  • Python/PySpark 4
  • Cloud: Azure, AWS, GCP 4
  • SQL 3


Our expectation,


  • At least 3 years of experience in Big Data or Cloud projects in the areas of processing and visualization of large and/or unstructured datasets (including at least 1 year of hands-on Databricks experience)
  • practical knowledge of at least one Public Cloud platform in Storage, Compute (+Serverless), Networking and DevOps areas supported by commercial project work experience.
  • At least basic knowledge of SQL and one of programming languages: Python/Scala/Java/bash
  • Very good command of English


Our offer,


  • Permanent employment contract from the first day,
  • Hybrid, flexible working model,
  • Possibility of using increased tax-deductible costs in the case of creative work,
  • Co-financing to equip a workplace at home,
  • Development opportunities:
  • Substantive support from project leaders,
  • A wide range of internal and external trainings (technical, language, leadership),
  • Certification support in various areas,
  • Mentoring and a real impact on shaping your career path,
  • Access to a database of over 2,000 training courses on Pluralsight, Coursera, Harvard platforms,
  • Internal communities (including Agile, IoT, Digital, Security, Women@Capgemini),
  • The opportunity to participate in conferences both as a listener and an expert;
  • Relocation package;
  • Benefits as part of the social package (including Multisport card, medical care for the whole family, group insurance on preferential terms, cafeteria).

Please let the company know that you found this position on this Job Board as a way to support us, so we can keep posting cool jobs.