Data Engineer

Location: 10178 Berlin

*** Mention DataYoshi when applying ***

At Gemma, we help our clients activate data by using state-of-the-art technology. Our clients make better decisions and are empowered to make use of their data on their own. We are service-focused, yet also build open-source tools to deliver a more effective and efficient service.

Our clients range from Series A ventures to SMEs, with 30 to 13,000 employees per client. We have an honest, pleasant, and fun work environment. Feel free to ask our references to verify this : )


About the Job

Gemma Analytics is data-driven and helps clients to become more data-driven.

As the first pure data engineer in our team, you play a crucial role in developing our open-source Python library EWAH from a tool that we use internally and with our clients into a tool that data engineers of other companies love to use in their data stack. The challenges that this open-source library solves are the following:

  • Data transfer from raw data sources to data warehouses
  • Enabling Analytics Engineers to set up and maintain state-of-the-art data infrastructures using open-source software components
  • GDPR compliance as data does not leave the customer’s environment
  • Saving time for data engineers and analytics engineers by abstracting away data engineering issues

Besides developing the open-source library, you will work on challenges brought by our existing and new clients: connecting new data sources, supporting new target databases, and handling custom requests around setting up and maintaining an effective and efficient data infrastructure across our entire client portfolio.

We are looking for a person who loves abstractions and understands how to apply abstractions to make everyone’s life easier.

Technologies you’ll use

Working with multiple clients, we are in touch with many technologies, which is truly exciting. We aim to use state-of-the-art technologies while being fully pragmatic (we do not crack a walnut with a sledgehammer). We follow an ELT philosophy and divide the tasks between Data Engineering and Analytics Engineering accordingly.

The following technologies constitute our preferred data tech stack:

Data Loading

  • For most clients, we use our own open-source Python library EWAH and Apache Airflow
  • For simple requests, we work with Fivetran or Stitch

Data Warehousing

  • For smaller data loads, we mostly use PostgreSQL databases
  • For larger datasets, we work with Snowflake or BigQuery

Data Transformation

  • We love to work with data build tool (dbt)

Data Visualization

  • For smaller businesses with fewer than 100 FTEs, we mostly recommend Metabase as a powerful open-source reporting tool
  • For specific needs and a centralized BI, we recommend Power BI or Tableau
  • For a decentralized, self-service BI with more than 50 users, we recommend Looker

We believe in a good mixture of experience and potential in our team. We are equally interested in experienced candidates and in candidates with strong potential.

Besides that, we are looking for the following:
  • Extensive experience using Python to solve data engineering challenges in production
  • Fluency in English
  • Optional: Experience in creating, shipping, and maintaining high-quality open-source applications
  • Optional: Experience with SQL and relational databases

We are located in Berlin, close to Nordbahnhof. We are currently 6 colleagues and will grow to 10-12 colleagues in the coming months. Other perks include:

  • We are remote-friendly - not only during pandemic times
  • We have an honest, inclusive work environment and want to nurture this environment
  • We don’t compromise on equipment - a powerful laptop, extra screens, and all the tools you need to be effective
  • We will surround you with great people who love to solve (mostly data) riddles
  • We believe in efficient working hours rather than long working hours - we focus on the output rather than the input
  • We learn and share during meetups and lunch & learn sessions, and we are open to further initiatives
  • We pay a market-friendly salary and we additionally distribute at least 20% of profits to our employees
  • We are fast-growing, have technology at our core, yet we do not rely on a VC and operate profitably
  • We have a great yearly offsite event that brings us all together for a full week, enjoying good food, having a good time, and of course, solving complex data-related challenges

How you’ll get here

1. CV Screening
2. Phone/Coffee/Tea Conversation
3. Interviews with 2 future colleagues
4. Offer + Hired

Looking forward to your application : )
