Data Engineer

Location: Bengaluru, Karnataka

*** Mention DataYoshi when applying ***

Data Engineer - Artificial Intelligence & Machine Learning

Job Profile

Does working with data on a day-to-day basis excite you? Are you interested in building robust data architecture to identify data patterns and optimise data consumption for our customers, who will forecast and predict what actions to undertake based on data? If so, you’ll love working in our intelligent automation team.

Schneider Digital is leading the digital transformation of Schneider Electric by building a highly available, massively scalable digital platform for the enterprise.

We are looking for a savvy Data Engineer to join our growing team of AI and machine learning experts. You will be responsible for expanding and optimizing our data and data pipeline architecture, as well as optimizing data flow and collection for cross-functional teams. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up.

The Data Engineer will support our software engineers, data analysts and data scientists on data initiatives and will ensure optimal data delivery architecture is consistent throughout ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems and products.

Responsibilities


  • Create and maintain optimal data pipeline architecture; assemble large, complex data sets that meet functional / non-functional requirements
  • Design and build production data pipelines from ingestion to consumption within a big data architecture for data transfer and processing
  • Build the data marts and data warehouses required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
  • Create the necessary preprocessing and postprocessing for structured, semi-structured and unstructured data for training/retraining and inference ingestion as required, including feature extraction, standardization and normalization
  • Store, catalog, document and make reusable features available for ML consumption (training, testing and prediction)
  • Support both batch and real-time paradigms as required for all data pipelines
  • Create data visualization and business intelligence tools that give stakeholders and data scientists the business/solution insights they need
  • Collaborate with data scientists to ensure success of all machine learning and AI projects
  • Identify, design, and implement internal process improvements: automating manual data processes, optimizing data delivery, etc.
  • Ensure our data is separated and secure across national boundaries through multiple data centers and AWS regions

Requirements and Skills

  • You should have a bachelor’s or master’s degree in Computer Science, Information Technology or another quantitative field.
  • You should have at least 5 years of experience working as a data engineer supporting large data transformation initiatives related to machine learning, with experience in building and optimizing ‘big data’ pipelines and data sets.
  • Strong analytic skills related to working with unstructured datasets.
  • Experience with big data tools: Hadoop, Spark, Kafka, etc.
  • Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
  • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
  • Experience with AWS cloud services (EC2, EMR, RDS, Redshift) and familiarity with various AWS log formats.
  • Experience with stream-processing systems: Storm, Spark Streaming, etc.
  • Experience with object-oriented/object function scripting languages: Python, Java, C++, etc.

About Us

Schneider Electric™ creates connected technologies that reshape industries, transform cities and enrich lives. Our 144,000 employees thrive in more than 100 countries. From the simplest of switches to complex operational systems, our technology, software and services improve the way our customers manage and automate their operations. Help us deliver solutions that ensure Life Is On everywhere, for everyone and at every moment.


Great people make Schneider Electric a great company.

We seek out and reward people for putting the customer first, being disruptive to the status quo, embracing different perspectives, continuously learning, and acting like owners. We want our employees to reflect the diversity of the communities in which we operate. We welcome people as they are, creating an inclusive culture where all forms of diversity are seen as a real value for the company. We’re looking for people with a passion for success, on the job and beyond.

Primary Location: IN-Karnataka-Bangalore

Full-time

Unposting Date: Ongoing

