Data Engineer, Intern

Location: São Paulo, SP

*** Mention DataYoshi when applying ***

Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities - we're just getting started.
Our data centers are the foundation upon which our rapidly scaling infrastructure efficiently operates to deliver our innovative products. The Data Center Analytics team is seeking a Data Engineer Intern to join the Data Center Data Science Team. This team owns the data engineering, business intelligence, advanced analytics, and analytics solution engineering for the entire Infrastructure Data Center organization. Data engineer is responsible to source large amount of data at scale from various tools and systems using best practices around ETL, dimensional modeling, SQL, HDFS, Spark and Python. Prior knowledge in applied analytics and operationalizing machine learning models is a plus. Help build Facebook’s world class data centers & empower them with analytical solutions in partnership with our data scientists to allow our facilities and sites to become autonomous and better serve over 2.5+ billion people globally!
  • Apply proven expertise and build high-performance scalable data warehouses
  • Design, build and launch efficient & reliable data pipelines to move and transform data (both large and small amounts)
  • Securely source external data from numerous partners
  • Intelligently design data models for optimal storage and retrieval
  • Deploy inclusive data quality checks to ensure high quality of data
  • Optimize existing pipelines and maintain of all domain-related data pipelines
  • Ownership of the end-to-end data engineering component of the solution
  • Collaboration with the Data Center SMEs, Data Scientists, and Program Managers
  • Design and develop new systems in partnership with software engineers to enable quick and easy consumption of data

  • Currently has, or is in the process of obtaining, a MS degree in Computer Science, Electrical Engineering, Information Systems or other related engineering field
  • Coding & scripting proficiency in languages such as Python, C++ or Java & API integration
  • Experience extracting, transforming and loading data using a big data platform using HDFS, Hive and Presto
  • SQL knowledge in handling volumes of data and performance tuning
  • Proven Knowledge in relational database
  • Ability to translate business requirements into analytics solutions and creating dimensional models
  • Experience in working with cross-functional teams
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

*** Mention DataYoshi when applying ***

Offers you may like...

  • Facebook

    Data Engineer, Analytics Team
  • CRG

    Sr. Cloud Data Engineer
    Cincinnati, OH 45202
  • 2nd Watch, Inc.

    Sr. Data Engineer - Cloud Services
  • 2nd Watch, Inc.

    Data Engineer - Cloud Services
  • Samaritan Health Services

    IS-Data Engineer II
    Corvallis, OR 97330