As a G7 Data Scientist within Data Architecture you will provide the technical lead around developing code, methods and processes to exploit key strategic external data for a variety of wider business products; with a focus on developing a number of geospatial, economic, and reference data products for both internal and cross-government use.
This will involve developing operational and ONS data to form the basis of a series of products as well as applying established data science, and statistical methods to the data for the main stage of the product build. This involves using key programming languages such as Python and SQL as well as a strong understanding of both the theory and application of mainstream data science and statistical methods in a big data context. This also requires an understanding of key government operational data as well a familiarity with survey and/or Census data and collection methods.
The role will involve the leadership of several high-profile technical projects covering end-to-end development of the key Data Architecture outputs. It also involves matrix-managing staff working on these projects. As such experience leading the big data projects and production of analytical outputs is required.
- Working collaboratively with relevant experts (particularly in Methodology Division and Data Architecture) as well as the wider ONS and strategic data suppliers to identify and resolve key data, and methodological issues affecting the development of key big data products.
- Taking day-to-day responsibility for implementing linking methods across key demographic, social, and economic data series to establish multiple dimensions of a range of data Engineering projects. This involves integrating a range of priority administrative data sources, particularly linking person demographics and statuses across time.
- Support other senior managers in developing technical solutions for key projects around data engineering and linking work developing operational data for key Census, Labour Market and National Accounts requirements.
- Providing a technical lead across Data Architecture Division in developing and applying statistical and data science methods to operational data in developing new products from operational data.
- Provide a link to Methodology Division, Data Science Campus and technical specialists across government in identifying best practice, ensuring it is embedded within Data Architecture products and outputs.
- Coding and specifying the business logic used to enable inclusion of different administrative datasets into key Data Architecture geospatial, economic and reference data products. This also includes identifying and integrating the most appropriate statistical and data science methods where required in development of said products.
- Matrix-manage teams of HO and SO-level technical staff (from across data Architecture) dedicated to a range of internal and external (data) product development projects spanning Data Architecture, the Wider Division and cross-ONS.
- Alongside ONS Methodology Division, provide the lead in ensuring full methodological and analytical support is provided to underpin the running and maintenance of a number mature Data Architecture data products.
You will work with:
- Relevant experts in ONS Big Data, Methodology Division, Data Science Campus, Census, National Accounts, Labour Market Division, Methodology, Digital Services & Technology and Data as a Service, establishing and agreeing approaches for preparing operational/externally acquired data for key Data Architecture products.
- Analysts in external supplier departments to identify key administrative data issues that will impact on ONS use of key strategic administrative data feeds.
- Analysts and data specialists across ONS responsible for integrating Operational data into core ONS products.
- Key technical leads both across ONS and wider government to in identifying technical best practice, ensuring it is embedded within Data Architecture products and outputs.
- Able to work with a range of stakeholders and collaborate across teams to ensure a quality outcome;
- Adopts a positive approach, act promptly to reassess workloads and priorities when there are conflicting demands;
- Can take the initiative to suggest ideas to make improvements particularly where technical issues arise;
- Ensures communication has a clear purpose and takes into account people’s individual needs.
- Intermediate to advanced understanding SQL and python particularly in the context of Spark and other Cloudera tools;
- Experience developing and analysing government operational data or big data in a commercial context;
- Familiarity with data tools such as Spark and Hadoop User Experience tool (i.e. Hive and Impala);
- Well-versed in established data science, statistical techniques and theory; and/or qualified in relevant degree or higher;
- Demonstrable experience of validating, structuring and linking across very large administrative datasets; large-scale survey datasets or Census data;
- Familiarity with common data engineering practices used with ‘big data’.
We'll assess you against these behaviours during the selection process:
- Changing and Improving
- Developing Self and Others
- Communicating and Influencing
- Delivering at Pace
Whether you are hearing about us for the first time or already know a bit about our organisation, we hope that the benefits pack attached (bottom of page)
will give you a great insight into the benefits and facilities available to our employees, and our fantastic working culture.
We are an organisation that takes the well-being of its employees seriously and lives and breathes the desire to modernise the workplace of the future. Everyone, from our office-based staff in Newport, London and Titchfield, to our field interviewers and airports and ports passenger survey staff, are part of a diverse and inclusive family.