Technical Architect/Developer (ETL, AWS, RDF, SPARQL)

  • Location:

    Surrey, England

  • Sector:


  • Job type:


  • Salary:

    £700.00 - £800.00 per day

  • Job ref:


  • Published:

    about 1 month ago

  • Duration:

    6 Months

  • Start date:


  • Client:

    Undisclosed Company

Technical Architect/Developer (ETL, AWS, RDF, SPARQL, Elasticsearch)

Leading the build of an alpha system for managing catalogue data using RDF

After a successful discovery, our client now need an expert Technical Architect and Lead Developer to help us implement an alpha of a new pan archival catalogue, developing a solution meet the users' needs, using AWS Neptune and Elasticsearch.

Work so far

Our client has run a discovery phase leading to plans for an alpha using AWS Neptune and Elasticsearch.

The discovery produced a proposal for a new Catalogue Data Model using RDF, a new identifier scheme, and transformation routines for the existing data to the new model. We have held workshops identifying the keyways that staff managing the catalogue work with the data and what they would like in future. The archivist needs to search, analyse, add to, correct, edit, enrich, and enhance record descriptions so that the catalogue is properly maintained. The archivist needs to work with catalogue entries individually or as large sets, making (or reversing) bulk changes, so they can work efficiently. The archivists need to understand the version history of the catalogue so they can be confident about where the information has originated.

We have investigated all the current databases that hold catalogue data and how they inter-relate. We have investigated a wide range of existing data standards and ontologies. We have documented all the findings in a detailed published report.

The Project

Our client are developing a pan-archival catalogue, bringing together record descriptions from multiple catalogues into a single new system. We are looking for a technical architect and developer to lead the development work on an alpha catalogue management system.

This work will involve developing API functions to search, select, add, export, edit, import and delete catalogue data; developing search for use by expert users (using SPARQL in combination with Elasticsearch); developing an Extract, Transform, Load process to migrate The National Archives catalogue data from multiple relational database (SQL Server) and RDF databases to a cloud based native RDF database (AWS Neptune).

Key Skills / Experience

  • Ability to understand business needs and priorities and apply them to project planning
  • Excellent communication skills, translating technical relating to software engineering, delivery management and service management so that they are understood by all.
  • Strategic overview of standards and technology, with the capacity of turning business problems into technical design.
  • Have experience with using standards-based ontologies/vocabularies, such as W3C PROV data model, Dublin Core and W3C ODRL
  • Have experience of working with RDF databases and SPARQL, for example AWS Neptune
  • Have experience, knowledge and understanding of Extract, Transform, Load (ETL) processes
  • Have experience, knowledge and understanding of create resilient and secure systems using IAM in a cloud context.
  • Have experience developing a user interface/front end to support non-expert, editorial engagement with RDF
  • Understanding of informational architecture in an archival context
Apply Save job Create Job Alerts

Share this job