FARSCAPE WS1 Data Systems

Europe/London
Stanislas Pamela (UKAEA), Tom Farmer (Research Data Management)
Description

FARSCAPE Workstream 1 fortnightly meetings


This meeting is intended to be comprehensive strategy session focusing on the critical aspects of data management. This meeting aims to bring together key stakeholders and experts to discuss, evaluate, and enhance our data management practices.

The sessions plan to cover

  • Metadata Management
    Understanding the role of metadata in improving data discoverability, quality, and governance. We will discuss best practices for creating and maintaining metadata standards.

  • Data Catalogues
    Exploring the implementation and benefits of data catalogues in organizing and centralizing data assets. We'll review tools and strategies for building an effective data catalogue.

  • Data Archiving
    Evaluating our current data archiving practices and identifying opportunities for improvement. We'll discuss policies for long-term data retention, compliance, and cost-effective storage solutions.

  • Data Pipelines
    Reviewing the design, development, and maintenance of data pipelines to ensure efficient data flow from ingestion to analysis. We'll examine tools and technologies that can optimize our data processing workflows.

Representatives from the groups and experiments at UKAEA will have an opportunity to discuss their data management process and how it can be improved. Initially, facilities/groups that will be represented are:

  • HIVE,
  • MRF,
  • ELSA,
  • CHIMERA/PEGASUS

Agenda + Minutes


Present: AP, TF, ... and friends

There are minutes attached to this event. Show them.
    • 11:00 11:10
      Introduction 10m
      Speaker: Tom Farmer (Research Data Management)
    • 11:10 11:20
      Metadata Schemas 10m

      An update on progress developing metadata schemas. This includes high level schemas, as well as those relevant to specific facilities.

      It will highlight the remaining schemas which need to be specified for HIVE before September.

      Speaker: Adam Parker (High Performance Data Analytics)
    • 11:20 11:30
      Metadata Ingestion Implementations 10m

      An update on progress developing a specific implementations of metadata ingestion. This includes development of:
      - A UI for manual metadata ingestion for HIVE
      - Plugins for automated metadata extraction and ingestion from files generated in data capture, processing or analysis.

      Speakers: Bhargav Garikipati, Ajay Rawat
    • 11:30 11:40
      Metadata Ingestion System 10m

      An update on development of the metadata ingestion system. This includes:
      - The client implementation which allows specific ingestion plugins to be developed.
      - The metadata schema server (metacat).

      Speaker: Derek Sandiford (Research Data Management)
    • 11:40 11:50
      Data Pipelines 10m

      Update on the progress of data pipeline work.

      It will include a discussion of the remaining steps to be completed for HIVE before September.

      Speaker: Kirill Palamartchouk (Research Data Management)
    • 11:50 11:55
      Data Catalogue 5m

      Update on the progress of implementing a production grade instance of SciCat.

      Speaker: Tom Farmer (Research Data Management)
    • 11:55 12:00
      AOB and Review of New Actions 5m

      Review of additional actions which have arisen during the meeting

      Speaker: Tom Farmer (Research Data Management)