Fusion of Talent: Celebrating the Many Roles of Women in Computing

Name: Fusion of Talent: Celebrating the Many Roles of Women in Computing
Start: 2025-11-04T08:30:00+00:00
End: 2025-11-04T19:30:00+00:00
Location: No location set

4 November 2025

Europe/London timezone

Contact

Towards a Unified Lakehouse Platform for PSDI

4 Nov 2025, 13:40

Balcony (Conference Centre)

Balcony

Conference Centre

Poster Poster Poster Session

Amali Pawula Hewage (UKRI - STFC)

PSDI is the UK's nationally funded programme that provides tools and services to help researchers in the physical sciences find, share, and process data, with the explicit aim of accelerating scientific discovery and innovation. In PSDI, we work with diverse data from various sources. One of the key challenges we face is managing big data while maintaining flexibility in handling both raw and complex data in low-cost storage, and addressing issues related to data governance, performance, and consistency. To truly empower the scientific community, this data must be usable for both analytics and cutting-edge AI/ML applications.

To tackle this, we will design and build a ‘data lakehouse’ on low-cost object storage. This architecture combines the flexibility of a data lake with the transactional consistency and query performance of a data warehouse. Raw datasets from different data sources will be ingested into object storage and transformed into a common, open format like Apache Parquet, enabling efficient analytics. These datasets will then be registered as Apache Iceberg tables in a metadata catalog (e.g., Lakekeeper or Apache Polaris) to manage schema and ensure consistency. By providing a unified, governed platform with a powerful query engine (e.g., Trino, DuckDB, or Spark), this lakehouse will make diverse data more Findable, Accessible, Interoperable, and Reusable (FAIR). Ultimately, this work will make it easier for the scientific community to exploit this data for new insights and discoveries.

References:

Amali Pawula Hewage (UKRI - STFC)

Dr Alexander Belozerov Dr Tom Underwood Dr Vasily Bunakov

There are no materials yet.

Fusion of Talent: Celebrating the Many Roles of Women in Computing

Contact

Towards a Unified Lakehouse Platform for PSDI

Balcony

Conference Centre

Speaker

Description

Author

Co-authors

Presentation materials

Choose timezone

Fusion of Talent: Celebrating the Many Roles of Women in Computing

Contact

Speaker

Description

Author

Co-authors

Presentation materials