Updated 17/03/2025
DTE Infrastructure Component

Rucio JupyterLab extension

Federated Data Infrastructure

Description

Exposes selected Rucio capabilities inside JupyterLab, such as interactive data browsing and triggering the replication of data to the storage volume attached to the Jupyter server.

The Rucio-JupyterLab extension is an optional service that may be deployed to augment a JupyterLab service with support for data management within the Data Lake. It allows the JupyterLab user to discover data files and datasets, to query whether that data is currently available within a JupyterLab instance, and to request that the data be made available on a Jupyter-accessible storage volume that is also part of the Data Lake.
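The discover, query and request steps described above can be illustrated with a small, purely hypothetical in-memory model. This is a sketch only: `ToyDataLake`, its methods and the identifiers `demo:file-a` and `JUPYTER-STORAGE` are invented for illustration and are not part of Rucio or of the extension, which handle these steps through the graphical interface and real, asynchronous transfers.

```python
class ToyDataLake:
    """Hypothetical stand-in for a data lake; NOT the Rucio API."""

    def __init__(self, replicas):
        # Maps each data identifier to the set of storage volumes holding a copy.
        self.replicas = {did: set(sites) for did, sites in replicas.items()}
        # Replication requests that have been accepted but not yet completed.
        self.pending = set()

    def discover(self, scope):
        """List the identifiers known to the lake under a given scope."""
        prefix = scope + ":"
        return sorted(did for did in self.replicas if did.startswith(prefix))

    def is_available(self, did, storage):
        """Check whether a copy already exists on the given storage volume."""
        return storage in self.replicas[did]

    def request_replica(self, did, storage):
        """Ask for a copy on the storage volume; report the resulting state."""
        if self.is_available(did, storage):
            return "available"
        self.pending.add((did, storage))
        return "replicating"

    def complete_transfers(self):
        """Simulate the pending transfers finishing (assumed asynchronous in practice)."""
        for did, storage in self.pending:
            self.replicas[did].add(storage)
        self.pending.clear()


lake = ToyDataLake({
    "demo:file-a": {"SITE-A"},
    "demo:file-b": {"JUPYTER-STORAGE"},
})
print(lake.discover("demo"))                                # ['demo:file-a', 'demo:file-b']
print(lake.request_replica("demo:file-a", "JUPYTER-STORAGE"))  # replicating
lake.complete_transfers()
print(lake.is_available("demo:file-a", "JUPYTER-STORAGE"))  # True
```

In this toy model, a file that is already present on the Jupyter-accessible volume is reported as available immediately, while any other file only becomes available once the simulated transfer has completed.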

For the user, this provides an easy-to-use graphical interface for understanding and manipulating the Data Lake, conveniently placed within their existing Jupyter environment. Once a Jupyter session is spawned, the extension is available in the left sidebar.

Target Audience

In principle, all DT users who manage their data through the Data Lake concept are using Rucio. Depending on the use case, DT users may interact with Rucio directly or through some intermediate service.

License

Apache 2.0

Created by

Release Notes

The Rucio-JupyterLab extension was originally developed in the ESCAPE project and further enhanced within the interTwin project. The software is now at TRL 7. Future development will continue within CERN’s Rucio community.

Future Plans

The Rucio-JupyterLab extension is now a supported component, managed within the Rucio project. The Rucio project has a well-established support process that is science-agnostic and community-driven. Deployments of the interTwin DTE blueprint that take advantage of this extension will be supported through the Rucio project.