DTE Infrastructure Component

FTS3

Federated Data Infrastructure

Description

The software underlying a service responsible for globally distributing the majority of the LHC data across the WLCG infrastructure.

FTS3 is a low-level data movement service, responsible for reliable bulk transfer of files from one site to another while allowing participating sites to control the network resource usage.

Transferring large volumes of data between facilities requires a component that manages those transfers: monitoring their progress, cancelling transfers that have stalled or that take too long, and retrying failed transfers (where appropriate). Deploying FTS provides a common service for handling such transfers, allowing higher-level data management; e.g., bandwidth shaping links between facilities.

Release Notes

This release represents a technical preview of the interTwin federated data management solution.

There are two external software components: FTS and Rucio.  They are fully established projects, independent of the interTwin project.  The software is production-ready, at TRL 9, and hardened with many years of production-critical use.  Both projects have multiple deployments of their software, operated by different communities.

The ALISE software is currently in a development phase, under the aegis of interTwin.  At the time of release, ALISE is TRL 4.  The user-facing functionality of ALISE is mostly feature-complete; however, anticipated changes to the API imply that the necessary integration work (whereby a service uses ALISE to identify a user) should be considered experimental. Feedback from early adopters is encouraged, but any plans to deploy ALISE should be tempered by the anticipated changes to the API.

The teapot software is also currently in a development phase, within the interTwin project.  Teapot is TRL 3–4.  The current release is sufficient to build a limited proof-of-concept demonstration, supporting the data transfer requirements of a single user.

Future Plans

A number of improvements are planned for teapot. Teapot will be updated to support multiple, concurrent users. The per-user WebDAV instance management will be automated, starting new services on demand, and terminating them if there is sufficient idle time. It will also be updated to integrate with ALISE, to support automated identity management.

For ALISE, we anticipate possible improvements and stabilisation of the service-integration API, based on experience gained from integrating ALISE into various services. In addition, we plan to add support for client authentication in future versions of ALISE. This will limit access to the identity mapping information, providing this information to authorised services only.

There are currently no immediate plans to improve FTS. Future experience from integrating the testbed with the interTwin service providers may identify necessary enhancements to FTS.

Plan to enhance Rucio by improving support for integrating Rucio with external catalogues; e.g., Lattice QCD’s catalogue.  Other plans to enhance Rucio may emerge from our experience when integrating Rucio with DTE core service, DTE thematic modules, and with the science use cases.

 

Target Audience
+

FTS is a core component that most users do not access directly.  Instead, FTS is used by any DT user or manager who is (directly or otherwise) managing data locality.

FTS is used when a DT user or manager needs some data to be located at some facility and that data is not currently available.

License
+

Apache 2.0

Created by
+

Related components

Infrastructure Component: Teapot

An easy-to-install edge service that a facility may deploy to provide remote access to their...

February 15, 2024
Infrastructure Component: ALISE

A tool for linking a user’s federated identity with their facility account.

...
February 15, 2024
Infrastructure Component: RUCIO

Providing a service that manages data locality. It provides a scalable solution for managing...

February 15, 2024