Computation resources architecture

Below is the architecture model of our computation and storage resources. HPC architecture

Every LBT computing resources (HPC, storage and IS clusters) are internally designed and built, and based on open-source software and solutions.

Currently in the cluster room we have 1* HPC clusters (a.k.a. Supercomputer), called Baal (Lucifer for the standby login server), which is composed as follows:

* the 2 first (Lucifer and Hades) have been completely dismantled.

Because it will not do any good to compute without storage capacity, following storage volumes are also available:

* maybe more considering ZFS compression and deduplication.

Some other disk spaces are not listed above because they are kept in reserve.

Except for /archive/ibpc_team and /scratch volumes, all above-mentioned volumes are replicated and/or distributed on a storage cluster currently composed by 11 servers.

/scratch volume on each node serves only to store temporary computing files. So, because it's cleared every night (deleting old job's directories and all not well-formed directories), you should not try to use it for chaining jobs in a single -but dedicated- scratch directory.

Archive volume (/archive) is not available on computing nodes.

In order to get the best performance as possible, we choose a high throughput and low latency network technology: Infiniband QDR (40Gbs).

Below, the evolution year-per-year of the computing performance and storage:

CPU computing power evolution.GPU-accelerated computing power evolution.Storage evolution