The goal of our work is to provide a cloud infrastructure for research groups in the context of the Cluster of Excellence "Machine Learning", the Tübingen AI Center and the Cyber Valley Initiative, and to create a platform that enables collaborative work, high-performance computing and modern data management.
Our main task is to set up, maintain and expand the basic hardware and software structure of the cloud, as well as all additional components. We operate the hardware in a dedicated server room, provide suitable applications and other software solutions, and offer technical support.
The core of the infrastructure is the Slurm resource manager. Computing power is provided by both high-performance GPUs and powerful CPU compute nodes. The ML Cloud currently provides compute nodes with NVIDIA 2080 Ti, A100 and H100 GPUs.