GPU & LLM Cluster
High-performance compute, built for Digital Humanities
Scale up what already runs on your laptop. Train and fine-tune models, run interactive GPU notebooks, and call a hosted OpenAI-compatible LLM and embedding API — without leaving the academic network.
What you can do — and how you get in
Three entry points, from a raw shell to a one-line API call. Pick the one that fits how you work.
Shell & SLURM
SSH in to run batch training and multi-GPU / multi-node fine-tuning over InfiniBand, bring your own containers (Apptainer), and use project storage. Full control for power users.
JupyterHub notebooks
Log in through the browser with single sign-on, pick your resources, and your GPU-backed notebook runs as a scheduled job — no terminal required.
LLM & embedding API
Call OpenAI-compatible LLM and embedding endpoints with a project key over HTTPS — for chat, RAG and semantic search. No cluster account needed.
How it differs from generic HPC
Interactive & API access
Not batch-only. Notebooks and hosted APIs alongside SLURM — we try to outperform generic HPC for DH needs.
Hosted model APIs
A ready LLM and embedding API with project keys — something a classic HPC allocation does not give you.
Part of Austria's HPC ecosystem
DHInfra.at is one piece of a wider landscape, and we would rather connect to it than reinvent it. Austria already has excellent shared supercomputing through Austrian Scientific Computing (ASC) and the Vienna Scientific Cluster (VSC), and most major universities — including ours — hold partnerships that make these systems available to their researchers, often at no cost. Where your work fits there, we are happy to point you to it.
The skills carry over. If you are comfortable with a SLURM batch job or a JupyterHub session on one cluster, you are most of the way to using ours — and to using national and European systems. So for the fundamentals we gladly lean on the training material and events these centres already maintain, and focus our own effort on what is specific to DHInfra.at: interactive, DH-first, API-driven use.
We are in close contact with these partners in any case. EuroCC Austria and ASC have lent their expertise to the planning of our cluster and to parts of its operation (for example authentication), and we try to give something back by acting as a low-threshold, DH-oriented entry point into this ecosystem.
Hardware at a glance
The cluster at the University of Graz, with a closely related system in Krems. Figures are indicative.
| GPUs | Memory | Typical use |
|---|---|---|
| 12× NVIDIA H200 | 141 GB each | Large-model training, SOTA LLM serving |
| 11× NVIDIA RTX PRO 6000 | 96 GB each | Medium training, notebooks, model serving |
| 4× NVIDIA L40S | 48 GB each | Embeddings, lightweight inference, courses |
Storage is tiered: very fast NVMe right next to the GPUs (served over RDMA InfiniBand) to keep training and inference efficient, plus larger, slower capacity for everything else — including data staging and short- to mid-term project storage. Across the sites that is at least 100 TB of fast and 500 TB of slower storage. Everything runs on the Austrian academic network (ACOnet).
Getting access
1. Authenticate
Log in with your university credentials over the academic network (ACOnet single sign-on).
2. Apply for a project
Submit a short project application — research, course, thesis, event, or exploratory use.
3. Get approved
On approval, SSH, the SLURM scheduler, storage and the model APIs are unlocked for your project.