ClusterCockpit

Job-specific Performance and Energy Monitoring for HPC Clusters

ClusterCockpit Job list
ClusterCockpit is a framework for job-specific performance and energy monitoring on HPC clusters, with a focus on simple installation and maintenance, high security, and intuitive usage.

Scalable

Supports multiple clusters in one web interface. Scales to thousands of nodes and millions of jobs. Supports heterogeneous clusters with and without node sharing.

Slurm integration

Comes with a ready-to-use Slurm integration. Can be integrated with any batch job scheduler via a REST API.

Modern Web UI

Responsive web interface with job-specific, node, user, and system views. Provides metric plots, aggregate statistics, roofline plots, a resource table, and access to the job script. Fully user-configurable.

Global access to metric data

Access to a configurable set of metrics, including hardware performance counter data. Comes with a powerful HPC-centric node agent, but can also be integrated with other node agent solutions.

Authentication methods

Supports local accounts, LDAP, and Keycloak OpenID Connect. Can be integrated with existing user portals using JWT-based authentication.

User roles

Supports roles for users, project managers, support personnel, and administrators. Users can only see their own jobs.
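
The visibility rule can be illustrated with a small Python sketch. Only the rule that users see their own jobs comes from the description above; the `Role`, `Job`, and `Viewer` types, the `visible_jobs` helper, and the policies for managers, support personnel, and administrators are hypothetical assumptions, not ClusterCockpit's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum


class Role(Enum):
    USER = "user"
    MANAGER = "manager"
    SUPPORT = "support"
    ADMIN = "admin"


@dataclass(frozen=True)
class Job:
    job_id: int
    user: str
    project: str


@dataclass(frozen=True)
class Viewer:
    name: str
    role: Role
    # Assumption: a project manager is associated with a set of projects.
    projects: frozenset = frozenset()


def visible_jobs(viewer, jobs):
    """Return the jobs a viewer may see under the assumed policy."""
    # Assumption: support personnel and administrators see every job.
    if viewer.role in (Role.SUPPORT, Role.ADMIN):
        return list(jobs)
    # Assumption: managers see their own jobs plus jobs of their projects.
    if viewer.role is Role.MANAGER:
        return [j for j in jobs
                if j.user == viewer.name or j.project in viewer.projects]
    # From the text: regular users can only see their own jobs.
    return [j for j in jobs if j.user == viewer.name]
```

With three jobs owned by `alice`, `bob`, and `carol`, calling `visible_jobs(Viewer("alice", Role.USER), jobs)` would return only the job owned by `alice`.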

Job sorting and filtering

Sort jobs by any job metadata attribute. Filter by job metadata and aggregate metric data attributes.
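
As a rough illustration of the idea, the following Python sketch filters job records by inclusive ranges and sorts them by one attribute. The record fields (`num_nodes`, `duration_s`, `avg_flops`) and the helpers `filter_jobs` and `sort_jobs` are made up for this example and do not reflect ClusterCockpit's data model or API.

```python
from operator import itemgetter

# Hypothetical job records: metadata plus one aggregate metric value.
SAMPLE_JOBS = [
    {"job_id": 101, "user": "alice", "num_nodes": 4, "duration_s": 3600, "avg_flops": 120.0},
    {"job_id": 102, "user": "bob", "num_nodes": 1, "duration_s": 600, "avg_flops": 15.5},
    {"job_id": 103, "user": "alice", "num_nodes": 8, "duration_s": 7200, "avg_flops": 310.2},
    {"job_id": 104, "user": "carol", "num_nodes": 2, "duration_s": 1800, "avg_flops": 48.0},
]


def filter_jobs(jobs, **ranges):
    """Keep jobs whose attributes lie inside inclusive (low, high) ranges."""
    def matches(job):
        return all(low <= job[key] <= high
                   for key, (low, high) in ranges.items())
    return [job for job in jobs if matches(job)]


def sort_jobs(jobs, key, descending=False):
    """Sort jobs by a single attribute."""
    return sorted(jobs, key=itemgetter(key), reverse=descending)


# Jobs using 2 to 8 nodes, longest running first.
selected = sort_jobs(filter_jobs(SAMPLE_JOBS, num_nodes=(2, 8)),
                     key="duration_s", descending=True)
```

The same `filter_jobs` call accepts a metric range such as `avg_flops=(40, 200)`, so metadata and aggregate metric filters can be combined in one query.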

Unified search bar

A unified search bar lets you search for job IDs, job names, project IDs, usernames, and names.

Job tagging

Jobs can be tagged. Job tags are grouped by type, and their visibility is configurable through a scope attribute.
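
One way to picture typed tags with a scope is the Python sketch below. The `Tag` type, the helpers, and in particular the scope values (`"global"`, `"admin"`, or a username for a private tag) are assumptions chosen for illustration; the actual scope semantics may differ.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Tag:
    type: str
    name: str
    # Assumed scope values: "global", "admin", or a username (private tag).
    scope: str


def group_by_type(tags):
    """Group tag names by their type, preserving input order."""
    grouped = defaultdict(list)
    for tag in tags:
        grouped[tag.type].append(tag.name)
    return dict(grouped)


def visible_tags(tags, username, is_admin=False):
    """Return the tags a user may see under the assumed scope rules."""
    return [
        tag for tag in tags
        if tag.scope == "global"
        or (tag.scope == "admin" and is_admin)
        or tag.scope == username
    ]
```

Under these assumptions, a tag with scope `"admin"` stays hidden from regular users, while a tag whose scope equals a username is visible only to that user.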
