A number of new Qlustar features were completed recently and published
together with the latest security updates.
QluMan 12.0.7.3 highly improves on its slurm management capabilities
basically making the slurm component feature complete. It adds
- Real SLURM QOS MANAGEMENT [1] in the GUI. QoS can now easily be added,
edited, removed and assigned. The full list of QoS properties may be
added/removed and manipulated including sanity checking the values
entered.
- SLURM ASSOCIATION AND LIMITS MANAGEMENT [2] has been added in the GUI
unleashing the power of advanced slurm resource management to QluMan
admins. Various visual filters have been created to achieve a
comprehensive display of slurm associations. The full list of
association settings may be added/removed and manipulated including
sanity checking the values entered.
* An 'LDAP GROUP <-> SLURM ACCOUNT' SYNC MECHANISM [3] has been designed
to automate slurm user setup upon creation of the user in LDAP. Slurm
accounts may be associated with LDAP groups such that depending on
users primary group the corresponding slurm account will be assigned
as their default.
* Cron-based LDAP DB BACKUP was introduced to allow a disaster recovery
in case the LDAP DB becomes corrupted.
Other new features:
- Update to AlmaLinux 8.6
[1] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/components/slu…
[2] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/components/slu…
[3] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/components/slu…
The Qlustar releases 12.0.0.12-b548f1436 and 11.0.1.16-b549f1437 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about security fixes and special
update instructions:
https://qlustar.com/qsa/2022/0624221https://qlustar.com/qsa/2022/0624222
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- Update to AlmaLinux 8.6
- Update to QluMan 12.0.7.3
- Update to MLNX OFED 5.4-3.4.0.0
- Update to Lustre 2.12.9
The Qlustar releases 12.0.0.11-b547f1433 and 11.0.1.15-b543f1435 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about security fixes and special
update instructions:
https://qlustar.com/qsa/2022/0524221https://qlustar.com/qsa/2022/0524222https://qlustar.com/qsa/2022/0524223
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- Update to QluMan 12.0.7.1
- Update to slurm 21.08.8.2
The Qlustar releases 12.0.0.10-b547f1432 and 11.0.1.14-b543f1431 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about security fixes and special
update instructions:
https://qlustar.com/qsa/2022/0413221https://qlustar.com/qsa/2022/0413222
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- Update to Nvidia graphics driver version 510.47.03
A number of new Qlustar features were completed recently and published
together with the latest security updates.
QluMan 12.0.6.0 has the following new capabilities:
- QLUMAN TIMERS [1] present a new Config class assignable to nodes. They
allow for time-based execution of commands on nodes and are easily
configured via the GUI.
- ENHANCEMENTS OF THE SLURM GUI COMPONENT [2]:
* Added on-the-fly job filtering and job grouping.
* Significantly improved mechanism to transfer job data. The QluMan
GUI is now able to smoothly handle thousands of jobs.
- HARDWARE DETECTION INFO for nodes in the GUI Enclosure view [3] now
transparently highlights misconfigurations, helping admins to track
down faulty settings.
- FILTERS IN USER/GROUP MANAGEMENT [4] have been added, making LDAP management
a lot clearer and easier to follow.
Other new features:
- MELLANOX OFED [5] drivers and tools have been included for optimized support
of clusters with newest Infiniband (IB) technology. Qlustar now
automatically detects whether IB adapters supported by the current
Mellanox OFED release are present and activates its drivers if that's
the case. For all other IB hardware, the in-kernel drivers are used.
- GPU DIRECT RDMA [6] support has been added for nodes running Nvidia GPUs
together with IB adapters supported by Mellanox OFED. It provides a
significant decrease in GPU-GPU communication latency and completely
offloads the CPU, removing it from all GPU-GPU communications across
the network.
- The VIRTUALGL image module is now production ready. When used in a
node image, Qlustar can provide 3D remote visualization on nodes with
the required hardware.
[1] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/Config-Classes…
[2] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/components/slu…
[3] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/Adding-Hosts.h…
[4] https://docs.qlustar.com/Qlustar/12.0/ClusterOS/qluman-guide/components/lda…
[5] https://www.mellanox.com/products/infiniband-drivers/linux/mlnx_ofed
[6] https://www.mellanox.com/products/GPUDirect-RDMA
The Qlustar releases 12.0.0.8-b546f1425 and 11.0.1.12-b543f1424 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about security fixes and special
update instructions:
https://qlustar.com/qsa/2021/1221211https://qlustar.com/qsa/2021/1221212
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- QluMan 12.0.6.0 (GUI update to 12.0.6.0 needed as well)
* Major overhaul of slurm component
- Monitor slurm data only when a GUI client needs it
- Remove duplicate job properties before transmitting to client
- Encode run time of jobs so it doesn't need to be transmitted every 5 sec
* Further improve Slurm JobManagement interface
- Add better column titles for new job properties
- Allow grouping of jobs by columns
- Show number of jobs shown after filter / total jobs
- Run time of running jobs now updates every second
* Add reminder about syncing users and groups with nodes upon LDAP changes
* Switch to unlimited lifetime of LDAP certs (previously expired after
1 year). Add option '--update-certs' to qluman-ldap-cli.
* Set DHCP lease time to 1 hour
* EnclosureView: Several fixes/improvements with enclosure handling
* NetworksWidget: Allow routed networks to be master for slave networks
* WriteFiles/ssh
- Drop files in /etc/qlustar/common/image-files and include
authorized_keys in preview
- replace /etc/ssh/ssh_known_hosts + shosts.equiv symlinks with files
on online nodes
* Fix wrong timestamps in message viewer
The Qlustar releases 12.0.0.7-b542f1400 and 11.0.1.11-b543f1399 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about all security fixes and special
update instructions:
https://qlustar.com/qsa/2021/1019211https://qlustar.com/qsa/2021/1019212
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- QluMan 12.0.5.0 (GUI update to 12.0.5.0 needed as well)
* Improve Slurm JobManagement interface
- Add quick-search filter
- Make server <-> client job updates much more efficient
- Improve job filters
- Allow actions (e.g. kill job) on jobs with different states
- Fix jobs not getting removed
* Improve Slurm NodeState Management GUI dialogs
- name enclosures after their hosts
- filter enclosure view items to show only slurm nodes
- fix exception sorting enclosures
* qluman-slurmd: Avoid memory leak in pyslurm creating data classes
* FilesystemExports Widget
- Correct network priorities when changing servers
- fix broken priorities when an entry is displayed
* FilesystemMounts
- select server name for exports in the network of the server with
a gateway
- try to match NIC types for gateways
* Add GlobalProp SearchDomains by default, change seperator to " "
add DNS search domains to /etc/resolv.conf
- Add cleanup for stale unionfs files upon boot
- Update Nvidia driver to 470.74
- slurm module
* Update to 20.02.7
* Fix a memory leak in the job structure
The Qlustar releases 12.0.0.6-b542f1396 and 11.0.1.10-b543f1397 are ready
including a number of security/bug fixes and improvements. Please check
the following web pages for details about all security fixes and special
update instructions:
https://qlustar.com/qsa/2021/0903211https://qlustar.com/qsa/2021/0903212
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- QluMan 12.0.4.8 (GUI update to 12.0.4.8 needed as well)
* NameServiceConfig: remove sssd restart commands
* Indicate authentification method for LDAP users in GUI
* Make NFS v3 the default for NFS mounts
* Fix hostname override for non-primary networks
* Fix slurm account creation
* Fix focus loss in wizards
* Fixes for AD import
* Fix broken Slurm Job FilterEditor
* Fix occasional exceptions in Slurm job updates
* Fix network priority management in FS mounts
The Qlustar releases 12.0.0.5-b542f1391 and 11.0.1.9-b543f1392 are ready
including a number of security/bug fixes and improvements. In
particular, they include a fix for the Sequoia local root exploit bug
https://www.qualys.com/2021/07/20/cve-2021-33909/sequoia-local-privilege-es…
Please check the following web pages for details about all security fixes
and special update instructions:
https://qlustar.com/qsa/2021/0724211https://qlustar.com/qsa/2021/0724212
The following non-security related enhancements/bug fixes are included:
For 12.0 only:
- ntpd is replaced by systemd-timesyncd as the daemon to sync system time
between head-node(s) and netboot nodes.