applications/system

minicondor - Configuration for a single-node HTCondor

Website: https://htcondor.org/
License: ASL 2.0
Description:
This example configuration is good for trying out HTCondor for the first time.
It only configures the IPv4 loopback address, turns on basic security, and
shortens many timers to be more responsive.

Packages

minicondor-23.0.14-1.el9.x86_64 [24 KiB] Changelog by Tim Theisen (2024-08-08):
- Docker and Container jobs run on EPs that match AP's CPU architecture
- Fixed premature cleanup of credentials by the condor_credd
- Fixed bug where a malformed SciToken could cause a condor_schedd crash
- Fixed crash in condor_annex script
- Fixed daemon crash after IDTOKEN request is approved by the collector
minicondor-23.0.12-1.el9.x86_64 [25 KiB] Changelog by Tim Theisen (2024-06-13):
- Remote condor_history queries now work the same as local queries
- Improve error handling when submitting to a remote scheduler via ssh
- Fix bug on Windows where condor_procd may crash when suspending a job
- Fix Python binding crash when submitting a DAG which has empty lines
minicondor-23.0.10-1.el9.x86_64 [25 KiB] Changelog by Tim Theisen (2024-05-09):
- Preliminary support for Ubuntu 22.04 (Noble Numbat)
- Warns about deprecated multiple queue statements in a submit file
- Fix bug where plugins could not signify to retry a file transfer
- The condor_upgrade_check script checks for proper token file permissions
- Fix bug where the condor_upgrade_check script crashes on older platforms
- The bundled version of apptainer was moved to libexec in the tarball
minicondor-23.0.8-1.el9.x86_64 [25 KiB] Changelog by Tim Theisen (2024-04-11):
- Fix bug where ssh-agent processes were leaked with grid universe jobs
- Fix DAGMan crash when a provisioner node was given a parent
- Fix bug that prevented use of "ftp:" URLs in file transfer
- Fix bug where jobs that matched an offline slot never start
minicondor-23.0.6-1.el9.x86_64 [27 KiB] Changelog by Tim Theisen (2024-03-14):
- Fix DAGMan where descendants of removed retry-able jobs are marked futile
- Ensure the condor_test_token works correctly when invoked as root
- Fix bug where empty multi-line values could cause a crash
- condor_qusers returns proper exit code for errors in formatting options
- Fix crash in job router when a job transform is missing an argument
minicondor-23.0.4-1.el9.x86_64 [27 KiB] Changelog by Tim Theisen (2024-02-08):
- NVIDIA_VISIBLE_DEVICES environment variable lists full uuid of slot GPUs
- Fix problem where some container jobs would see GPUs not assigned to them
- Restore condor keyboard monitoring that was broken since HTCondor 23.0.0
- In condor_adstash, the search engine timeouts now apply to all operations
- Ensure the prerequisite perl modules are installed for condor_gather_info
minicondor-23.0.3-1.el9.x86_64 [27 KiB] Changelog by Tim Theisen (2024-01-04):
- Preliminary support for openSUSE LEAP 15
- All non-zero exit values from file transfer plugins are now errors
- Fix crash in Python bindings when job submission fails
- Chirp uses a 5120 byte buffer and errors out for bigger messages
- condor_adstash now recognizes GPU usage values as floating point numbers
minicondor-23.0.2-1.el9.x86_64 [28 KiB] Changelog by Tim Theisen (2023-11-20):
- Fix bug where OIDC login information was missing when submitting jobs
- Improved sandbox and ssh-agent clean up for batch grid universe jobs
- Fix bug where daemons with a private network address couldn't communicate
- Fix cgroup v2 memory enforcement for custom configurations
- Add DISABLE_SWAP_FOR_JOB support on cgroup v2 systems
- Fix log rotation for OAuth and Vault credmon daemons
minicondor-23.0.1-1.el9.x86_64 [28 KiB] Changelog by Tim Theisen (2023-10-31):
- Fix 10.6.0 bug that broke PID namespaces
- Fix bug where execution times for ARC CE jobs were 60 times too large
- Fix bug where a failed 'Service' node would crash DAGMan
- Condor-C and Job Router jobs now get resources provisioned updates
minicondor-23.0.0-1.el9.x86_64 [28 KiB] Changelog by Tim Theisen (2023-09-29):
- Absent slot configuration, execution points will use a partitionable slot
- Linux cgroups enforce maximum memory utilization by default
- Can now define DAGMan save points to be able to rerun DAGs from there
- Much better control over environment variables when using DAGMan
- Administrators can enable and disable job submission for a specific user
- Can set a minimum number of CPUs allocated to a user
- condor_status -gpus shows nodes with GPUs and the GPU properties
- condor_status -compact shows a row for each slot type
- Container images may now be transferred via a file transfer plugin
- Support for Enterprise Linux 9, Amazon Linux 2023, and Debian 12
- Can write job information in AP history file for every execution attempt
- Can run defrag daemons with different policies on distinct sets of nodes
- Add condor_test_token tool to generate a short lived SciToken for testing
- The job’s executable is no longer renamed to ‘condor_exec.exe’

Listing created by Repoview-0.6.6-4.el7