[Hpc-notice] Slurm scheduler issues
Casey Mc Laughlin
cmclaughlin at fsu.edu
Fri Sep 25 11:57:23 EDT 2020
Hi HPC Users,
This morning, our Systems Team updated the job scheduler (Slurm) to fix an ongoing authentication issue.
This change affected job submissions and most jobs that were already running. If you had any jobs pending or running as of this morning, we advise you to log in, check on them, and resubmit them if necessary.
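As a rough illustration of the check described above, here is a minimal Python sketch that lists your queued jobs and their states. It assumes the standard Slurm `squeue` command is on your PATH; the function names `parse_squeue` and `my_jobs` are hypothetical and not part of any RCC tooling.

```python
import getpass
import subprocess


def parse_squeue(text):
    """Turn 'JOBID STATE' lines into a {job_id: state} dictionary."""
    jobs = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) >= 2:
            jobs[parts[0]] = parts[1]
    return jobs


def my_jobs(user=None):
    """Ask squeue for the given user's jobs (defaults to the current user)."""
    user = user or getpass.getuser()
    result = subprocess.run(
        # %i is the job ID and %T is the job state in squeue's format syntax.
        ["squeue", "--noheader", "--user", user, "--format", "%i %T"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_squeue(result.stdout)
```

A job you expected to see that is missing from the result has left the queue and may need to be resubmitted.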
Additionally, you may see an error message similar to the one below when you submit a job to the scheduler. If you do, wait a few moments and resubmit:
srun: job 4603 queued and waiting for resources
srun: error: Security violation, slurm message from uid 309
srun: error: Security violation, slurm message from uid 309
srun: error: Job allocation 4603 has been revoked
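The "wait a few moments and resubmit" advice can be sketched as a small retry loop. This is only an illustration under stated assumptions: the helper names, the retry count, and the 30-second delay are hypothetical choices, and the loop retries only when the output contains the "Security violation" text shown above.

```python
import subprocess
import time

# Text taken from the error message shown in this notice.
RETRY_MARKER = "Security violation"


def command_runner(argv):
    """Return a function that runs argv and gives back its combined output."""
    def run():
        result = subprocess.run(argv, capture_output=True, text=True)
        return result.stdout + result.stderr
    return run


def submit_with_retry(run, attempts=5, delay_seconds=30.0, sleep=time.sleep):
    """Call run() until its output no longer contains RETRY_MARKER.

    The attempts and delay_seconds defaults are illustrative assumptions,
    not values recommended by the RCC.
    """
    output = ""
    for attempt in range(1, attempts + 1):
        output = run()
        if RETRY_MARKER not in output:
            return output
        if attempt < attempts:
            sleep(delay_seconds)  # wait a few moments before resubmitting
    raise RuntimeError("still failing after %d attempts:\n%s" % (attempts, output))


# Example (job.sh is a hypothetical batch script):
#   submit_with_retry(command_runner(["sbatch", "job.sh"]))
```

Failures that do not mention the security violation are returned as-is rather than retried, so a job that fails for an unrelated reason is not resubmitted repeatedly.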
We apologize for any disruption this may have caused to you and your research. If you need assistance, please submit a support ticket: support at rcc.fsu.edu.
Best regards,
The RCC Team