[Hpc-notice] Issue with Slurm submitting new jobs
Casey Mc Laughlin
cmclaughlin at fsu.edu
Mon Apr 12 11:48:14 EDT 2021
Hi HPC users,
The Slurm scheduler is once again accepting job submissions. Thanks for your patience.
Going forward, if you notice anything when running Slurm commands, please let us know (support at rcc.fsu.edu).
Best regards,
The RCC Team
________________________________
From: Casey Mc Laughlin
Sent: Monday, April 12, 2021 10:14 AM
To: JESfwd-hpc-notice <hpc-notice at lists.fsu.edu>
Cc: JESfwd-hpc-staff <hpc-staff at lists.fsu.edu>
Subject: Issue with Slurm submitting new jobs
Hi HPC Users,
We are currently experiencing an issue with our scheduler software, Slurm. The issue affects submitting jobs. When you run a Slurm job submission command (sbatch or srun), you'll see this error message:
$ sbatch slurm-submit.sh
sbatch: error: Batch job submission failed: Socket timed out on send/recv operation
The Systems Team is investigating right now, and we expect to have a solution shortly.
As far as we are aware, the issue does not affect currently running jobs.
Thanks for your patience while we sort this issue out.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.fsu.edu/pipermail/hpc-notice/attachments/20210412/40a2b1ce/attachment.html>
More information about the Hpc-notice
mailing list