Dear Experts,
Recently we encountered a problem when running parallel ATK calculations. When a job does not finish successfully (it is killed or exits because of the time limit), it may leave all of its processes running, which overloads the CPUs for the next job, whether or not that job is an ATK job.
Could you please give us some advice on how to fix this problem? Thank you very much!
(Please find the job information in the attachment. An ATK job was killed due to a convergence issue, and then a VASP job started.)
Best regards,
Qiang Fu