Author Topic: Job termination problem during HSE calculation  (Read 38 times)

0 Members and 1 Guest are viewing this topic.

Offline dua.harkishan@yahoo.com

  • New ATK user
  • *
  • Posts: 1
  • Country: in
  • Reputation: 0
    • View Profile
Job termination problem during HSE calculation
« on: November 22, 2021, 12:52 »
I am trying to optimize one nanotube having 182 atoms using HSE functional with the help of LCAO calculator. Initially, I have submitted the job using m_file with two nodes (8 proc each. One node having 128 GB RAM and other node having 96 GB RAM). However, the job terminated with the following message "=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES=   PID 244987 RUNNING AT compute-0-1 =   EXIT CODE: 9 =   CLEANING UP REMAINING PROCESSES=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES". After that we have submitted the Job using four nodes with four processor each and RAMs are 128 GB, 96 GB, 64 GB and 64 GB and this time also the job terminated with the same error. Next, I have submitted the using three nodes except compute-0-1 and this time the job stopped without showing any error. I am attaching all the log files for your reference. Kindly help me in this regard.

Offline filipr

  • Regular ATK user
  • **
  • Posts: 15
  • Country: dk
  • Reputation: 3
  • QuantumATK developer
    • View Profile
Re: Job termination problem during HSE calculation
« Reply #1 on: November 23, 2021, 08:18 »
Exit Code 9 from an MPI program means that the program was killed by the host system.

It could be that the job scheduling system on your cluster killed the job because it used too much memory or was taking longer than the requested wall time allocation. See if you got a mail from the queuing system or ask you system administrator.