Author Topic: An error occurred when running a script in parallel  (Read 3776 times)


Offline zhangxuebiao

An error occurred when running a script in parallel
« on: June 10, 2010, 14:07 »
/home/atk/atk-2008.10.0/bin/atk: line 3: 30180 Killed                  LD_LIBRARY_PATH=$EXEC_DIR/../lib $EXEC_DIR/atk_exec $*
rank 0 in job 8  node3_53803   caused collective abort of all ranks


How can I solve this? Can anyone help me?

Offline jdgayles16

Re: An error occurred when running a script in parallel
« Reply #1 on: June 11, 2010, 09:54 »
I had a similar problem. It may be a memory issue, though I'm just guessing.

Offline Anders Blom

  • QuantumATK Staff
Re: An error occurred when running a script in parallel
« Reply #2 on: June 11, 2010, 10:21 »
That's the most likely reason. The error is actually thrown by MPICH2, not ATK, and it basically means that the process on one of the nodes stopped working. Look in the log files (the .o or .e files if you run via qsub) for additional error messages that tell you why the node shut down ATK.
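
As a rough illustration of that advice, here is a minimal Python sketch that searches the batch-system output files in a directory for lines hinting at why a process died. It is not part of ATK or MPICH2. The function name `scan_logs`, the keyword list, and the assumption that the output files match `*.o*` and `*.e*` (as is typical for qsub jobs) are all illustrative choices, so adjust them to your own cluster.

```python
import glob
import os

# Hypothetical keywords that often accompany a killed or crashed process.
# This list is an assumption; extend it with whatever your system prints.
KEYWORDS = ("killed", "out of memory", "memoryerror", "bad_alloc",
            "segmentation fault")


def scan_logs(directory, keywords=KEYWORDS):
    """Return (path, line_number, text) for each log line containing a keyword."""
    # Assumed naming: qsub-style output files such as job.o123 and job.e123.
    candidates = set(glob.glob(os.path.join(directory, "*.o*")))
    candidates.update(glob.glob(os.path.join(directory, "*.e*")))

    hits = []
    for path in sorted(candidates):
        if not os.path.isfile(path):
            continue  # skip directories that happen to match the pattern
        # errors="replace" keeps the scan going if a log holds odd bytes.
        with open(path, "r", errors="replace") as handle:
            for number, line in enumerate(handle, start=1):
                lowered = line.lower()
                if any(word in lowered for word in keywords):
                    hits.append((path, number, line.rstrip("\n")))
    return hits


if __name__ == "__main__":
    # Scan the current directory and print every suspicious line found.
    for path, number, text in scan_logs("."):
        print(f"{path}:{number}: {text}")
```

Run it from the directory where the job was submitted. A match on "Killed", as in the message quoted in the first post, means the process was terminated from outside, which would fit the memory explanation but does not prove it on its own.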