QuantumATK Forum

QuantumATK => General Questions and Answers => Topic started by: serhan on April 23, 2009, 04:59

Title: "Killed signal 9" error
Post by: serhan on April 23, 2009, 04:59
Hi all,

I'm trying to simulate a (6,6) perfect nanotube. My py file is attached. I have 4 reps in electrodes and 10 reps in central region. But, ATK (or MPI) gives the error:

Code
# ----------------------------------------------------------------
# Electrodes Calculation
# ----------------------------------------------------------------
# sc  0 : Fermi Energy =    0.00000 Ry
rank 0 in job 2  node01_40687   caused collective abort of all ranks
  exit status of rank 0: killed by signal 9

Is there  a way to overcome this problem?

Cheers,
Serhan
Title: Re: "Killed signal 9" error
Post by: Anders Blom on April 23, 2009, 09:11
Strictly speaking this is an MPI error, not an ATK error. It might indicate a network problem, which may be spurious or not. So, try to resubmit the calculation and see if it fails in the same point again.
Title: Re: "Killed signal 9" error
Post by: serhan on April 23, 2009, 20:52
I asked this to MPI mail list and the answer is:

Code
Try the latest version of MPICH2, 1.0.8. It is hard to say what the problem might be. Could even be a bug in the application.

This may help others that gets the error.

Cheers,
Serhan
Title: Re: "Killed signal 9" error
Post by: Anders Blom on April 23, 2009, 21:05
Upgrading to MPICH2 1.0.8 is probably always a good idea, and I hope it solves your problem. Otherwise, these MPI errors are a bit mysterious, and sometimes not reproducible. And when they are, they are not easy to solve...
Title: Re: "Killed signal 9" error
Post by: serhan on April 23, 2009, 21:16
Thanks.