I actually don't remember when we introduced parallelization over displacements, but at least that's the most obvious way to get good performance for these calculations, and it works very nicely in ATK 2014 and 2015. If you have a 2-atom cell, you can calculate the phonon spectrum in the same time as a single calculation for that system, if you use 6 MPI processes, which is 6 times faster than if you just run it in serial (at least if we don't count the initial calculation for the reference system). Now, if you have 20 atoms, you can parallelize up to 60 MPIs for a speedup of 60x (that is, from 2 months to 1 day).