For a Windows machine with 4 cores, I would not recommend running MPI at all, you will be way too limited in memory. ATK will take advantage of the 4 cores automatically, in OpenMP. If you want better performance, and run properly in parallel, you should consider running on a Linux cluster, or at least a couple of Windows machines connected in a cluster (although, as mentioned, support for this is limited at this point).