The first thing I would do is to significantly decrease the max_forces parameter, as the van der Waals forces are very small. Also, you have done everything with default ATK computational settings, but defaults need to be checked to ensure convergence of physical quantities of your interest with respect to these settings, e.g., density mesh cut-off and so on.
Another suggestion would be to do a "manual" scan of the total energy vs. separation distance (e.g., from 4.5 to 7.5 Angstrom) between the monolayers to roughly locate the energy minimum, and then do optimization at that approximately-calculated equilibrium separation distance, using strict convergence criteria.