I suggest that you instead start by optimizing the cell - I think a lot of the distortion you see is because the atoms try to adjust to a "wrong" lattice constant. Additionally, the distortion is not as big as it seems. VNL shows bonds when the distance between atoms are smaller than some element-dependent number, and by increasing the so-called "fuzz factor", that number can be changed. If you increase the fuzz factor in both cases, they look more alike in the Viewer.
Also, I got an initial force of 1.5 eV/Å, which I would definitely not consider huge. Considering that the original structure was not optimized with your chosen DFT method, it is perfectly reasonable to see initial forces of that size, or even larger.