I think it can handle this relatively well, since the stoichiometry itself is not a design parameter for the MTP. What matters, generally speaking, is that the training data for the original MTP contains enough "environments" that would be seen in each stoichiometry, i.e. the specific bond lengths and atom-atom pairs that would appear in those structures. For this to be the case, I would recommend active learning on an amorphous structure of the material, ideally including a few simple stoichiometries (like 25%, 50%, 75%, as relevant, which can be realized in smaller supercells), in order to obtain an MTP that then can be applied to large-scale structure of varying percentages.