To ensure full reproducibility from run to run, you need to set seeds for the pseudo-random generators and set the deterministic flag in Trainer ... This accounts for gradient accumulation and the current ...
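
A minimal sketch of the seeding idea, not taken from the original text. The first part uses only the standard library to show that reseeding a pseudo-random generator reproduces the same draws. The helper names `sample` and `make_deterministic_trainer` are hypothetical. The second helper assumes a Lightning 2.x install exposing `lightning.pytorch` (older releases use the `pytorch_lightning` package name instead); it is defined but not called here.

```python
import random


def sample(seed: int, n: int = 5) -> list[float]:
    """Seed Python's pseudo-random generator, then draw n floats."""
    random.seed(seed)
    return [random.random() for _ in range(n)]


# Two "runs" with the same seed produce identical draws.
run_a = sample(42)
run_b = sample(42)
print(run_a == run_b)


def make_deterministic_trainer(seed: int = 42):
    """Hypothetical helper: seed everything and request deterministic ops.

    Assumes Lightning 2.x; the import is lazy so the demo above runs
    without Lightning installed.
    """
    import lightning.pytorch as pl

    # Seeds Python, NumPy and PyTorch generators; workers=True also
    # derives seeds for dataloader worker processes.
    pl.seed_everything(seed, workers=True)
    # Ask the Trainer to use deterministic algorithms where available.
    return pl.Trainer(deterministic=True)
```

Seeding alone fixes the random draws; the deterministic flag additionally targets operations whose results can vary between runs even with fixed seeds.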