- Train against an adversary.
- Play training games through to the end, so that a Victory/Defeat reward can be given.
- Benchmark the effect of increasing the patch_size used for learning and for choosing actions.
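As a rough illustration of why games have to be played to the end: if the only reward is Victory/Defeat, it is known only once the game is over, and it then has to be spread back over the earlier moves. The sketch below assumes a reward of +1 for a victory and -1 for a defeat, and a discount factor `gamma`; these values and the function name `terminal_returns` are hypothetical and not taken from the project.

```python
from typing import List


def terminal_returns(num_steps: int, won: bool, gamma: float = 0.99) -> List[float]:
    """Discounted return for each step of a finished game.

    Assumption: the only reward is given at the last step,
    +1.0 for a victory and -1.0 for a defeat.
    """
    final_reward = 1.0 if won else -1.0
    # Step t lies (num_steps - 1 - t) moves before the end of the game,
    # so the terminal reward is discounted that many times.
    return [final_reward * gamma ** (num_steps - 1 - t) for t in range(num_steps)]


if __name__ == "__main__":
    # A won game of 3 moves with gamma = 0.5
    print(terminal_returns(3, won=True, gamma=0.5))
```

A game that is cut off early has no outcome, so under this scheme none of its moves would receive a learning signal.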