A MENACE player backed by a shared Matchboxes registry. Records the sequence of (position, move) pairs during a game so it can back-propagate the result afterwards.
All cell indices in history are in original board orientation, matching what selectMove returns and what update expects.
Pl = Boolean: true = X (first player), false = O (second player).
Called at the end of a game with the full result and this player's identity. Default implementation is a no-op. Override to implement learning or logging.
Called at the end of a game with the full result and this player's identity. Default implementation is a no-op. Override to implement learning or logging.
Value parameters
me
this player's identity, used to extract the relevant score.