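A probabilistic framework that generates samples with probability proportional to a non-negative reward function, so higher-reward candidates are drawn more often while lower-reward ones remain reachable. It is useful for optimization tasks such as molecule discovery, where many diverse high-reward candidates are wanted rather than a single optimum.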
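
To make "probability proportional to a reward function" concrete, the sketch below computes the target distribution p(x) = R(x) / Z over a tiny, fully enumerable candidate set and draws samples from it. It illustrates only the sampling target, not how the framework is trained; the candidate names, the `toy_reward` values, and both helper functions are hypothetical and chosen purely for illustration.

```python
import random
from collections import Counter


def reward_proportional_probs(candidates, reward):
    """Return p(x) = R(x) / Z for each candidate, where Z is the sum of all rewards."""
    rewards = [reward(x) for x in candidates]
    if any(r < 0 for r in rewards):
        raise ValueError("rewards must be non-negative")
    total = sum(rewards)  # normalizing constant Z
    if total <= 0:
        raise ValueError("at least one reward must be positive")
    return [r / total for r in rewards]


def sample_proportional(candidates, reward, n, seed=0):
    """Draw n candidates, each with probability proportional to its reward."""
    rng = random.Random(seed)
    probs = reward_proportional_probs(candidates, reward)
    return rng.choices(candidates, weights=probs, k=n)


# Hypothetical toy setup: three candidates with hand-picked rewards.
candidates = ["A", "B", "C"]
toy_reward = {"A": 1.0, "B": 2.0, "C": 7.0}.get

probs = reward_proportional_probs(candidates, toy_reward)
counts = Counter(sample_proportional(candidates, toy_reward, n=10_000))
print(dict(zip(candidates, probs)))
print(counts)
```

Here the highest-reward candidate dominates the samples, but the others still appear in proportion to their rewards, which is what separates this kind of sampling from pure maximization. In realistic settings such as molecule discovery the candidate space is generally far too large to enumerate and normalize like this, so a sampler has to be learned instead of computed directly.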