Learning with Global Cost in Stochastic Environments

   Abstract