Multi-agent Dynamics in Multi-armed Bandit Problem with Heterogeneous Stochastic Interactions