What are the benefits of A/B testing in model evaluation?