Gradient-based One-Side Sampling

The standard gbdt is reliable but it is not fast enough on large datasets. Hence, goss suggests a sampling method based on the gradient to avoid searching for the whole search space. We know that for each data instance when the gradient is small that means no worries data is well-trained and when the gradient is large that should be retrained again. So we have two sides here, data instances with large and small gradients. Thus, goss keeps all data with a large gradient and does a random sampling (that’s why it is called One-Side Sampling) on data with a small gradient. This makes the search space smaller and goss can converge faster. Finally, for gaining more insight about goss, you can check this blog post.

B) Pros and Cons

B.1) Pros

converge faster

B.2) Cons

overfitting when dataset is small

Zzong's Notes

탐색기

Gradient-based One-Side Sampling

Gradient-based One-Side Sampling

B) Pros and Cons

B.1) Pros

B.2) Cons

D) References

목차

탐색기

Gradient-based One-Side Sampling

Gradient-based One-Side Sampling

B) Pros and Cons

B.1) Pros

B.2) Cons

C) Related

D) References

함께 보면 좋은 글

목차