Abstract

SLIM 이 높은 ranking 정확도를 여러 논문 실험에서 보여줬지만, 데이터를 통한 파라매터 학습 비용이 너무 높았다. 그래서 이 논문에서는 이를 고차원 회귀 문제의 variants 로 생각하고 closed-form solution 을 제안한다.

추가적으로, 데이터의 item-popularity 에 대해 대응하기 위해 re-weighting 보다는 re-scaling 에서 영감을 받았다.

…

Introduction

SLIM 학습 비용이 높은 것을 줄이기 위해 다음과 같은 시도를 진행함

L1-norm regularization term 과 학습 가중치에 대한 non-negativity constraints 를 지웠더니 ranking 정확도의 상승이 있었다.

Approach: Preliminaries

…

Biased Training-data

실제 추천 시스템에서는 데이터에 여러 종류의 bias 가 들어가있다. 이 bias 의 원인은 데이터가 MNAR 이기 때문이다.

debiasing 을 위한 간단하고 효과적인 방법중 하나는 positive user-item interactions 으로만 구성된 학습 데이터를 이용할 때, negative user-item interactions 을 샘플링 하는 것이다.

그리고 또 다른 debiasing 방법 중 하나가 데이터에서 popularity bias 를 지우는 것이다. Natural Language Process 도메인에서는 이를 word2vec 으로 해결하고 있다: ^62a6cb

Weighted Errors

\hat{B}^{(weighted)} = ar g B min ∥ W ⊙ (Y - X \cdot B) ∥_{F}^{2} + λ \cdot ∥ B ∥_{F}^{2} s.t. diag (B) = 0

Re-scaled Target-Values

target values $Y \in R^{∣ U ∣ \times ∣ I ∣}$ 를 아이템 가중치 $w^{(I)} \in R^{∣ I ∣}$ 를 이용하여 rescaling 하는 방법을 제시한다. $Y$ 는 일반적으로 binary matrix 로 생각할 수 있다 (1 은 클릭, 0 은 missing data).

\hat{B}^{(scaled)} = ar g B min Y \cdot diagMat (w^{(I)}) - X \cdot B_{F}^{2} + λ \cdot ∥ B ∥_{F}^{2} s.t. diag (B) = 0

Example: Popularity Adjustments

Removal of Popularity-Bias

학습 데이터에서 popularity bias 를 지우는 목적은 모델이 아이템의 similarities 에 집중했으면 하는 바람에 있다.

Scaling Method

popularity bias 를 완화하기 위해 후처리 방법으로 추천 결과의 score 를 rescaling 하는 방법을 제시

r_{u, i}^{(scaled)} = r_{u, i}^{(model)} / (C_{i})^{α}

$r_{u, i}^{(model)}$ 은 predicted score
$C_{i}$ 는 학습 데이터에 포함된 아이템 $i$ 에 대한 총 클릭 횟수
$α$ 는 debiasing strength 를 조절하는 hyper-parameter
- 실험 상으로는 0.5 정도에서 가장 성능이 좋았다고 한다.

Application Ideas

Contextual Bandit 에서 score 생산 시 적용해보면 어떨까?
아니면 item pool 을 구성할 때 score 로 고려해도 괜찮을 것 같다.

References

Distributed representations of words and phrases and their compositionality

Zzong's Notes

탐색기

Collaborative filtering via high-dimensional regression

Abstract

Introduction

Approach: Preliminaries

Biased Training-data

Weighted Errors

Re-scaled Target-Values

Example: Popularity Adjustments

Removal of Popularity-Bias

Scaling Method

Application Ideas

References

링크된 언급

목차

탐색기

Collaborative filtering via high-dimensional regression

Abstract

Introduction

Approach: Preliminaries

Biased Training-data

Weighted Errors

Re-scaled Target-Values

Example: Popularity Adjustments

Removal of Popularity-Bias

Scaling Method

Application Ideas

References

링크된 언급

함께 보면 좋은 글

목차