Negative Samples for CTR

2021-08-02


Samples

RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction
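1. Several directions for improving CTR models
  1. New model architectures
  2. Incorporating feature interactions
  3. Interpretability
  4. Sparsity of data and user behavior
  5. Drift of user interests over time (the temporal dimension)
  6. Sample imbalance
2. Steps of RLNF
  1. s1: generate an action a (select or not) from each feature vector; s2: generate a reward from many such actions a; s3: generate [...] from the reward; s4: from all negative samples, take the subset N whose action a is "select", and train the CTR model on N together with the positive samples. Training thus alternates between the noise filter and the CTR model.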
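
The alternating loop in steps s1 to s4 can be sketched roughly as follows. This is only a toy illustration under several assumptions that are not in the notes or confirmed against the paper: the noise filter is a logistic policy over the feature vector, the CTR model is plain logistic regression, the reward is the negative log loss on a small held-out set, and s3 is taken to be a REINFORCE-style policy update. All names (`rlnf_sketch`, `train_ctr`, `theta`, `w`) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30, 30)))

def log_loss(w, X, y):
    p = np.clip(sigmoid(X @ w), 1e-7, 1 - 1e-7)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def train_ctr(X, y, w, lr=0.5, steps=50):
    """A few gradient steps of logistic regression (stand-in CTR model)."""
    for _ in range(steps):
        grad = X.T @ (sigmoid(X @ w) - y) / len(y)
        w = w - lr * grad
    return w

def rlnf_sketch(X_pos, X_neg, X_val, y_val, rounds=20, lr_policy=0.1, seed=0):
    rng = np.random.default_rng(seed)
    d = X_pos.shape[1]
    theta = np.zeros(d)  # noise-filter (policy) parameters, assumed logistic
    w = np.zeros(d)      # CTR model parameters
    baseline = None
    for _ in range(rounds):
        # s1: sample an action (1 = select, 0 = drop) for each negative sample
        p_keep = sigmoid(X_neg @ theta)
        actions = (rng.random(len(X_neg)) < p_keep).astype(float)

        # s4: train the CTR model on positives plus the selected negatives N
        N = X_neg[actions == 1]
        X_train = np.vstack([X_pos, N])
        y_train = np.concatenate([np.ones(len(X_pos)), np.zeros(len(N))])
        w = train_ctr(X_train, y_train, w)

        # s2: one reward for the whole batch of actions
        # (assumption: negative log loss on a held-out set)
        reward = -log_loss(w, X_val, y_val)

        # s3 (assumption): REINFORCE update with a moving-average baseline;
        # the gradient of log Bernoulli(a | p) w.r.t. theta is (a - p) * x
        if baseline is None:
            baseline = reward
        advantage = reward - baseline
        baseline = 0.9 * baseline + 0.1 * reward
        grad = X_neg.T @ (actions - p_keep) / len(X_neg)
        theta = theta + lr_policy * advantage * grad
    return theta, w, actions
```

In this sketch the CTR model is trained before the reward is computed, so the reward reflects how useful the selected negatives were; the notes list the steps in a different order, and the exact ordering and reward definition should be checked against the paper.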

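Author: yuqing wang
Link: https://satyrswang.github.io/2021/08/02/ctr的负样本/
Copyright notice: This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Please credit the source when reposting!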
