Public Sphere 2.0

Targeted Commenting in Online News Media

DSLAB PR @ 2019.07.11

이준범 ( jun@beomi.net)

Abstract

News consumption => Max(ad revenue)
Reader Engagement 👍 => *Comments*
Traditional view: Comments from the full article
Present Landscape:
Comments from only particular sections of the article
Build Neural Net to find 'comment ~ article section'

1. Introduction

Background

Paradigm shift consuming news 📰
Online media > Offline media
Comment & Share ideas on articles
Comments = Most effective tool for user engagement
Online news acts as the facilitator of public debates

User patterns

F-shaped pattern
User's attention is focused on initial texts
News website = New public sphere

2. Dataset & Motiv

Dataset

1352 Guardian
1020 NYTimes
with Comments

Dataset

60%+ comments in >20 paragraphs length
More Longer Article = More comments

Dataset

Label data with relevance score 1(irrelevant)~5(relevant)
Judged by presence and absence of
- common words
- common thoughts
2 Annotators => Cohen Kappa 0.71

Overview

42.7% = Relevent to the *whole* article
48.9%/48.8% = Relevent to 2-3 paragraphs
More releveant comments
=> Beginning paragraph

3. Linking
Comments to Paragraphs

Approach

Baseline: Traditional ML
ex) NB,DT,RF,K-NN,R-SVM,AdaBoosts,LR
DeepLearning Method:
- LSTM
- GRU 👈 Best Score!

Bi-LSTM & GRU

Pretrained 300-dim Google News Vectors
Text(article/comment) 👉 Embedding Vector
(OOV 👉 zero vector)
Text Vector = avg(word vectors)
LSTM / GRU 👉 150 dim vector
Merge Article vec + Comment vec
👉 FC layer 👉 Softmax
👉 5 classes 🔥

Eval

10-fold cross-validation
(DL Model: val each epoch, total 5 epochs)

Related works

Comment Ranking
Comment Recommendation
Comment Analysis

Conclusion

Traditional comment UI needs revamp
Comments are more related to particular sections
DeepLearning, GRU performs better
(Could BERT/XLNet perform better?)