Conference Proceeding Article
User attribute extraction on social media has gain considerable attention, while existing methods are mostly supervised which suffer great diffi- culty in insufficient gold standard data. In this paper, we validate a strong hypothesis based on homophily and adapt it to ensure the certainty of user attribute we extracted via weakly supervised propagation. Homophily, the theory which states that people who are similar tend to become friends, has been well studied in the setting of online social networks. When we focus on age attribute, based on this theory, online friends tend to have similar age. In this work, we take a step further and study the hypothesis that the age gap between online friends become even smaller in a larger friendship clique. We empirically validate our hypothesis using two real social network data sets. We further design a propagation-based algorithm to predict online users’ age, leveraging the clique-based hypothesis. We find that our algorithm can outperform several baselines. We believe that this method could work as a way to enrich sparse data and the hypothesis we validated would shed light on exploring the proximity of other user attributes such as education as well.
Social Network Analysis, Age Prediction, Homophily
Databases and Information Systems | Social Media
Data Management and Analytics
HT'14: Proceedings of the 25th ACM Conference on Hypertext and Social Media: September 1-4, 2014, Santiago, Chile
City or Country
LIAO, Lizi; JIANG, Jing; LIM, Ee Peng; and Huang, Heyan.
A Study of Age Gaps between Online Friends. (2014). HT'14: Proceedings of the 25th ACM Conference on Hypertext and Social Media: September 1-4, 2014, Santiago, Chile. 98-106. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/2416
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.