Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
11-2018
Abstract
User identity linkage (UIL), the problem of matching user account across multiple online social networks (OSNs), is widely studied and important to many real-world applications. Most existing UIL solutions adopt a supervised or semisupervised approach which generally suffer from scarcity of labeled data. In this paper, we propose Factoid Embedding, a novel framework that adopts an unsupervised approach. It is designed to cope with different profile attributes, content types and network links of different OSNs. The key idea is that each piece of information about a user identity describes the real identity owner, and thus distinguishes the owner from other users. We represent such a piece of information by a factoid and model it as a triplet consisting of user identity, predicate, and an object or another user identity. By embedding these factoids, we learn the user identity latent representations and link two user identities from different OSNs if they are close to each other in the user embedding space. Our Factoid Embedding algorithm is designed such that as we learn the embedding space, each embedded factoid is “translated” into a motion in the user embedding space to bring similar user identities closer, and different user identities further apart. Extensive experiments are conducted to evaluate Factoid Embedding on two real-world OSNs data sets. The experiment results show that Factoid Embedding outperforms the state-of-the-art methods even without training data.
Keywords
user identity linkage, factoid embedding, network embedding
Discipline
Databases and Information Systems | Theory and Algorithms
Research Areas
Data Science and Engineering
Publication
2018 IEEE International Conference on Data Mining ICDM 2018: Singapore, November 17-20: Proceedings
First Page
1338
Last Page
1343
ISBN
9781538691588
Identifier
10.1109/ICDM.2018.00182
Publisher
IEEE Computer Society
City or Country
Los Alamos, CA
Citation
XIE, Wei; MU, Xin; LEE, Roy Ka Wei; ZHU, Feida; and LIM, Ee-peng.
Unsupervised user identity linkage via factoid embedding. (2018). 2018 IEEE International Conference on Data Mining ICDM 2018: Singapore, November 17-20: Proceedings. 1338-1343.
Available at: https://ink.library.smu.edu.sg/sis_research/4258
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/ICDM.2018.00182