Research Collection School Of Computing and Information Systems

DeepStyle: User style embedding for authorship attribution of short texts

Zhiqiang HU, University of Electronic Science and Technology of China
Roy Ka-Wei LEE, University of Saskatchewan
Lei WANG, Singapore Management UniversityFollow
Ee-Peng LIM, Singapore Management UniversityFollow

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

9-2020

Abstract

Authorship attribution (AA), which is the task of finding the owner of a given text, is an important and widely studied research topic with many applications. Recent works have shown that deep learning methods could achieve significant accuracy improvement for the AA task. Nevertheless, most of these proposed methods represent user posts using a single type of features (e.g., word bi-grams) and adopt a text classification approach to address the task. Furthermore, these methods offer very limited explainability of the AA results. In this paper, we address these limitations by proposing DeepStyle, a novel embedding-based framework that learns the representations of users’ salient writing styles. We conduct extensive experiments on two real-world datasets from Twitter and Weibo. Our experiment results show that DeepStyle outperforms the state-of-the-art baselines on the AA task.

Keywords

Authorship attribution, Style embedding, Triplet loss

Discipline

Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

Web and Big Data: 4th International Joint Conference, APWeb-WAIM 2020, Tianjin, China, September 18-20: Proceedings

Volume

12318

First Page

221

Last Page

229

ISBN

9783030602895

Identifier

10.1007/978-3-030-60290-1_17

Publisher

Springer

City or Country

Cham

Embargo Period

7-4-2021

Citation

HU, Zhiqiang; LEE, Roy Ka-Wei; WANG, Lei; and LIM, Ee-Peng. DeepStyle: User style embedding for authorship attribution of short texts. (2020). Web and Big Data: 4th International Joint Conference, APWeb-WAIM 2020, Tianjin, China, September 18-20: Proceedings. 12318, 221-229.
Available at: https://ink.library.smu.edu.sg/sis_research/6018

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

505687_1_En_17_MOESM1_ESM.pdf (273 kB)
Supplementary Figure

Additional URL

https://doi.org/10.1007/978-3-030-60290-1_17

Download

Download Data/Files

Find it in your library

Included in

Databases and Information Systems Commons

COinS

Research Collection School Of Computing and Information Systems

DeepStyle: User style embedding for authorship attribution of short texts

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Embargo Period

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

DeepStyle: User style embedding for authorship attribution of short texts

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Embargo Period

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links