Research Collection School Of Computing and Information Systems

Multiview semi-supervised learning with consensus

Guangxia LI, Nanyang Technological University
Kuiyu CHANG, Nanyang Technological University
Steven C. H. HOI, Singapore Management UniversityFollow

Publication Type

Journal Article

Version

publishedVersion

Publication Date

11-2012

Abstract

Obtaining high-quality and up-to-date labeled data can be difficult in many real-world machine learning applications. Semi-supervised learning aims to improve the performance of a classifier trained with limited number of labeled data by utilizing the unlabeled ones. This paper demonstrates a way to improve the transductive SVM, which is an existing semi-supervised learning algorithm, by employing a multiview learning paradigm. Multiview learning is based on the fact that for some problems, there may exist multiple perspectives, so called views, of each data sample. For example, in text classification, the typical view contains a large number of raw content features such as term frequency, while a second view may contain a small but highly informative number of domain specific features. We propose a novel two-view transductive SVM that takes advantage of both the abundant amount of unlabeled data and their multiple representations to improve classification result. The idea is straightforward: train a classifier on each of the two views of both labeled and unlabeled data, and impose a global constraint requiring each classifier to assign the same class label to each labeled and unlabeled sample. We also incorporate manifold regularization, a kind of graph-based semi-supervised learning method into our framework. The proposed two-view transductive SVM was evaluated on both synthetic and real-life data sets. Experimental results show that our algorithm performs up to 10 percent better than a single-view learning approach, especially when the amount of labeled data is small. The other advantage of our two-view semi-supervised learning approach is its significantly improved stability, which is especially useful when dealing with noisy data in real-world applications.

Keywords

Artificial intelligence, learning systems, multiview learning, semi-supervised learning, support vector machines

Discipline

Computer Sciences | Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

IEEE Transactions on Knowledge and Data Engineering (TKDE)

Volume

Issue

First Page

2040

Last Page

2051

ISSN

1041-4347

Identifier

10.1109/TKDE.2011.160

Publisher

IEEE

Citation

LI, Guangxia; CHANG, Kuiyu; and HOI, Steven C. H.. Multiview semi-supervised learning with consensus. (2012). IEEE Transactions on Knowledge and Data Engineering (TKDE). 24, (11), 2040-2051.
Available at: https://ink.library.smu.edu.sg/sis_research/2283

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1109/TKDE.2011.160

Download

Find it in your library

Included in

Databases and Information Systems Commons

COinS

Research Collection School Of Computing and Information Systems

Multiview semi-supervised learning with consensus

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Multiview semi-supervised learning with consensus

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links