Property Testing of Data Dimensionality

Publication Type

Conference Proceeding Article

Publication Date

1-2003

Abstract

Data dimensionality is a crucial issue in a variety of settings, where it is desirable to determine whether a data set given in a high-dimensional space adheres to a low-dimensional structure. We study this problem in the framework of property testing: Given a query access to a data set S, we wish to determine whether S is low-dimensional, or whether it should be modified significantly in order to have the property. Allowing a constant probability of error, we aim at algorithms whose complexity does not depend on the size of S.We present algorithms for testing the low-dimensionality of a set of vectors and for testing whether a matrix is of low rank. We then address low-dimensionality in metric spaces. For vectors in the metric space l1, we show that low-dimensionality is not testable. For l2, we show that a data set can be tested for having a low-dimensional structure, but that the property of approximately having such a structure is not testable.

Discipline

Numerical Analysis and Scientific Computing | Software Engineering

Research Areas

Software Systems

Publication

SODA '03: Proceedings of the 14th Annual ACM-SIAM Symposium on Discrete Algorithms

First Page

18

Last Page

27

ISBN

9780898715385

Publisher

ACM

City or Country

New York

Additional URL

http://dl.acm.org/citation.cfm?id=644112

Share

COinS