Publication Type

Patent

Version

submittedVersion

Publication Date

5-2012

Abstract

A method and apparatus for rapid identification of column heterogeneity in databases are disclosed. For example, the method receives data associated with a column in a database. The method computes a cluster entropy for the data as a measure of data heterogeneity and then determines whether said data is heterogeneous in accordance with the cluster entropy.

Discipline

Databases and Information Systems

First Page

1

Last Page

13

Publisher

USPTO

Additional URL

http://www.google.com/patents/US8176016

Share

COinS