Publication Type

Patent

Publication Date

5-2012

Abstract

A method and apparatus for rapid identification of column heterogeneity in databases are disclosed. For example, the method receives data associated with a column in a database. The method computes a cluster entropy for the data as a measure of data heterogeneity and then determines whether said data is heterogeneous in accordance with the cluster entropy.

Discipline

Databases and Information Systems

Research Areas

Data Management and Analytics

First Page

1

Last Page

13

Publisher

USPTO

Creative Commons License

Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Additional URL

http://www.google.com/patents/US8176016

Share

COinS