Data Clustering

Publisher:
IGI Global
Publication Type:
Chapter
Citation:
Handbook of Research on Innovations in Database Technologies and Applications: Current and Future Tr, 2009, 1, pp. 562 - 572
Issue Date:
2009-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2009005758OK.pdf2.44 MB
Adobe PDF
Clustering is one of the most important techniques in data mining. This chapter presents a survey of popular approaches for data clustering, including well-known clustering techniques, such as partitioning clustering, hierarchical clustering, density-based clustering and grid-based clustering, and recent advances in clustering, such as subspace clustering, text clustering and data stream clustering. The major challenges and future trends of data clustering will also be introduced in this chapter. The remainder of this chapter is organized as follows. The background of data clustering will be introduced in Section 2, including the definition of clustering, categories of clustering techniques, features of good clustering algorithms, and the validation of clustering. Section 3 will present main approaches for clustering, which range from the classic partitioning and hierarchical clustering to recent approaches of bi-clustering and semisupervised clustering. Challenges and future trends will be discussed in Section 4, followed by the conclusions in the last section.
Please use this identifier to cite or link to this item: