Show simple item record

dc.contributor.authorBudhaditya, S.
dc.contributor.authorPhung, D.
dc.contributor.authorPham, DucSon
dc.contributor.authorVenkatesh, S.
dc.contributor.editorM. Jaki
dc.contributor.editorA. Siebes
dc.contributor.editorJ. Yu
dc.contributor.editorB. Goethals
dc.contributor.editorX. Wu
dc.date.accessioned2017-01-30T10:37:58Z
dc.date.available2017-01-30T10:37:58Z
dc.date.created2013-02-19T20:00:31Z
dc.date.issued2012
dc.identifier.citationBudhaditya, Saha and Phung, Dinh and Pham, Duc Son and Venkatesh, Svetha. 2012. Sparse subspace representation for spectral document clustering, in Zaki, M.J. and Siebes, A. and Yu, J.X. and Goethals, B. and Wu, X. (ed), 2012 IEEE 12th International Conference on Data Mining, Dec 10-13 2012, pp. 1902-1907. Brussels, Belgium: IEEE.
dc.identifier.urihttp://hdl.handle.net/20.500.11937/4290
dc.identifier.doi10.1109/ICDM.2012.46
dc.description.abstract

We present a novel method for document clustering using sparse representation of documents in conjunction with spectral clustering. An ℓ1-norm optimization formulation is posed to learn the sparse representation of each document, allowing us to characterize the affinity between documents by considering the overall information instead of traditional pair wise similarities. This document affinity is encoded through a graph on which spectral clustering is performed. The decomposition into multiple subspaces allows documents to be part of a sub-group that shares a smaller set of similar vocabulary, thus allowing for cleaner clusters. Extensive experimental evaluations on two real-world datasets from Reuters-21578 and 20Newsgroup corpora show that our proposed method consistently outperforms state-of-the-art algorithms. Significantly, the performance improvement over other methods is prominent for this datasets.

dc.publisherIEEE
dc.subjectdocument clustering
dc.subjectsparse representation
dc.titleSparse subspace representation for spectral document clustering
dc.typeConference Paper
dcterms.source.startPage1902
dcterms.source.endPage1907
dcterms.source.issn1550-4786
dcterms.source.titleProceedings of the IEEE International Conference on Data Mining (ICDM)
dcterms.source.seriesProceedings of the IEEE International Conference on Data Mining (ICDM)
dcterms.source.conferenceThe IEEE International Conference on Data Mining
dcterms.source.conference-start-dateDec 10 2012
dcterms.source.conferencelocationBrussels, Belgium
dcterms.source.placeUSA
curtin.note

Copyright © 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

curtin.department
curtin.accessStatusOpen access


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record