Classification and pattern discovery of mood in weblogs
MetadataShow full item record
Automatic data-driven analysis of mood from text is anemerging problem with many potential applications. Unlike generic text categorization, mood classification based on textual features is complicated by various factors, including its context- and user-sensitive nature. We present a comprehensive study of different feature selection schemes in machine learning for the problem of mood classification in weblogs. Notably, we introduce the novel use of a feature set based on the affective norms for English words (ANEW) lexicon studied in psychology. This feature set has the advantage of being computationally efficient while maintaining accuracy comparable to other state-of-the-art feature sets experimented with. In addition, we present results of data-driven clustering on a dataset of over 17 million blog posts with mood groundtruth. Our analysis reveals an interesting, and readily interpreted, structure to the linguistic expression of emotion, one that comprises valuable empirical evidence in support of existing psychological models of emotion, and in particular the dipoles pleasure-displeasure and activation-deactivation.
Showing items related by title, author, creator and subject.
Nguyen, Thin K. (2012)Social media allows people to participate, express opinions, mediate their own content and interact with other users. As such, sentiment information has become an integral part of social media. This thesis presents a ...
Nguyen, Thin; Phung, Dinh; Adams, Brett; Venkatesh, Svetha (2011)Estimation of a person's influence and personality traits from social media data has many applications. We use social linkage criteria, such as number of followers and friends, as proxies to form corpora, from popular ...
Relationship building in Vietnamese English written business communication: A systemic functional analysis,Nguyen, Bich; Oliver, Rhonda (2015)English has a long history in Vietnam and in the last two decades, particularly for business communication, it has developed with an unprecedented speed. Despite this ascendancy, there is an absence of research regarding ...