Survey of Text Mining II (2nd Ed., 2008)
Clustering, Classification, and Retrieval

Coordinators: Berry Michael W., Castellanos Malu

Language: English

Approximate price 52.74 €

In Print (Delivery period: 15 days).

Survey of Text Mining II
Publication date:
240 p. · 15.5x23.5 cm · Paperback

Approximate price 52.74 €

Subject to availability at the publisher.

Survey of text mining II: Clustering, classification & retrieval (2nd Ed.)
Publication date:
240 p. · 15.5x23.5 cm · Hardback

This Second Edition brings readers thoroughly up to date with the emerging field of text mining, the application of machine learning techniques in conjunction with natural language processing, information extraction, and algebraic/mathematical approaches to computational information retrieval. The book covers a broad range of issues, from the development of new learning approaches to the parallelization of existing algorithms. The authors highlight open research questions in document categorization, clustering, and trend detection. In addition, the book describes new application problems in areas such as email surveillance and anomaly detection.

Clustering
  Cluster-Preserving Dimension Reduction Methods for Document Classification
  Automatic Discovery of Similar Words
  Principal Direction Divisive Partitioning with Kernels and k-Means Steering
  Hybrid Clustering with Divergences
  Text Clustering with Local Semantic Kernels

Document Retrieval and Representation
  Vector Space Models for Search and Cluster Mining
  Applications of Semidefinite Programming in XML Document Classification

Email Surveillance and Filtering
  Discussion Tracking in Enron Email Using PARAFAC
  Spam Filtering Based on Latent Semantic Indexing

Anomaly Detection
  A Probabilistic Model for Fast and Confident Categorization of Textual Documents
  Anomaly Detection Using Nonnegative Matrix Factorization
  Document Representation and Quality of Text: An Analysis

Overview of current methods and software for text mining

Experts from academia and industry share their experiences in solving large-scale retrieval and classification problems

Highlights open research questions in document categorization, clustering, and trend detection

Describes new application problems in areas such as email surveillance and anomaly detection

Includes supplementary material: sn.pub/extras