DWDM \ Text Mining

Web mining / Web content mining
It is used to extract web data(organized and unstructured data) for the purposes to discover patterns from the World Wide Web.

Text Mining

Info Retrieval view (Web content Mining) DB view (Web content Mining) Web structure mining Web usage mining
Data View 1. Unstructured
2. Structured
1. Semi-structured
2. Web site as DB
Link structure Interactivity
Web contents text, images, audio and video files text, images, audio and video files Document level or hyperlink level. 1. Web Server logs.
2. Application server logs
3. Application level logs.
Mining Techniques Classification, Clustering Classification, Clustering Classification, Association Clustering, Association
Representation 1. Bag of words, n-gram terms
2. phrases, concepts / ontology
3. Relational
1. Edge labeled graph.
2. Relational.
Graph 1. Relational table.
2. Graph




Home    Back