Simple Web Mining Diagram

I don’t know much about Python or data mining, but this diagram gave me a good idea of how this module worked. I give these guys two thumbs up for visual communication.


Pattern is a web mining module for the Python programming language.
It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks).

Leave a Reply

Your email address will not be published. Required fields are marked *