Introduction to Text Mining with Hands-on Workshop:

Participants will get an overview of text mining concepts and business needs, with brief theory followed by practical, illustrative examples and worked case studies. The workshop is recommended for every data warehousing, business intelligence, and information life cycle professional. The adoption of text mining for competitive advantage and business excellence is expected to grow rapidly in the coming years.


Importance and Influence of Text Mining:


  • Introduction to Data Explosion
  • Growing scale and influence of Text and unstructured Data in Business Decision Making
  • Organizational Relevance across industries

Types of unstructured text data and business opportunities


  • Email
  • Documents
  • Facebook / Twitter
  • Web Content
  • Customer Service Logs

Art and Science of text parsing and processing


  • Grammar and Syntax
  • Semantic expression challenges
  • Statistical features and Machine Learning
  • Ontology and RDF

Text Mining: An illustrative example


Text Mining: Process and Techniques


  • Preprocessing
  • Tokenization
  • Stemming
  • Dimensionality Reduction
  • NLP Techniques
  • Semantic Indexing
  • Classification and Clustering
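
As a rough illustration of the first steps listed above (preprocessing, tokenization, and stemming), the sketch below turns short free-text records into term counts. It uses only the Python standard library. The stop-word list, the `naive_stem` suffix stripper, and the sample log lines are assumptions made for illustration; a real project would more likely use an established stemmer such as the Porter algorithm.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"the", "a", "an", "is", "was", "and", "to", "of", "for", "my"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def naive_stem(token):
    """Strip a few common English suffixes; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def term_frequencies(text):
    """Tokenize, drop stop words, stem, and count the resulting terms."""
    terms = [naive_stem(t) for t in tokenize(text) if t not in STOP_WORDS]
    return Counter(terms)


# Hypothetical customer service log lines.
logs = [
    "The customer reported billing errors.",
    "Billing error was reported and resolved.",
]

# One mapping of term counts per document: a minimal document-term structure.
doc_term = [term_frequencies(doc) for doc in logs]
```

Each entry of `doc_term` is a structured row of counts that later steps such as dimensionality reduction, classification, or clustering could take as input.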

Web Mining


  • Crawling the web
  • Extracting information from web sites
  • Transforming web sites to documents
  • Information extraction using regular expressions
  • Opinion mining and summarization
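
For the regular-expression topic above, a minimal sketch of pulling structured fields out of free text might look like the following. The patterns are deliberately simplified (the e-mail pattern is not a full address validator, and only ISO-style `YYYY-MM-DD` dates are matched), and the sample text and the `extract_entities` helper are hypothetical.

```python
import re

# Simplified patterns, chosen for readability rather than completeness.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


def extract_entities(text):
    """Return the e-mail addresses and ISO-style dates found in the text."""
    return {
        "emails": EMAIL_RE.findall(text),
        "dates": DATE_RE.findall(text),
    }


# Hypothetical snippet of page or log content.
sample = "Ticket opened 2024-01-15 by jane.doe@example.com; follow up on 2024-01-22."
result = extract_entities(sample)
```

The same approach extends to phone numbers, order IDs, or product codes, although patterns for real-world content usually need more care than shown here.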

Text Mining Cases:


  • Case scenarios based on the needs of the audience

Interactive Discussion:


  • What is the text mining need in your business context?
  • Practical applications: Translating text mining output to business actions

After this training-workshop, the participants will be able to do the following:

  • Understand the context, scope and significance of unstructured data
  • Understand the business need to mine unstructured text data
  • Understand the art and science of processing unstructured data
  • Know how to tokenize and transform text data into a structured format
  • Know the most frequently used text processing algorithms
  • Understand the process of classification, clustering, and sentiment analysis or opinion mining
  • Know how to perform simple text mining activities with an open source tool such as GATE or RapidMiner