CAP6640 Natural Language Processing
Vector Semantics and NLP Applications:
Search Engines, Business Intelligence, and Cybersecurity
By: Dr. Wingyan Chung
Objectives
Upon successfully completing this module, students should be able to:
- Describe vector models and their applications in natural language processing (NLP).
- Explain concepts in term weighting, PPMI, inverted index, and measures of similarity in vector models.
- Apply deep learning to word and context prediction using skip-gram and CBOW models.
- Evaluate quality of prediction outcomes on word and context.
Readings
- News: "Obama wants you to join CyberCorps Reserve"
- News: White House wants to revamp cybersecurity
- Memex (Domain-Specific Search), DARPA I2O
- Text: "Vector Semantics," in Jurafsky & Martin "Speech and Language Processing (3rd ed. draft)"
- Slides on Vector Semantics
- "Deep or Shallow, NLP is Breaking Out," Communications of the ACM, March 2016
NLP Research and Applications
- Word2Vec: Deep Learning of Text Corpus
- IPM Special Issue: Emotion and Sentiment in Social and Expressive Media
- Contextual semantics for sentiment analysis of Twitter
- Sentiment and network analyses of U.S. Immigration and border security
- BizPro: Extracting and categorizing business intelligence factors from textual news articles
- Text Visualization for Authorship Analysis
- Web Searching in a Multilingual World, CACM
Copyright © 2016 Wingyan Chung. All Rights Reserved.