Study Focus

Study Resources

Stanford's classic 2012 Natural Language Processing (NLP) course by Dan Jurafsky and Chris Manning
CS 124: From Languages to Information

Study Plan

Date Video Lectures Reading Progress
2020.05.22 Basic Text Processing J+M (3ed) Chapter 2 “Regular Expressions, Text Normalization, Edit Distance”, pages 1-21
2020.05.23 Edit Distance J+M New Chapter 2 “Regular Expressions, Text Normalization, Edit Distance”, pages 22-26
2020.05.24~25 Language Modeling J+M (3ed) Chapter 3, “Language Modeling with N-grams” pages 1-16 (plus section 3.6, “The Web and Stupid Backoff”).
2020.05.26 Naive Bayes and Text Classification J+M (3ed) Chapter 4, “Naive Bayes and Sentiment Classification” pages 1-14, sections 4.1 through 4.8.
2020.05.27 Sentiment Analysis
- Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? Sentiment Classification using Machine Learning Techniques. EMNLP 2002, pages 79-86
- J+M (3ed) Chapter 21, “Lexicons for Sentiment, Affect, and Connotation” pages 1-6 plus section 21.6 (page 15)
2020.05.28 Logistic Regression (no video) J+M (3ed) Chapter 5, “Logistic Regression” pages 1-6. Optional: pages 7-17
2020.05.29 Information Retrieval (I)
- MR+S Chapter 1: Boolean Retrieval (pages 1-17)
- MR+S Chapter 2: Term vocabulary and postings lists (only pages 33-42)
2020.05.30 Information Retrieval (II)
- MR+S Chapter 6: Scoring, term weighting, and the vector space model, (only pages 100 and 107-116)
- MR+S Chapter 8: Evaluation in Information Retrieval (only pages 139-149)
2020.06.03 Vector Semantics, Neural Embeddings, Word2Vec (no video) J+M (3ed) Chapter 6: Vector Semantics, pages 1-7 and 17-26; review pages 7-16 (should already be familiar)
(all chapter 6)
2020.06.03 Relation Extraction J+M (3ed) Chapter 18: Information Extraction, pages 1-3 and section 18.2 (pages 8-18)
(18.1+18.2)
2020.06.05 Question Answering J+M (3ed) Chapter 25: Question Answering
2020.06.06 Chat Bots (no videos) J+M (3ed) Chapter 26: Dialog Systems and Chatbots
2020.06.14 Recommender systems (Collaborative Filtering) (no video) Jure Leskovec, Anand Rajaraman, Jeff Ullman. 2014. Mining of Massive Datasets. Chapter 9 2nd edition. pages 307-311 (intro and 9.1) and 321-327 (9.3).
2020.06.15 Web graphs, Links, and PageRank MR+S Chapter 21: Link Analysis
2020.06.16 Social Networks (no videos) David Easley and Jon Kleinberg. 2010. Networks, Crowds, and Markets: Reasoning About a Highly Connected World. Cambridge University Press. Chapter 2, Sections 3.1-3.3, and Sections 18.1-18.5