学习重点
学习资源
斯坦福2012年版经典自然语言处理(NLP)课程 by Dan Jurafsky, Chris Manning
CS 124: From Languages to Information
学习计划
Date | Video Lectures | Reading | 进度 |
---|---|---|---|
2020.05.22 | Basic Text processing | J+M 3rd Chapter 2 “Regular Expressions, Text Normalization, Edit Distance”, pages 1-21 | ✅ |
2020.05.23 | Edit Distance | J+M New Chapter 2 “Regular Expressions, Text Normalization, Edit Distance”, pages 22-26 | ✅ |
2020.05.24~25 | Language Modeling | J+M (3ed) Chapter 3, “Language Modeling with N-grams” pages 1-16 (plus section 3.6, “The Web and Stupid Backoff”). | ✅ |
2020.05.26 | Naive Bayes and Text Classification | J+M (3ed) Chapter 4, “Naive Bayes and Sentiment Classification” pages 1-14, sections 4.1 through 4.8. | ✅ |
2020.05.27 | Sentiment Analysis | - Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up? Sentiment Classification using Machine Learning Techniques. EMNLP 2002, pages 79—86 - J+M (3ed) Chapter 21, “Lexicons for Sentiment, Affect, and Connotation” pages 1-6 plus section 21.6 (page 15) |
✅ |
2020.05.28 | Logistic Regression(no video) | J+M (3ed) Chapter 5, “Logistic Regression” pages 1-6. Optional: pages 7-17 | ✅ |
2020.05.29 | Information Retrieval (I) | - MR+S Chapter 1: Boolean Retrieval (pages 1-17) - MR+S Chapter 2: Term vocabulary and postings lists (only pages 33-42) |
✅ |
2020.05.30 | Information Retrieval (II) | - MR+S Chapter 6: Scoring, term weighting, and the vector space model, (only pages 100 and 107-116) - MR+S Chapter 8: Evaluation in Information Retrieval (only pages 139-149) |
✅ |
2020.06.03 | Vector Semantics, Neural Embeddings, Word2Vec (no video) | J+M (3ed) Chapter 6: Vector Semantics, 1-7, 17-26, and review 7-16 (should already be familiar) (all chapter 6) |
✅ |
2020.06.03 | Relation Extraction | J+M (3ed) Chapter 18: Information Extraction page 1-3 and section 18.2 (pages 8-18) (18.1+18.2) |
✅ |
2020.06.05 | Question Answering | J+M (3ed) Chapter 25: Question Answering | ✅ |
2020.06.06 | Chat Bots (no videos) | J+M (3ed) Chapter 26: Dialog Systems and Chatbots | ✅ |
2020.06.14 | Recommender systems (Collaborative Filtering) (no video) | Jure Leskovec, Anand Rajaraman, Jeff Ullman. 2014. Mining of Massive Datasets. Chapter 9 2nd edition. pages 307-311 (intro and 9.1) and 321-327 (9.3). | ✅ |
2020.06.15 | Web graphs, Links, and PageRank | MR+S Chapter 21: Link Analysis | ✅ |
2020.06.16 | Social Networks (no videos) | Networks, Crowds, and Markets: Reasoning About a Highly Connected World By David Easley and Jon Kleinberg Cambridge University Press (2010) Chapter 2, Sections 3.1-3.3 and Secs 18.1-18.5 | |