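Applications of relation extraction:

Relation classification: usually decides which relation holds between two entities in a sentence; this is a multi-class classification problem.
Relation extraction: decides whether two entities in a sentence are related at all; this is usually a binary classification problem, with the relation type specified in advance.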

Relation extraction methods (relation extractors)

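Hand-written rules
Pros: high precision; rules can be tailored to a specific domain.
Cons: low recall; it is hard to anticipate every possible pattern, and patterns must be defined separately for each relation, which costs a lot of time and effort.
Stanford CoreNLP's TokensRegex: combines string-based patterns with NER-based patterns.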
Example: Who holds what office in what organization?
- PERSON, POSITION of ORG
  - George Marshall, Secretary of State of the United States
- PERSON (named | appointed | chose | etc.) PERSON Prep? POSITION
  - Truman appointed Marshall Secretary of State
- PERSON [be]? (named | appointed | etc.) Prep? ORG POSITION
  - George Marshall was named US Secretary of State
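
The first pattern above (PERSON, POSITION of ORG) can be sketched in plain Python. This is not the TokensRegex API; it is a minimal illustration that assumes the sentence has already been tokenized and NER-tagged. The hand-tagged input, the helper names `chunk` and `extract_office_holders`, the `symbol/index` encoding, and the optional "the" before ORG are all assumptions made for this example.

```python
import re
from itertools import groupby

# Hypothetical, hand-tagged input: (token, NER tag) pairs.
# "O" marks tokens that are outside any entity.
tagged = [
    ("George", "PERSON"), ("Marshall", "PERSON"), (",", "O"),
    ("Secretary", "POSITION"), ("of", "POSITION"), ("State", "POSITION"),
    ("of", "O"), ("the", "O"), ("United", "ORG"), ("States", "ORG"),
]

def chunk(tagged_tokens):
    """Merge consecutive tokens with the same entity tag into one chunk.

    Simplification: two adjacent entities with the same tag would be merged.
    """
    chunks = []
    for tag, group in groupby(tagged_tokens, key=lambda pair: pair[1]):
        words = [word for word, _ in group]
        if tag == "O":
            chunks.extend((word, "O") for word in words)
        else:
            chunks.append((" ".join(words), tag))
    return chunks

# Each chunk is encoded as "<symbol>/<index>": the symbol is the NER tag for
# entities and the lowercased word otherwise, so a single regex can mix
# string-based and NER-based conditions.
PATTERN = re.compile(
    r"(?<!\S)PERSON/(\d+) ,/\d+ POSITION/(\d+) of/\d+ (?:the/\d+ )?ORG/(\d+)(?!\S)"
)

def extract_office_holders(tagged_tokens):
    """Return (person, position, organization) triples matched by PATTERN."""
    chunks = chunk(tagged_tokens)
    encoded = " ".join(
        f"{tag if tag != 'O' else text.lower()}/{i}"
        for i, (text, tag) in enumerate(chunks)
    )
    return [
        tuple(chunks[int(index)][0] for index in match.groups())
        for match in PATTERN.finditer(encoded)
    ]

triples = extract_office_holders(tagged)
```

Each of the other two patterns would need its own regular expression, which illustrates the drawback noted earlier: every relation and every phrasing needs a separately written rule.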
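Supervised learning
Choose the set of relations we want to extract
Choose the set of relevant named entities
Find and label data
- Choose a representative corpus
- Label the named entities
- Manually annotate the relations between entities
- Split into training, validation, and test sets
Train a classifier: MaxEnt, Naive Bayes, SVM, etc.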
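
As a rough sketch of the final step, the snippet below trains a small multinomial Naive Bayes classifier (one of the classifier families listed above) using only the standard library. The training examples are made up, and the feature choice (a bag of the words that appear between the two entities), the label names, and the class name `NaiveBayesRelationClassifier` are assumptions for illustration. A real setup would use a manually annotated corpus with separate validation and test sets.

```python
import math
from collections import Counter, defaultdict

# Toy, made-up training data: (words between the two entities, relation label).
TRAIN = [
    ("was named secretary of", "holds_office"),
    ("was appointed director of", "holds_office"),
    ("serves as chairman of", "holds_office"),
    ("was born in", "born_in"),
    ("is a native of", "born_in"),
    ("visited", "no_relation"),
    ("grew up in", "no_relation"),
]

class NaiveBayesRelationClassifier:
    """Multinomial Naive Bayes with add-one smoothing over bag-of-words features."""

    def fit(self, examples):
        self.total = len(examples)
        self.label_counts = Counter(label for _, label in examples)
        self.word_counts = defaultdict(Counter)
        for text, label in examples:
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        words = [w for w in text.lower().split() if w in self.vocab]

        def log_score(label):
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.label_counts[label] / self.total)
            for w in words:
                score += math.log((counts[w] + 1) / denom)
            return score

        return max(self.label_counts, key=log_score)

clf = NaiveBayesRelationClassifier().fit(TRAIN)
prediction = clf.predict("was appointed chairman of")
```

With only seven examples the model merely memorizes a few cue words; richer features and far more labeled data would be needed for usable accuracy.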

Semi-supervised learning

[Figure 3: 01 Relation Extraction]