A Semantic-based Method for Unsupervised Commonsense Question Answering

Task Definition

  • 无监督多选项commonsense QA
  • 设s为score function,模型应当选出最合适的答案
  • Question&Answering - 图1

    SEQA

    引入SEQA的目的是为了预测一个answerQuestion&Answering - 图2的语义得分
    Question&Answering - 图3可以用Question&Answering - 图4来估计,其定义如下:

Question&Answering - 图5
其中,Question&Answering - 图6为answerQuestion&Answering - 图7的supporters集合,Question&Answering - 图8为所有可能answer的集合,Question&Answering - 图9为indicator function

indicator function定义如下:
Question&Answering - 图10
其中,Question&Answering - 图11为使用BERT等model抽取出的语义特征

上面的indicator function过于strict,因此引入一个soft version:
Question&Answering - 图12
Question&Answering - 图13
其中Question&Answering - 图14为温度控制参数,值越大越接近strict的indicator function

由于无法枚举Question&Answering - 图15中的全部answer,因此使用下面的方法来近似估计:
Question&Answering - 图16

Voting View

image.png

  • step 1:sample voters Question&Answering - 图18from Question&Answering - 图19
  • step 2:each voter votes for the choices with the semantic similarity weights
    • eg:Question&Answering - 图20votes for Question&Answering - 图21with the weight of Question&Answering - 图22

      Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction

      Keywords

      大水文、他们的数据集我们可以用、我们可以超越它

      Rationale Span Extraction

      现有questionQuestion&Answering - 图23与passageQuestion&Answering - 图24,首先将两者拼接成一个sequenceQuestion&Answering - 图25,接着通过transformer输出hidden statesQuestion&Answering - 图26

根据encoder输出的内容,模型可输出每个token的应当纳入rationale的概率
Question&Answering - 图27
损失函数:
Question&Answering - 图28
Question&Answering - 图29为第i个token的rationale label

Rationale-Enriched Answer Generation

MLE

Joint Training and Prediction

所以很简单,最后就是一个多任务学习。一项训练encoder,一项训练整个模型
Question&Answering - 图30