【EMNLP 2021 论文和开源目录】

  • 对话系统(Dialogue Systems)
  • 事件预测(Event Prediction)
  • 时间轴摘要(Timeline Summarization)
  • 事件抽取(Event Extraction)

Dialogue Systems

Contextualize Knowledge Bases with Transformer for End-to-end Task-Oriented Dialogue Systems
Yanjie Gou, Yinjie Lei, Lingqiao Liu, Yong Dai and Chunxu Shen
将知识库(KB)整合到端到端面向任务的对话系统中是一项挑战,因为它需要正确地表示知识库的实体,而知识库实体与其知识库上下文和对话上下文相关联。现有的工作仅通过感知实体知识库上下文的一部分来表示实体,这可能导致由于信息丢失而导致的表示效率降低,并对知识库推理和响应生成产生不利影响。为了解决这个问题,工作探索通过动态感知所有相关实体和对话历史来充分语境化实体表示。为了实现这一点,我们提出了一个上下文感知内存增强转换器框架(COMET),它将KB视为一个序列,并利用一个新的内存掩码强制实体只关注其相关实体和对话历史,同时避免无关实体的干扰。通过大量的实验,我们表明我们的COMET框架可以实现优于现有技术的性能。
Paper: https://arxiv.org/abs/2010.05740
Code:

Event Prediction

The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction
Manling Li, Sha Li, Zhenhailong Wang, Lifu Huang, Kyunghyun Cho, Heng Ji, Jiawei Han and Clare Voss
image.png
事件模式对事件的结构及其联系进行编码。随着事件的展开,模式对于充当中间连接器至关重要。先前关于事件模式归纳的工作关注于原子事件或线性时间事件序列,忽略了事件之间通过参数和参数关系的相互作用。本文引入了时态复杂事件模式的一个新概念:基于图的模式表示,它包含事件、参数、时态连接和参数关系。此外,论文提出了一个时态事件图模型,该模型根据时态复杂事件模式预测事件实例。为了构建和评估这样的模式,我们发布了一个新的模式学习语料库,其中包含6399个文档以及事件图,我们还手动构建了gold-standard模式。
Paper: https://blender.cs.illinois.edu/paper/schema2021a.pdf
Code: https://github.com/limanling/temporal-graph-schema (code none)

Timeline Summarization(Event)

Timeline Summarization based on Event Graph Compression via Time-Aware Optimal Transport
Manling Li, Tengfei Ma, Mo Yu, Lingfei Wu, Tian Gao, Heng Ji and Kathleen McKeown
image.png
Timeline Summarization是识别新闻集合中的主要事件,并按照时间顺序描述它们,并标记关键日期。以前的方法通常在确定事件的关键日期后为每个日期分别生成摘要。这些方法忽略了事件的内部结构(参数)和内部结构(事件连接)。按照不同的路径,我们建议将新闻文章表示为事件图,因此摘要任务变成将整个图压缩为其显著子图。关键的假设是,通过共享参数和时间顺序连接的事件描述了时间线的框架,其中包含在全局事件图中语义相关、结构显著和时间连贯的事件。然后引入时间感知的最优传输距离,以无监督的方式学习压缩模型。
Paper: https://aclanthology.org/2021.emnlp-main.519.pdf
Code: https://github.com/limanling/event-graph-summarization

Integrating Deep Event-Level and Script-Level Information for Script Event Prediction
Long Bai, Saiping Guan, Jiafeng Guo, Zixuan Li, Xiaolong Jin and Xueqi Cheng
image.png
脚本是从文本中提取的事件和参与者的结构化序列。脚本事件预测旨在根据脚本中的历史事件预测后续事件。两种信息促进了这项任务,即事件级信息和脚本级信息。在事件层面,现有的研究将事件视为与参与者一起的动词,而忽略了其他有用的属性,例如参与者的状态。在脚本层面,大多数现有的研究只考虑单个事件序列对应于一个共同的主角。在本文中,我们提出了一个基于转换器的模型,称为MCPredictor,它集成了脚本事件的深层事件级和脚本级信息预言。在事件层面,MCPredictor利用文本中丰富的信息来获得更全面的事件语义表示。在脚本级别,它考虑对应于后续事件的不同参与者的多个事件序列。在广泛使用的《纽约时报》语料库上的实验结果证明了该模型的有效性和优越性。
Paper: https://aclanthology.org/2021.emnlp-main.777.pdf
Code:

Event Extraction

  1. Extracting Event Temporal Relations via Hyperbolic Geometry

Xingwei Tan, Gabriele Pergola and Yulan He
image.png
在自然语言理解中,检测事件及其随时间的演化是一项至关重要的任务。最近用于事件时间关系提取的神经方法通常将事件映射到欧氏空间中的嵌入,并训练分类器来检测事件对之间的时间关系。然而,嵌入在欧几里德空间中无法捕获更丰富的非对称关系,如事件-时间关系。因此,我们建议将事件嵌入到双曲空间中,双曲空间本质上是面向层次结构建模的。我们介绍了两种在双曲空间中对事件及其时间关系进行编码的方法。一种方法利用双曲线嵌入通过简单的几何运算直接推断事件关系。在第二个方案中,我们设计了一个由双曲型神经单元组成的端到端架构,用于时间关系提取任务。
Paper: https://arxiv.org/pdf/2109.05527.pdf
Code: https://github.com/Xingwei-Warwick/hyper-event-TempRel

  1. Learning Constraints and Descriptive Segmentation for Subevent Detection

Haoyu Wang, Hongming Zhang, Muhao Chen and Dan Roth
image.png
文本中提到的事件对应于不同粒度的现实世界事件。子事件检测的任务旨在解决这个粒度问题,识别事件复合体中多粒度事件的论元关系。由于知道事件复合体描述上下文的范围有助于推断事件的成员关系,因此我们提出了基于事件的文本分割任务(EVENTSEG)作为辅助任务,以改进子事件检测的学习。为了将这两项任务联系在一起,我们提出了一种学习和实施约束的方法,这些约束捕获子事件检测和事件序列预测之间的依赖关系,并指导模型进行全局一致性推理。具体来说,我们采用整流网络进行约束学习,然后将学习到的约束转化为神经模型损失函数中的正则项。
Paper: https://arxiv.org/pdf/2109.06316.pdf
Code: https://github.com/CogComp/Subevent_EventSeg

  1. Event Coreference Data (Almost) for Free: Mining Hyperlinks from Online News

Michael Bugert and Iryna Gurevych
image.png
跨文档事件共指解决(CDCR)的任务是识别哪些事件提及涉及整个文档集合中的相同事件。举例,CDCR数据是一个艰巨而昂贵的过程,这解释了为什么现有的语料库很小并且缺乏领域覆盖率。为了克服这个瓶颈,我们自动从在线新闻的超链接中提取事件共同引用数据:当引用一个重要的现实世界事件时,作者通常会在另一篇报道该事件的文章中添加一个超链接。
Paper: https://openreview.net/pdf?id=485AXJD1fQ5
Code: https://github.com/UKPLab/emnlp2021-hypercoref-cdcr

  1. Treasures Outside Contexts: Improving Event Detection via Global Statistics

Rui Li, Wenlin Zhao, Cheng Yang and Sen Su
image.png
事件检测(ED)旨在识别给定文本中特定类型的事件实例,它已被形式化为序列标记任务。据我们所知,现有的基于神经网络的ED模型依赖于输入文本中每个单词的上下文语义特征进行决策,我们发现在测试阶段很容易被不同的上下文混淆。为此,我们提出了在整个训练集中引入一组来自单词-事件共现频率的统计特征来配合上下文特征的想法。具体地说,我们提出了一个语义和统计联合鉴别网络(S-JDN),该网络由语义特征提取器、统计特征提取器和联合事件鉴别器组成。
Paper: https://aclanthology.org/2021.emnlp-main.206.pdf
Code: https://github.com/Buted/SSJDN

  1. Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification

Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu and Jun Zhao
image.png
事件真实性表示事件是否发生在现实世界中的确定程度。现有的研究主要集中在句子层面上识别事件的真实性,这很容易导致同一事件的不同提及之间的冲突。为此,我们研究了文档级事件真实性识别问题,它从文档的角度确定事件的真实性。对于这个任务,我们需要考虑两个重要的特征:局部不确定性和全局结构性 ,这可以用来改善性能。在本文中,我们提出了一种不确定的局部到全局网络(ULGN)来利用这两个特性。具体来说,我们设计了一个局部不确定性估计模块来模拟局部信息的不确定性。此外,我们还提出了一个不确定信息聚合模块,以利用全局结构来集成局部信息。
Paper: https://aclanthology.org/2021.emnlp-main.207.pdf
Code: https://github.com/CPF-NLPR/ULGN4DocEFI

  1. Lifelong Event Detection with Knowledge Transfer

Pengfei Yu, Heng Ji and Prem Natarajan
image.png
传统的有监督信息提取(IE)方法可以从非结构化数据中提取结构化知识元素,但它们仅限于预定义的目标本体。实际上,随着添加新的类型或更细粒度的子类型,感兴趣的本体可能会随着时间的推移而改变。我们提出了一个新的终身学习Lefe Long Learning框架来应对这一挑战。我们将终身事件检测作为一个范例案例,并提出了一个新的问题公式,该公式也可推广到其他IE任务。在事件检测和更一般的IE任务中,分层知识元素类型之间存在丰富的相关性或语义关联。在我们提出的框架中,知识正在旧事件类型和新事件类型之间传递。具体地说,我们使用自我训练损失,通过提及新的事件类型来更新旧知识。此外,我们根据旧事件类型与新事件类型的相似性来聚合旧事件类型的表示,以初始化新事件类型的表示。
Paper: https://aclanthology.org/2021.emnlp-main.428.pdf
Code: https://github.com/Perfec-Yu/Lifelong-ED

  1. Machine Reading Comprehension as Data Augmentation: A Case Study on Implicit Event Argument Extraction

Jian Liu, Yufeng Chen and Jinan Xu
image.png
隐式事件论元提取(EAE)是一项关键的文档级信息提取任务,旨在识别句子级以外的事件论元。尽管这项任务做了很多努力工作,但缺乏足够的训练数据长期以来阻碍了这项研究。在本文中,通过将任务与机器阅读理解(MRC)联系起来,我们从一个新的角度来解决隐式EAE所面临的数据稀疏问题。特别是,我们通过MRC设计了两种数据扩充机制,包括:1)隐式知识转移,通过在MRC公式中构建统一的培训框架实现其他任务的知识转移;2)显式数据扩充,可以显式生成新的培训示例,将MRC模型视为注释器。
Paper: https://aclanthology.org/2021.emnlp-main.214/
Code:https://github.com/jianliu-ml/DocMRC

  1. ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic Relations

Rujun Han, I-Hung Hsu, Jiao Sun, Julia Baylon, Qiang Ning, Dan Roth and Nanyun Peng
理解事件之间的语义关系是阅读理解的本质。最近以事件为中心的阅读理解数据集主要关注事件参数或时间关系。虽然这些任务部分评估了机器的叙事理解能力,但类似人类的阅读理解需要处理基于事件的信息的能力,而不仅仅是论据和时间推理。例如,为了理解事件之间的因果关系,我们需要推断动机或目的;要建立事件层次结构,我们需要了解事件的组成。为了简化这些任务,我们引入了ESTER,一个用于事件语义关系推理的综合机器阅读理解(MRC)数据集。该数据集利用自然语言查询对五种最常见的事件语义关系进行推理,提供超过6K个问题,并捕获10.1K个事件关系对。
Paper: https://arxiv.org/pdf/2104.08350.pdf
Code:https://github.com/PlusLabNLP/ESTER

  1. Document-level Entity-based Extraction as Template Generation

Kung-Hsiang Huang, Sam Tang and Nanyun Peng
Paper: https://arxiv.org/pdf/2109.04901.pdf
Code: https://github.com/PlusLabNLP/TempGen
image.png
大多数文档级EE系统都是构建了抽取模型,但是这种方式难以解决文档级别中实体长依赖的问题。为了解决这个问题,本文提出了一种通用的两个层级的文档EE框架:角色填充实体抽取REE和关系抽取RE。我们首先把他建模成模板填充问题,使模型能够高效的捕获跨实体依赖的问题,引用实体标签,避免因为N元关系时候造成指数级计算复杂度。

  1. Modeling Document-Level Context for Event Detection via Important Context Selection

信息抽取中的事件检测任务旨在识别和分类文本中事件的触发词。最近的进展将基于转换器的高级语言模型(如BERT)作为ED最先进模型的关键组成部分。然而,输入文本的长度限制是此类ED模型的一个障碍,因为它们无法对已被证明对ED有益的长程文档级上下文进行编码。为了解决这个问题,我们提出了一种新的方法来为ED建模文档级上下文,动态选择文档中的相关句子,以便对目标句子进行事件预测。然后,目标句子将被选定的句子扩充,并完全由基于转换器的语言模型使用,以改进ED的表征学习。为此,采用强化算法为ED训练相关的句子选择。然后引入几种信息类型,形成训练过程的奖励函数,包括ED表现,句子相似性和语篇关系。我们在多个基准数据集上进行的大量实验表明了所提出模型的有效性,从而产生了最新的性能。
Paper: https://aclanthology.org/2021.emnlp-main.439.pdf

  1. Salience-Aware Event Chain Modeling for Narrative Understanding

讲故事,无论是通过寓言、新闻报道、纪录片还是回忆录,都可以被认为是有趣和相关事件的交流,这些事件加在一起形成了一个具体的过程。提取代表这些过程的事件链是可取的。然而,这种提取仍然是一个具有挑战性的问题。我们认为这是由于发现链条的文本的性质。自然语言文本中穿插了一段具体、突出的事件,包括背景信息、语境、观点和其他元素,这些元素对各种必要的话语和语用行为都很重要,但不属于正在交流的主要事件链的一部分。我们介绍了从自然语言文本中提取主链的方法,通过过滤掉不显著的事件和支持性句子。我们通过比较关键事件链,证明了我们的方法在隔离关键事件链方面的有效性对下游任务的影响。我们表明,通过在提取的链上预训练大型语言模型,我们可以在两项任务上获得改进,这两项任务得益于对事件链的清晰理解:叙事预测和预测基于事件的时态问答。经证实的改进和烧蚀研究证实,我们的提取方法可以分离出关键事件链。
image.png
Paper: https://arxiv.org/pdf/2109.10475.pdf
Code: https://github. com/juvezxy/Salience-Event-Chain

  1. Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention

Jiawei Chen, Hongyu Lin, Xianpei Han and Le Sun
长期以来,事件检测一直受到触发器识别的困扰:过度拟合触发词将损害泛化能力,而欠拟合触发词将损害检测性能。在少样本场景中,这个问题更加严重。在本文中,我们从因果关系的角度识别并解决了少样本事件检测(FSED)中的触发器问题。通过用结构因果模型(SCM)建模FSED,我们发现触发词是上下文和结果的混杂因素,这使得以前的FSED方法更容易过度拟合触发词。为了解决这个问题,我们建议在训练时候通过后门调整来干预。实验表明,我们的方法显著提高了在ACE05、MAVEN和KBP17数据集上的FSED表现。
image.png
Paper: https://arxiv.org/pdf/2109.05747.pdf
Code:

  1. Learning Prototype Representations Across Few-Shot Tasks for Event Detection

关键词:few shot learning
我们讨论了事件检测(信息提取的一个子任务)的少样本学习中的采样偏差和异常值问题。通过引入跨任务原型,我们提出了情景少样本学习中训练任务之间的关系模型。我们进一步提出在不同任务的分类器之间加强预测一致性,以使模型对异常值更具鲁棒性。我们的广泛实验表明,在三个少样本学习数据集上,情况持续改善。研究结果表明,当新事件类型的标记数据有限时,我们的模型更加稳健。
Paper: https://aclanthology.org/2021.emnlp-main.427.pdf
Code: https://github.com/laiviet/fsl-proact

Event Induction

  1. Corpus-based Open-Domain Event Type Induction

Jiaming Shen, Yunyi Zhang, Heng Ji and Jiawei Han
image.png
传统的事件提取方法需要预定义的事件类型及其相应的注释来学习事件提取器。在实际应用中,这些先决条件通常很难满足。本文提出了一种基于语料库的开放域事件类型归纳方法,该方法可以从给定的语料库中自动发现一组事件类型。
Paper: https://arxiv.org/pdf/2109.03322.pdf
Code: https://github.com/jmshen1994/ETypeClus

Text Generation

  1. Sentence-Permuted Paragraph Generation

Wenhao Yu, Chenguang Zhu, Tong Zhao, Zhichun Guo and Meng Jiang
image.png
在许多应用程序中,生成不同内容的段落非常重要。现有的生成模型由于固定的左右句子顺序,从同质化的语境中生成相似的内容。我们的想法是排列句子顺序,以提高多句子段落的内容多样性。我们提出了一个新的框架PermGen,其目标是最大化所有可能的句子顺序的输出段落分布的对数似然概率。PermGen使用分层位置嵌入,并为训练阶段和推理阶段设计了新的程序。在三段生成基准测试上的实验表明,PermGen生成的输出比现有模型更多样化,质量更高。
Paper:https://arxiv.org/pdf/2104.07228.pdf
Codehttps://github.com/wyu97/permgen

  1. Efficient Mind-Map Generation via Sequence-to-Graph and Reinforced Graph Refinement

Mengting Hu, Honglei Guo, Shiwan Zhao, Hang Gao and Zhong Su
image.png
思维导图是一种以分层方式表示中心概念和关键思想的图表。将传统的纯文本转换成思维导图可以反映他关键的语义结构和容易被理解。给定一个文档。现有的方法是提取每个句子对应的关系,生成该文档的有向语义图。这种方法随着文档的长度呈现指数增加,而且很难捕获整体的文档语义。为了解决这个问题,我们提出了一个高效的Mind-Map 生成网络通过句子到图的方式把文档转换成图。
为了保证生成有意义的Mind Map,我们用强化学习的方法设计了一个图增强的模块来调整关系图。
image.png
Paper: https://arxiv.org/pdf/2109.02457.pdf
Code:

Keyword Extraction

  1. Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction

Asahi Ushio, Federico Liberatore and Jose Camacho-Collados
image.png
Paper: https://arxiv.org/abs/2104.08028
Code: https://github.com/asahi417/kex

Information Extraction

实体抽取

小样本&低资源&降噪&跨领域NER

  1. Learning from Noisy Labels for Entity-Centric Information Extraction

image.png
近期的信息提取方法都过度依赖训练深层神经网络模型。然而这样的模型非常容易和噪音标签过度匹配,造成性能下降。虽然在大量学习资源的情况下过滤噪声标签的成本非常高,但最近的研究表明,与干净的标签相比,这些标签需要更多的训练步骤来记忆,且更容易被遗忘,因此在训练中是可以被识别的。基于这些特性,我们提出一个简单的以实体为中心的信息提取协同正则化框架,它由几个结构相同但参数初始化不同的神经网络模型组成。这些模型与特定任务的损失一起进行优化,并进行正则化

Paper: https://arxiv.org/pdf/2104.08656.pdf
Code: https://github.com/wzhouad/NLL-IE

细粒度实体分类

1.Fine-grained Entity Typing without Knowledge Base
image.png
关于细粒度实体类型(FET)的现有工作通常在使用知识库(KB)作为远程监控获得的数据集上训练自动模型。
然而,对知识库的依赖意味着这种训练设置可能会因知识库的缺乏或不完整而受到阻碍。为了缓解这一限制,我们提出了一种新的FET模型训练设置:不访问任何知识库的FET。
在此背景下,我们提出了一个两步框架来训练FET模型。
在第一步中,我们从一个大型的未标记数据集中自动创建带有细粒度标签的伪数据。然后,基于伪数据以无监督的方式或在粗粒度命名实体识别(NER)模型的弱指导下使用自训练的方式训练神经网络模型。
实验结果表明,相对于在原始KB监督数据集上训练的模型,我们的方法取得了有竞争力的性能。

Entity Typing是自然语言处理中一项最基础的任务。传统的Entity Typing研究关注在有限数量的实体类型,然而最近的研究更加专注更细粒度实体分类。
细粒度的实体分类最大的挑战是缺少人工标记的数据。为了解决这个问题,通常的做法是从知识库中寻求远程监督(已有的知识库(比如 freebase)对应到丰富的非结构化数据中(比如新闻文本),从而生成大量的训练数据,从而训练出一个效果不错的抽取器)。通常,训练数据是通过链接实体mention并从知识库中提取它们的类型来获得的。
但是在实际应用中,知识库的缺乏或者不完整,会限制这种训练模型的应用。

Paper: https://aclanthology.org/2021.emnlp-main.431.pdf
Code: https://github.com/lemaoliu/fet-data

Fine-grained Entity Typing via Label Reasoning
细粒度标签推理技术
传统的entity typing方法是基于独立的分类范式,这使得它们难以识别相互依赖、长尾和细粒度的实体类型。在本文中,我们认为标签之间隐含的外在和内在依赖性可以为解决上述挑战提供关键知识。为此,我们提出了标签推理网络(LRN),该网络通过发现和利用数据中的标签依赖知识,依次推理细粒度实体标签。具体而言,LRN利用自回归网络进行演绎推理,利用二部属性图进行标签之间的归纳推理,可以以序列到集合、端到端的方式有效地建模、学习和推理复杂的标签依赖关系。实验表明,LRN在标准的超细粒度实体类型测试中达到了最先进的性能,并且可以有效地解决长尾标签问题。
image.png

Other

Long Papers

MSˆ2: Multi-Document Summarization of Medical Studies
Jay DeYoung, Iz Beltagy, Madeleine van Zuylen, Bailey Kuehl and Lucy Lu Wang
Paper: https://aclanthology.org/2021.emnlp-main.594.pdf
Code: https://github.com/allenai/ms2

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages
Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky and Barbara McGillivray
Investigating the Helpfulness of Word-Level Quality Estimation for Post-Editing Machine Translation Output
Raksha Shenoy, Nico Herbig, Antonio Krüger and Josef van Genabith
A Semantic Filter Based on Relations for Knowledge Graph Completion
Liang Zongwei, Yang Junan, Liu Hui and Huang KeJu
Cross-Register Projection for Headline Part of Speech Tagging
Adrian Benton, Hanyang Li and Igor Malioutov
Joint Passage Ranking for Diverse Multi-Answer Retrieval
Sewon Min, Kenton Lee, Ming-Wei Chang, Kristina Toutanova and Hannaneh Hajishirzi
Editing Factual Knowledge in Language Models
Nicola De Cao, Wilker Aziz and Ivan Titov
Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy
Yangyang Zhao, Zhenyu Wang, Changxi Zhu and Shihan Wang
Inflate and Shrink:Enriching and Reducing Interactions for Fast Text-Image Retrieval
Haoliang Liu, Tan Yu and Ping Li
Beta Distribution Guided Aspect-aware Graph for Aspect Category Sentiment Analysis with Affective Knowledge
Bin Liang, Hang Su, Rongdi Yin, Lin Gui, Min Yang, Qin Zhao, Xiaoqi Yu and Ruifeng Xu
A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis
Jiawei Liu, Kaisong Song, Yangyang Kang, Guoxiu He, Zhuoren Jiang, Changlong Sun, Wei Lu and Xiaozhong Liu
Unsupervised Keyphrase Extraction by Jointly Modeling Local and Global Context
Xinnian Liang, Shuangzhi Wu, Mu Li and Zhoujun Li
GFST: Gender-Filtered Self-Training for More Accurate Gender in Translation
Prafulla Kumar Choubey, Anna Currey, Prashant Mathur and Georgiana Dinu
Pushing on Text Readability Assessment: A Transformer Meets Handcrafted Linguistic Features
Bruce W. Lee, Yoo Sung Jang and Jason Lee
Relational World Knowledge Representation in Contextual Language Models: A Review
Tara Safavi and Danai Koutra
Low-Rank Subspaces for Unsupervised Entity Linking
Akhil Arora, Alberto Garcia-Duran and Robert West
Certified Robustness to Programmable Transformations in LSTMs
Yuhao Zhang, Aws Albarghouthi and Loris D’Antoni

Consistent Accelerated Inference via Confident Adaptive Transformers
Tal Schuster, Adam Fisch, Tommi Jaakkola and Regina Barzilay
Multi-Modal Open-Domain Dialogue
Kurt Shuster, Eric Michael Smith, Da Ju and Jason Weston
Cross-Policy Compliance Detection via Question Answering
Marzieh Saeidi, Majid Yazdani and Andreas Vlachos
Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos
Daizong Liu, Xiaoye Qu, Jianfeng Dong and Pan Zhou
Progressively Guide to Attend: An Iterative Alignment Framework for Temporal Sentence Grounding
Daizong Liu, Xiaoye Qu and Pan Zhou
Matching-oriented Embedding Quantization For Ad-hoc Retrieval
Shitao Xiao, Zheng Liu, Yingxia Shao, Defu Lian and Xing Xie
R^3Net:Relation-embedded Representation Reconstruction Network for Change Captioning
Yunbin Tu, Liang Li, Chenggang Yan, Shengxiang Gao and Zhengtao YU

Learning Neural Templates for Recommender Dialogue System
Zujie Liang, Huang Hu, Can Xu, Jian Miao, Yingying He, yining Chen, Xiubo Geng, Fan Liang and Daxin Jiang

What to Pre-Train on? Efficient Intermediate Task Selection
Clifton Poth, Jonas Pfeiffer, Andreas Rücklé and Iryna Gurevych
UNKs Everywhere: Adapting Multilingual Language Models to New Scripts
Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych and Sebastian Ruder
Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing
Kun Wu, Lijie Wang, Zhenghua Li, Ao Zhang, Xinyan Xiao, Hua Wu, Min Zhang and Haifeng Wang
Answering Open-Domain Questions of Varying Reasoning Steps from Text
Peng Qi, Haejun Lee, TG Sido and Christopher Manning
Meta-LMTC: Meta-Learning for Large-Scale Multi-Label Text Classification
Ran Wang, Xi’ao Su, Siyu Long, Xinyu Dai, Shujian Huang and Jiajun CHEN
TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations
Xianming Li, Xiaotian Luo, Chenghao Dong, Daichuan Yang, Beidi Luan and Zhen He
Distantly Supervised Relation Extraction using Multi-Layer Revision Network and Confidence-based Multi-Instance Learning
Xiangyu Lin, Tianyi Liu, Weijia Jia and Zhiguo Gong
TimeTraveler: Reinforcement Learning for Temporal Knowledge Graph Forecasting
Haohai Sun, Jialun Zhong, Yunpu Ma, Zhen Han and Kun He
Robust Open-Vocabulary Translation from Visual Text Representations
Elizabeth Salesky, David Etter and Matt Post
Text Detoxification using Large Pre-trained Neural Models
David Dale, Anton Voronov, Daryna Dementieva, Varvara Logacheva, Olga Kozlova, Nikita Semenov and Alexander Panchenko

Using Sociolinguistic Variables to Reveal Changing Attitudes Towards Sexuality and Gender
Sky CH-Wang and David Jurgens
Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories
David Wilmot and Frank Keller
Sparse Attention with Linear Units
Biao Zhang, Ivan Titov and Rico Sennrich
Finetuning Pretrained Transformers into RNNs
Jungo Kasai, Hao Peng, Yizhe Zhang, Dani Yogatama, Gabriel Ilharco, Nikolaos Pappas, Yi Mao, Weizhu Chen and Noah A. Smith
Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding
Nouha Dziri, Andrea Madotto, Osmar Zaïane and Avishek Joey Bose
Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP
Zhijing Jin, ‪Julius von Kügelgen‬, Jingwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan and Bernhard Schoelkopf
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Tianyu Gao, Xingcheng Yao and Danqi Chen
Thinking Clearly, Talking Fast: Concept-Guided Non-Autoregressive Generation for Open-Domain Dialogue Systems
Yicheng Zou, Zhihua Liu, Xingwu Hu and Qi Zhang
Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies
Sunipa Dev, Masoud Monajatipoor, Anaelia Ovalle, Arjun Subramonian, Jeff Phillips and Kai-Wei Chang
A Simple and Effective Positional Encoding for Transformers
Pu-Chin Chen, Henry Tsai, Srinadh Bhojanapalli, Hyung Won Chung, Yin-Wen Chang and Chun-Sung Ferng
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder
Shuqi Lu, Di He, Chenyan Xiong, Guolin Ke, Waleed Malik, Zhicheng Dou, Paul Bennett, Tie-Yan Liu and Arnold Overwijk
Unimodal and Crossmodal Refinement Network for Multimodal Sequence Fusion
Xiaobao Guo, Adams Kong, Huan Zhou, Xianfeng Wang and Min Wang
To be Closer: Learning to Link up Aspects with Opinions
Yuxiang Zhou, Lejian Liao, Yang Gao, Zhanming Jie and Wei Lu
CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees
Ensheng Shi, Yanlin Wang, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang and Hongbin Sun
MATE: Multi-view Attention for Table Transformer Efficiency
Julian Eisenschlos, Maharshi Gor, Thomas Müller and William Cohen
Classifying Dyads for Militarized Conflict Analysis
Niklas Stoehr, Lucas Torroba Hennigen, Samin Ahbab, Robert West and Ryan Cotterell

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Runxin Xu, Fuli Luo, Zhiyuan Zhang, Chuanqi Tan, Baobao Chang, Songfang Huang and Fei Huang
Structural Adapters in Pretrained Language Models for AMR-to-Text Generation
Leonardo F. R. Ribeiro, Yue Zhang and Iryna Gurevych
MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset
Jing Li, Shangping Zhong and Kaizhi Chen
Mathematical Word Problem Generation from Commonsense Knowledge Graph and Equations
Tianqiao Liu, Qiang Fang, Wenbiao Ding, Hang Li, Zhongqin Wu and Zitao Liu
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations
Hang Li, Wenbiao Ding, Yu Kang, Tianqiao Liu, Zhongqin Wu and Zitao Liu
Logic-level Evidence Retrieval and Graph-based Verification Network for Table-based Fact Verification
Qi Shi, Yu Zhang, Qingyu Yin and Ting Liu
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes
Hyunwoo Kim, Byeongchang Kim and Gunhee Kim
STANKER: Stacking Network based on Level-grained Attention-masked BERT for Rumor Detection on Social Media
Dongning Rao, Xin Miao, Zhihua Jiang and Ran Li
Adversarial Mixing Policy for Relaxing Locally Linear Constraints in Mixup
Liu Guang, yuzhao mao, Huang Hailong, Gao Weiguo and Li Xuan
YASO: A Targeted Sentiment Analysis Evaluation Dataset for Open-Domain Reviews
Matan Orbach, Orith Toledo-Ronen, Artem Spector, Ranit Aharonov, Yoav Katz and Noam Slonim
IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation
Samuel Cahyawijaya, Genta Indra Winata, Bryan Wilie, Karissa Vincentio, Xiaohong Li, Adhiguna Kuncoro, Sebastian Ruder, Zhi Yuan Lim, Syafri Bahar, Masayu Khodra, Ayu Purwarianti and Pascale Fung
Exploring Task Difficulty for Few-Shot Relation Extraction
Jiale Han, Bo Cheng and Wei Lu
Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Ryuto Konno, Shun Kiyono, Yuichiroh Matsubayashi, Hiroki Ouchi and Kentaro Inui
Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining
Yicheng Zou, Bolin Zhu, Xingwu Hu, Tao Gui and Qi Zhang
Bayesian Topic Regression for Causal Inference
Maximilian Ahrens, Julian Ashwin, Jan-Peter Calliess and Vu Nguyen
Interactive Machine Comprehension with Dynamic Knowledge Graphs
Xingdi Yuan

I Wish I Would Have Loved This One, But I Didn’t — A Multilingual Dataset for Counterfactual Detection in Product Review
James O’Neill, Polina Rozenshtein, Ryuichi Kiryo, Motoko Kubota and Danushka Bollegala
CRFR: Improving Conversational Recommender Systems via Flexible Fragments Reasoning on Knowledge Graphs
Jinfeng Zhou, Bo Wang, Ruifang He and Yuexian Hou
A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding
Ting-Wei Wu, Ruolin Su and Biing Juang
“Wikily” Supervised Neural Translation Tailored to Cross-Lingual Tasks
Mohammad Sadegh Rasooli, Chris Callison-Burch and Derry Tanti Wijaya
Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation
Yuning Mao, Wenchang Ma, Deren Lei, Jiawei Han and Xiang Ren
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks
Weicheng Ma, Renze Lou, Kai Zhang, Lili Wang and Soroush Vosoughi
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang and Furu Wei
SPARQLing Database Queries from Intermediate Question Decompositions
Irina Saparina and Anton Osokin
ChemNER: Fine-Grained Chemistry Named Entity Recognition with Ontology-Guided Distant Supervision
Xuan Wang, Vivian Hu, Xiangchen Song, Shweta Garg, Jinfeng Xiao and Jiawei Han
Time-aware Graph Neural Network for Entity Alignment between Temporal Knowledge Graphs
Chengjin Xu, Fenglong Su and Jens Lehmann
Types of Out-of-Distribution Texts and How to Detect Them
Udit Arora, William Huang and He He
MetaTS: Meta Teacher-Student Network for Multilingual Sequence Labeling with Minimal Supervision
Zheng Li, Danqing Zhang, Tianyu Cao, Ying Wei, Yiwei Song and Bing Yin
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel, Ari Holtzman, Maxwell Forbes, Ronan Le Bras and Yejin Choi
Learning to Selectively Learn for Weakly-supervised Paraphrase Generation
Kaize Ding, Dingcheng Li, Alexander Hanbo Li, Xing Fan, Chenlei Guo, Yang Liu and Huan Liu
ActiveEA: Active Learning for Neural Entity Alignment
Bing Liu, Harrisen Scells, Guido Zuccon, Wen Hua and Genghong Zhao
A Partition Filter Network for Joint Entity and Relation Extraction
Zhiheng Yan, Chong Zhang, Jinlan Fu, Qi Zhang and Zhongyu Wei
TEBNER: Domain Specific Named Entity Recognition with Type Expanded Boundary-aware Network
Zheng Fang, Yanan Cao, Tai Li, Ruipeng Jia, Fang Fang, Yanmin Shang and Yuhai Lu
Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training
Bo Zheng, Li Dong, Shaohan Huang, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song and Furu Wei
Local Word Discovery for Interactive Transcription
William Lane and Steven Bird
Proxy Indicators for the Quality of Open-domain Dialogues
Rostislav Nedelchev, Jens Lehmann and Ricardo Usbeck
Definition Modelling for Appropriate Specificity
Han Huang, Tomoyuki Kajiwara and Yuki Arase
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning
Linyang Li, Demin Song, Xiaonan Li, Jiehang Zeng, Ruotian Ma and Xipeng Qiu
On Pursuit of Designing Multi-modal Transformer for Video Grounding
Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang and Yuexian Zou
Recurrent Attention for Neural Machine Translation
Jiali Zeng, Shuangzhi Wu, Yongjing Yin, Yufan Jiang and Mu Li
Meta Distant Transfer Learning for Pre-trained Language Models
Chengyu Wang, Haojie Pan, Minghui Qiu, jun huang, Fei Yang and Yin Zhang
Set Generation Networks for End-to-End Knowledge Base Population
Dianbo Sui, Chenhao Wang, Yubo Chen, Kang Liu, Jun Zhao and Wei Bi
CoLV: A Collaborative Latent Variable Model for Knowledge-Grounded Dialogue Generation
Haolan Zhan, Lei Shen, Hongshen Chen and Hainan Zhang
DILBERT: Customized Pre-Training for Domain Adaptation with Category Shift, with an Application to Aspect Extraction
Entony Lekhtman, Yftah Ziser and Roi Reichart
GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation
Derek Chen and Zhou Yu
Cross-Domain Label-Adaptive Stance Detection
Momchil Hardalov, Arnav Arora, Preslav Nakov and Isabelle Augenstein
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation
Shilei Liu, Xiaofeng Zhao, Bochao Li, Feiliang Ren, Longhui Zhang and Shujuan Yin
$Q^2$: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering
Or Honovich, Leshem Choshen, Roee Aharoni, Ella Neeman, Idan Szpektor and Omri Abend
MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents
Song Feng, Siva Sankalp Patel, Hui Wan and Sachindra Joshi
Conversational Multi-Hop Reasoning with Neural Commonsense Knowledge and Symbolic Logic Rules
Forough Arabshahi, Jennifer Lee, Antoine Bosselut, Yejin Choi and Tom Mitchell
Contrastive Domain Adaptation for Question Answering using Limited Text Corpora
Zhenrui Yue, Bernhard Kratzwald and Stefan Feuerriegel
Document-Level Text Simplification: Dataset, Criteria and Baseline
Renliang Sun, Hanqi Jin and Xiaojun Wan
Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework
Anna Wegmann and Dong Nguyen
A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging
Peijie Jiang, Dingkun Long, Yueheng Sun, Meishan Zhang, Guangwei Xu and Pengjun Xie
GAML-BERT: Improving BERT Early Exiting by Gradient Aligned Mutual Learning
Wei Zhu, Xiaoling Wang, Yuan Ni and GUOTONG XIE
Idiosyncratic but not Arbitrary: Learning Idiolects in Online Registers Reveals Distinctive yet Consistent Individual Styles
Jian Zhu and David Jurgens
A Large-Scale Dataset for Empathetic Response Generation
Anuradha Welivita, Yubo Xie and Pearl Pu
NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge Bases
Tara Safavi, Jing Zhu and Danai Koutra
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Max Müller-Eberstein, Rob van der Goot and Barbara Plank
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester, Rami Al-Rfou and Noah Constant
Moving on from OntoNotes: Coreference Resolution Model Transfer
Patrick Xia and Benjamin Van Durme
Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search
Jialu Wang, Yang Liu and Xin Wang
A Bayesian Framework for Information-Theoretic Probing
Tiago Pimentel and Ryan Cotterell
Evaluating the Morphosyntactic Well-formedness of Generated Texts
Adithya Pratapa, Antonios Anastasopoulos, Shruti Rijhwani, Aditi Chaudhary, David R. Mortensen, Graham Neubig and Yulia Tsvetkov
Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Katrin Tomanek, Vicky Zayats, Dirk Padfield, Kara Vaillancourt and Fadi Biadsy
Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach
Haoming Jiang, Bo Dai, Mengjiao Yang, Tuo Zhao and Wei Wei
Improving Unsupervised Question Answering via Summarization-Informed Question Generation
Chenyang Lyu, Lifeng Shang, Yvette Graham, Jennifer Foster, Xin Jiang and Qun Liu
Improving Graph-based Sentence Ordering with Iteratively Predicted Pairwise Orderings
Shaopeng Lai, Ante Wang, Fandong Meng, Jie Zhou, Yubin Ge, Jiali Zeng, Junfeng Yao, Degen Huang and Jinsong Su
TransferNet: An Effective and Transparent Framework for Multi-hop Question Answering over Relation Graph
Jiaxin Shi, Shulin Cao, Lei Hou, Juanzi Li and Hanwang Zhang
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha, Robin Jia, Dieuwke Hupkes, Joelle Pineau, Adina Williams and Douwe Kiela
Transductive Learning for Unsupervised Text Style Transfer
Fei Xiao, Liang Pang, Yanyan Lan, Yan Wang, Huawei Shen and Xueqi Cheng
TransPrompt: Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification
Chengyu Wang, Jianing Wang, Minghui Qiu, jun huang and Ming Gao
Improving Math Word Problems with Pre-trained Knowledge and Hierarchical Reasoning
Weijiang Yu, Yingpeng Wen, Fudan zheng and Nong Xiao
GraphMR: Graph Neural Network for Mathematical Reasoning
Weijie Feng, Binbin Liu, Dongpeng Xu, Qilong Zheng and Yun Xu
Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
Guanhua Chen, Shuming Ma, Yun Chen, Li Dong, Dongdong Zhang, Jia Pan, Wenping Wang and Furu Wei
Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding
Yingmei Guo, Linjun Shou, Jian Pei, Ming Gong, Mingxing Xu, Zhiyong Wu and Daxin Jiang
Identifying Morality Frames in Political Tweets using Relational Learning
Shamik Roy, Maria Leonor Pacheco and Dan Goldwasser
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training
Minghao Wu, Yitong Li, Meng Zhang, Liangyou Li, Gholamreza Haffari and Qun Liu
Measuring Sentence-Level and Aspect-Level (Un)certainty in Science Communications
Jiaxin Pei and David Jurgens
Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Shuhuai Ren, Jinchao Zhang, Lei Li, Xu Sun and Jie Zhou
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
Xuan Ouyang, Shuohuan Wang, Chao Pang, Yu Sun, Hao Tian, Hua Wu and Haifeng Wang
Intention Reasoning Network for Multi-Domain End-to-end Task-Oriented Dialogue
Zhiyuan Ma, Jianjun Li, Zezheng Zhang, Guohui Li and Yongjing Cheng
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Pierre Colombo, Guillaume Staerman, Chloé Clavel and Pablo Piantanida
Improving Multimodal fusion via Mutual Dependency Maximisation
Pierre Colombo, Emile Chapuis, Matthieu Labeau and Chloé Clavel
Code-switched inspired losses for spoken dialog representations
Pierre Colombo, Emile Chapuis, Matthieu Labeau and Chloé Clavel
English Machine Reading Comprehension Datasets: A Survey
Daria Dzendzik, Jennifer Foster and Carl Vogel
Achieving Model Robustness through Discrete Adversarial Training
Maor Ivgi and Jonathan Berant
Dynamic Knowledge Distillation for Pre-trained Language Models
Lei Li, Yankai Lin, Shuhuai Ren, Peng Li, Jie Zhou and Xu Sun
Enlivening Redundant Heads in Multi-head Self-attention for Machine Translation
Tianfu Zhang, Heyan Huang, Chong Feng and Longbing Cao
Assessing the Reliability of Word Embedding Gender Bias Measures
Yupei Du, Qixiang Fang and Dong Nguyen
PermuteFormer: Efficient Relative Position Encoding for Long Sequences
Peng Chen
Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy
Shaolei Zhang and Yang Feng
AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples
Qianchu Liu, Edoardo Maria Ponti, Diana McCarthy, Ivan Vulić and Anna Korhonen
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim, HyoungSeok Kim, Sang-Woo Lee, Gichang Lee, Donghyun Kwak, Jeon Dong Hyeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, dongpil seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, SUK HYUN KO, Seokhun Kim, Taeyong Park, JINUK KIM, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, soobin suh, Sookyo In, jinseong park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, INHO KANG, Jung-Woo Ha, Woomyoung Park and Nako Sung
SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map
Kangli Zi, Shi Wang, Yu Liu, Jicun Li, Yanan Cao and Cungen Cao
Relation-aware Video Reading Comprehension for Temporal Language Grounding
Jialin Gao, Xin Sun, Mengmeng Xu, Xi Zhou and Bernard Ghanem
Understanding Politics via Contextualized Discourse Processing
Rajkumar Pujari and Dan Goldwasser
More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous Knowledge
Sixing Wu, Ying Li, Minghui Wang, Dawei Zhang, Yang Zhou and Zhonghai Wu
Case-based Reasoning for Natural Language Queries over Knowledge Bases
Rajarshi Das, Manzil Zaheer, Dung Thai, Ameya Godbole, Ethan Perez, Jay Yoon Lee, Lizhen Tan, Lazaros Polymenakos and Andrew McCallum
Paraphrase Generation: A Survey of the State of the Art
Jianing Zhou and Suma Bhat
UniKER: A Unified Framework for Combining Embedding and Definite Horn Rule Reasoning for Knowledge Graph Inference
kewei cheng, ziqing yang, Ming Zhang and Yizhou Sun
Style Pooling: Automatic Text Style Obfuscation for Improved Classification Fairness
Fatemehsadat Mireshghallah and Taylor Berg-Kirkpatrick
Universal Sentence Representation Learning with Conditional Masked Language Model
Ziyi Yang, Yinfei Yang, Daniel Cer, Jax Law and Eric Darve
Measuring Association Between Labels and Free-Text Rationales
Sarah Wiegreffe, Ana Marasović and Noah A. Smith
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension
Naoya Inoue, Harsh Trivedi, Steven Sinha, Niranjan Balasubramanian and Kentaro Inui
LM-Critic: Language Models for Unsupervised Grammatical Error Correction
Michihiro Yasunaga, Jure Leskovec and Percy Liang
Don’t Go Far Off: An Empirical Study on Neural Poetry Translation
Tuhin Chakrabarty, Arkadiy Saakyan and Smaranda Muresan

Multimodal Phased Transformer for Sentiment Analysis
Junyan Cheng, Iordanis Fostiropoulos, Barry Boehm and Mohammad Soleymani
How much coffee was consumed during EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI
Ashwin Kalyan, Abhinav Kumar, Arjun Chandrasekaran, Ashish Sabharwal and Peter Clark
Scalable Font Reconstruction with Dual Latent Manifolds
Nikita Srivatsan, Si Wu, Jonathan Barron and Taylor Berg-Kirkpatrick
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training
Yu Meng, Yunyi Zhang, Jiaxin Huang, Xuan Wang, Yu Zhang, Heng Ji and Jiawei Han
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil, Cheng Zhang, Dong Xuan and Wei-Lun Chao
mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs
Zewen Chi, Li Dong, Shuming Ma, Shaohan Huang, Saksham Singhal, Xian-Ling Mao, Heyan Huang, Xia Song and Furu Wei
Aspect-Controllable Opinion Summarization
Reinald Kim Amplayo, Stefanos Angelidis and Mirella Lapata

Knowing False Negatives: An Adversarial Training Method for Distantly Supervised Relation Extraction
Kailong Hao, Botao Yu and Wei Hu
Instance-adaptive training with noise-robust losses against noisy labels
Lifeng Jin, Linfeng Song, Kun Xu and Dong Yu
Revisiting Pivot-Based Paraphrase Generation: Language Is Not the Only Optional Pivot
Yitao Cai, Yue Cao and Xiaojun Wan
Hierarchical Multi-label Text Classification with Horizontal and Vertical Category Correlations
Linli Xu, Sijie Teng, Ruoyu Zhao, Junliang Guo, chi xiao, Deqiang Jiang and Bo Ren
Cross Attention Augmented Transducer Networks for Simultaneous Translation
Dan Liu, Mengge Du, Xiaoxi Li, Ya Li and Enhong Chen
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability
Xin Lv, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, YICHI ZHANG and zelin Dai
Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data
David Lowell, Brian Howard, Zachary C. Lipton and Byron Wallace
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Yangyi Chen, Jin Su and Wei Wei
Unsupervised Neural Machine Translation with Universal Grammar
Zuchao Li, Masao Utiyama, Eiichiro Sumita and Hai Zhao
SgSum:Transforming Multi-document Summarization into Sub-graph Selection
Moye Chen, Wei Li, Jiachen Liu, Xinyan Xiao, Hua Wu and Haifeng Wang
Fine-grained Entity Typing via Label Reasoning
Qing Liu, Hongyu Lin, Xinyan Xiao, Xianpei Han, Le Sun and Hua Wu
Rumor Detection on Twitter with Claim-Guided Hierarchical Graph Attention Networks
Hongzhan Lin, Jing Ma, Mingfei Cheng, Zhiwei Yang, Liangliang Chen and Guang Chen
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors
Marvin Kaster, Wei Zhao and Steffen Eger
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
Haozhe Ji and Minlie Huang
COVR: A Test-Bed for Visually Grounded Compositional Generalization with Real Images
Ben Bogin, Shivanshu Gupta, Matt Gardner and Jonathan Berant
Encouraging Lexical Translation Consistency for Document-Level Neural Machine Translation
Xinglin Lyu, Junhui Li, Zhengxian Gong and Min Zhang
Joint Multi-modal Aspect-Sentiment Analysis with Auxiliary Cross-modal Relation Detection
Xincheng Ju, Dong Zhang, Rong Xiao, Junhui Li, Shoushan Li, Min Zhang and Guodong Zhou
Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks
Qingbin Liu, Pengfei Cao, Cao Liu, Jiansong Chen, Xunliang Cai, Fan Yang, Shizhu He, Kang Liu and Jun Zhao
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva, Roei Schuster, Jonathan Berant and Omer Levy
What’s in Your Head? Emergent Behaviour in Multi-Task Transformer Models
Mor Geva, Uri Katz, Aviv Ben-Arie and Jonathan Berant
Not Just Classification: Recognizing Implicit Discourse Relation on Joint Modeling of Classification and Generation
Feng Jiang, Yaxin Fan, Xiaomin Chu, Peifeng Li and Qiaoming Zhu
Solving Aspect Category Sentiment Analysis as a Text Generation Task
Jian Liu, Zhiyang Teng, Leyang Cui, Hanmeng Liu and Yue Zhang
BiQUE: Biquaternionic Embeddings of Knowledge Graphs
Jia Guo and Stanley Kok
Diagnosing the First-Order Logical Reasoning Ability Through LogicNLI
Jidong Tian, Yitian Li, Wenqing Chen, Liqiang Xiao, Hao He and Yaohui Jin
Coupling Context Modeling with Zero Pronoun Recovering for Document-Level Natural Language Generation
Xin Tan, Longyin Zhang and Guodong Zhou
Language-agnostic Representation from Multilingual Sentence Encoders for Cross-lingual Similarity Estimation
Nattapong Tiyajamorn, Tomoyuki Kajiwara, Yuki Arase and Makoto Onizuka
Distilling Relation Embeddings from Pretrained Language Models
Asahi Ushio, Jose Camacho-Collados and Steven Schockaert

A Novel Global Feature-Oriented Relational Triple Extraction Model based on Table Filling
Feiliang Ren, Longhui Zhang, Shujuan Yin, Xiaofeng Zhao, Shilei Liu, Bochao Li and Yaduo Liu
Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning
Da Yin, Liunian Harold Li, Ziniu Hu, Nanyun Peng and Kai-Wei Chang
Parallel Refinements for Lexically Constrained Text Generation with BART
Xingwei He
Visually Grounded Reasoning across Languages and Cultures
Fangyu Liu, Emanuele Bugliarello, Edoardo Maria Ponti, Siva Reddy, Nigel Collier and Desmond Elliott
Debiasing Methods in Natural Language Understanding Make Bias More Accessible
Michael Mendelson and Yonatan Belinkov
APIRecX: Cross-Library API Recommendation via Pre-Trained Language Model
Yuning Kang, Zan Wang, Hongyu Zhang, Junjie Chen and Hanmo You
Time-dependent Entity Embedding is not All You Need: A Re-evaluation of Temporal Knowledge Graph Completion Models under a Unified Framework
Zhen Han, Gengyuan Zhang, Yunpu Ma and Volker Tresp
Learning Neural Ordinary Equations for Forecasting Future Links on Temporal Knowledge Graphs
Zhen Han, Zifeng Ding, Yunpu Ma, Yujia Gu and Volker Tresp
Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity
Po-Nien Kung, Sheng-Siang Yin, Yi-Cheng Chen, Tse-Hsuan Yang and Yun-Nung Chen
On the Relation between Syntactic Divergence and Zero-Shot Performance
Ofir Arviv, Dmitry Nikolaev, Taelin Karidi and Omri Abend
Weakly-supervised Text Classification Based on Keyword Graph
Lu Zhang, Jiandong Ding, Yi Xu, Yingyao Liu and Shuigeng Zhou
Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders
Fangyu Liu, Ivan Vulić, Anna Korhonen and Nigel Collier
Effective Convolutional Attention Network for Multi-label Clinical Document Classification
Yang Liu, Hua Cheng, Russell Klopfer, Matthew R. Gormley and Thomas Schaaf
Will this Question be Answered? Question Filtering via Answer Model Distillation for Efficient Question Answering
Siddhant Garg and Alessandro Moschitti
Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis
Wei Han, Hui Chen and Soujanya Poria
Total Recall: a Customized Continual Learning Method for Neural Semantic Parsers
Zhuang Li, Lizhen Qu and Gholamreza Haffari
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
William Merrill, Vivek Ramanujan, Yoav Goldberg, Roy Schwartz and Noah A. Smith
Adaptive Information Seeking for Open-Domain Question Answering
Yunchang Zhu, Liang Pang, Yanyan Lan, Huawei Shen and Xueqi Cheng
Few-Shot Text Generation with Natural Language Instructions
Timo Schick and Hinrich Schütze
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer
Ilias Chalkidis, Manos Fergadiotis and Ion Androutsopoulos
Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources
Ninareh Mehrabi, Pei Zhou, Fred Morstatter, Jay Pujara, Xiang Ren and Aram Galstyan
Neural Machine Translation Quality and Post-Editing Performance
Vilém Zouhar, Martin Popel, Ondřej Bojar and Aleš Tamchyna
Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training
Kuan-Hao Huang, Wasi Ahmad, Nanyun Peng and Kai-Wei Chang
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe and Aylin Caliskan
Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation
Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber and Hal Daumé III
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP
Qinyuan Ye, Bill Yuchen Lin and Xiang Ren
On the Influence of Masking Policies in Intermediate Pre-training
Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren and Madian Khabsa

Connecting Attributions and QA Model Behavior on Realistic Counterfactuals
Xi Ye, Rohan Nair and Greg Durrett
Structure-Augmented Keyphrase Generation
Jihyuk Kim, Myeongho Jeong, Seungtaek Choi and Seung-won Hwang
Multivalent Entailment Graphs for Question Answering
Nick McKenna, Liane Guillou, Mohammad Javad Hosseini, Sander Bijl de Vroe, Mark Johnson and Mark Steedman
Region under Discussion for visual dialog
Mauricio Mazuecos, Franco M. Luque, Jorge Sánchez, Hernán Maina, Thomas Vadora and Luciana Benotti
A Root of a Problem: Optimizing Single-Root Dependency Parsing
Miloš Stanojević and Shay B. Cohen
Is Everything in Order? A Simple Way to Order Sentences
Somnath Basu Roy Chowdhury, Faeze Brahman and Snigdha Chaturvedi
Learning Bill Similarity with Annotated and Augmented Corpora of Bills
Jiseon Kim, Elden Griggs, In Song Kim and Alice Oh
Reference-Centric Models for Grounded Collaborative Dialogue
Daniel Fried, Justin Chiu and Dan Klein
A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing
Chunchuan Lyu, Shay B. Cohen and Ivan Titov
Narrative Theory for Computational Narrative Understanding
Andrew Piper, Richard Jean So and David Bamman
Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation?
Tianxing He, Jingzhao Zhang, Zhiming Zhou and James Glass
An Empirical Study on Multiple Information Sources for Zero-Shot Fine-Grained Entity Typing
Yi Chen, Haiyun Jiang, Lemao Liu, Shuming Shi, Chuang Fan, Min Yang and Ruifeng Xu
GMH: A General Multi-hop Reasoning Model for KG Completion
Yao Zhang, Hongru Liang, Adam Jatowt, Wenqiang Lei, Xin Wei, Ning Jiang and Zhenglu Yang
Event Graph based Sentence Fusion
Ruifeng Yuan, zili Wang and Wenjie Li
DyLex: Incoporating Dynamic Lexicons into BERT for Sequence Labeling
Baojun Wang, Zhao Zhang, Kun Xu, Guang-Yuan Hao, Yuyang Zhang, Lifeng Shang, Linlin Li, Xiao Chen, Xin Jiang and Qun Liu
On the Benefit of Syntactic Supervision for Cross-lingual Transfer in Semantic Role Labeling
Zhisong Zhang, Emma Strubell and Eduard Hovy
Translating Headers of Tabular Data: A Pilot Study of Schema Translation
Kunrui Zhu, Yan Gao, Jiaqi Guo and Jian-Guang LOU
Scheduled Sampling Based on Decoding Steps for Neural Machine Translation
Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou
Learning with Instance Bundles for Reading Comprehension
Dheeru Dua, Pradeep Dasigi, Sameer Singh and Matt Gardner
Learning to Rewrite for Non-Autoregressive Neural Machine Translation
Xinwei Geng, Xiaocheng Feng and Bing Qin
Learning with Different Amounts of Annotation: From Zero to Many Labels
Shujian Zhang, Chengyue Gong and Eunsol Choi
Exophoric Pronoun Resolution in Dialogues with Topic Regularization
Xintong Yu, Hongming Zhang, Yangqiu Song, Changshui Zhang, Kun Xu and Dong Yu
Powering Comparative Classification with Sentiment Analysis via Domain Adaptive Knowledge Transfer
Zeyu Li, Yilong Qin, Zihan Liu and Wei Wang
Controllable Neural Dialogue Summarization with Personal Named Entity Planning
Zhengyuan Liu and Nancy Chen
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation
Yunlong Liang, Chulun Zhou, Fandong Meng, Jinan Xu, Yufeng Chen, Jinsong Su and Jie Zhou
Dimensional Emotion Detection from Categorical Emotion
Sungjoon Park, Jiseon Kim, Seonghyeon Ye, Jaeyeol Jeon, Hee Young Park and Alice Oh
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization
Tiezheng Yu, Wenliang Dai, Zihan Liu and Pascale Fung
A Unified Speaker Adaptation Approach for ASR
Yingzhu Zhao, Chongjia Ni, Cheung-Chi LEUNG, Shafiq Joty, Eng Siong Chng and Bin Ma
Different Strokes for Different Folks: Investigating Appropriate Further Pre-training Approaches for Diverse Dialogue Tasks
Yao Qiu, Jinchao Zhang and Jie Zhou
Modular Self-Supervision for Document-Level Relation Extraction
Sheng Zhang, Cliff Wong, Naoto Usuyama, Sarthak Jain, Tristan Naumann and Hoifung Poon
A Language Model-based Generative Classifier for Sentence-level Discourse Parsing
Ying Zhang, Hidetaka Kamigaito and Manabu Okumura
STaCK: Sentence Ordering with Temporal Commonsense Knowledge
Deepanway Ghosal, Navonil Majumder, Rada Mihalcea and Soujanya Poria
Graph Based Network with Contextualized Representations of Turns in Dialogue
Bongseok Lee and Yong Suk Choi
Condenser: a Pre-training Architecture for Dense Retrieval
Luyu Gao and Jamie Callan
Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data
Dheeraj Mekala, Varun Gangal and Jingbo Shang
Self-training with Few-shot Rationalization
Meghana Moorthy Bhat, Alessandro Sordoni and Subhabrata Mukherjee
Generating Self-Contained and Summary-Centric Question Answer Pairs via Differentiable Reward Imitation Learning
Li Zhou, Kevin Small, Yong Zhang and Sandeep Atluri
Distributionally Robust Multilingual Machine Translation
Chunting Zhou, Daniel Levy, Xian Li, Marjan Ghazvininejad and Graham Neubig
Predicting emergent linguistic compositions through time: Syntactic frame extension via multimodal chaining
Lei Yu and Yang Xu
Unsupervised Paraphrasing with Pretrained Language Models
Tong Niu, Semih Yavuz, Yingbo Zhou, Nitish Shirish Keskar, Huan Wang and Caiming Xiong
On the Challenges of Evaluating Compositional Explanations in Multi-Hop Inference: Relevance, Completeness, and Expert Ratings
Peter Jansen, Kelly Smith, Dan Moreno and Huitzilin Ortiz
Mapping probability word problems to executable representations
Simon Suster, Pieter Fivez, Pietro Totis, Angelika Kimmig, Jesse Davis, Luc de Raedt and Walter Daelemans
MapRE: An Effective Semantic Mapping Approach for Low-resource Relation Extraction
Manqing Dong, Chunguang Pan and Zhipeng Luo
Progressive Adversarial Learning for Bootstrapping: A Case Study on Entity Set Expansion
Lingyong Yan, Xianpei Han and Le Sun
Topic Transferable Table Question Answering
Saneem Chemmengath, Vishwajeet Kumar, Samarth Bharadwaj, Jaydeep Sen, Mustafa Canim, Soumen Chakrabarti, Alfio Gliozzo and Karthik Sankaranarayanan
Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization
Haoran Li, Song Xu, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He and Bowen Zhou
Graphine: A Dataset for Graph-aware Terminology Definition Generation
Zequn Liu, Shukai Wang, Yiyang Gu, Ruiyi Zhang, Ming Zhang and Sheng Wang
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models
Rakesh Chada and Pradeep Natarajan
Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation
Leyang Cui, Yu Wu, Shujie Liu and Yue Zhang
Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training
Zhengyan Li, Yicheng Zou, Chong Zhang, Qi Zhang and Zhongyu Wei
Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach
Koren Lazar, Benny Saret, Asaf Yehudai, Wayne Horowitz, Nathan Wasserman and Gabriel Stanovsky
Automatically Exposing Problems with Neural Dialog Models
Dian Yu and Kenji Sagae
RuleBERT: Teaching Soft Rules to Pre-Trained Language Models
Mohammed Saeed, Naser Ahmadi, Preslav Nakov and Paolo Papotti
Enhancing Multiple-choice Machine Reading Comprehension by Punishing Illogical Interpretations
Yiming Ju, Yuanzhe Zhang, Zhixing Tian, Kang Liu, Xiaohuan Cao, Wenting Zhao, Jinlong li and Jun Zhao
CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang and Chengqing Zong
Detecting Speaker Personas from Conversational Texts
Jia-Chen Gu, Zhenhua Ling, Yu Wu, Quan Liu, Zhigang Chen and Xiaodan Zhu
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang, Weishi Wang, Shafiq Joty and Steven C.H. Hoi
Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation
Jingwei Yi, Fangzhao Wu, Chuhan Wu, Ruixuan Liu, Guangzhong Sun and Xing Xie
Multi-stage Training with Improved Negative Contrast for Neural Passage Retrieval
Jing Lu, Gustavo Hernandez Abrego, Ji Ma, Jianmo Ni and Yinfei Yang
Conundrums in Event Coreference Resolution: Making Sense of the State of the Art
Jing Lu and Vincent Ng
Document Graph for Neural Machine Translation
Mingzhou Xu, Liangyou Li, Derek F. Wong, Qun Liu and Lidia S. Chao
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema
Yanai Elazar, Hongming Zhang, Yoav Goldberg and Dan Roth
Heterogeneous Graph Neural Networks for Keyphrase Generation
Jiacheng Ye, Ruijian Cai, Tao Gui and Qi Zhang
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
Arij riabi, Thomas Scialom, Rachel Keraron, Benoît Sagot, Djamé Seddah and Jacopo Staiano
Learning grounded word meaning representations on similarity graphs
Mariella Dimiccoli, Herwig Wendt and Pau Batlle Franch
QuestEval: Summarization Asks for Fact-based Evaluation
Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang and patrick Gallinari
Uncovering Main Causalities for Long-tailed Information Extraction
Guoshun Nan, Jiaqi Zeng, Rui Qiao, Zhijiang Guo and Wei Lu
Seeking Common but Distinguishing Difference, A Joint Aspect-based Sentiment Analysis Model
Hongjiang Jing, Zuchao Li, Hai Zhao and Shu Jiang
Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision
Mieradilijiang Maimaiti, Yang Liu, Yuanhang Zheng, Gang Chen, Kaiyu Huang, Ji Zhang, Huanbo Luan and Maosong Sun
Evaluating the Robustness of Neural Language Models to Input Perturbations
Milad Moradi and Matthias Samwald
Leveraging Order-Free Tag Relations for Context-Aware Recommendation
Junmo Kang, Jeonghwan Kim, Suwon Shin and Sung-Hyon Myaeng
Adaptive Bridge between Training and Inference for Dialogue Generation
Haoran Xu, Hainan Zhang, Yanyan Zou, Hongshen Chen, Zhuoye Ding and Yanyan Lan
End-to-End Conversational Search for Online Shopping with Utterance Transfer
Liqiang Xiao, Jun Ma, Xin Luna Dong, Pascual Martínez-Gómez, Nasser Zalmout, Wei Chen, Tong Zhao, Hao He and Yaohui Jin
Natural Language Video Localization with Learnable Moment Proposals
Shaoning Xiao, Long Chen, Jian Shao, Yueting Zhuang and Jun Xiao
Graph Algorithms for Multiparallel Word Alignment
Ayyoob ImaniGooghari, Masoud Jalili Sabet, Lutfi Kerem Senel, Philipp Dufter, François Yvon and Hinrich Schütze

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models
Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou and Xu Sun
VeeAlign: Multifaceted Context Representation Using Dual Attention for Ontology Alignment
Vivek Iyer, Arvind Agarwal and Harshit Kumar
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation
Zeming Liu, Haifeng Wang, Zheng-Yu Niu, Hua Wu and Wanxiang Che
Aligning Actions Across Recipe Graphs
Lucia Donatelli, Theresa Schmidt, Debanjali Biswas, Arne Köhn, Fangzhou Zhai and Alexander Koller
PAUSE: Positive and Annealed Unlabeled Sentence Embedding
Lele Cao, Emil Larsson, Vilhelm von Ehrenheim, Dhiana Deva Cavalcanti Rocha, Anna Martin and Sonja Horn
Robustness Evaluation of Entity Disambiguation Using Prior Probes: the Case of Entity Overshadowing
Vera Provatorova, Samarth Bhargav, Svitlana Vakulenko and Evangelos Kanoulas
Detect and Classify – Joint Span Detection and Classification for Health Outcomes
Micheal Abaho, Danushka Bollegala, Paula Williamson and Susanna Dodd
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
Ruiyang Ren, Yingqi Qu, Jing Liu, Wayne Xin Zhao, QiaoQiao She, Hua Wu, Haifeng Wang and Ji-Rong Wen
Multi-Class Grammatical Error Detection for Correction: A Tale of Two Systems
Zheng Yuan, Shiva Taslimipoor, Christopher Davis and Christopher Bryant
Efficient Sampling of Dependency Structure
Ran Zmigrod, Tim Vieira and Ryan Cotterell
Fine-grained Entity Typing without Knowledge Base
Jing Qian, yibin liu, Lemao Liu, Yangming Li, Haiyun Jiang, Haisong Zhang and Shuming Shi
Fix-Filter-Fix: Intuitively Connect Any Models for Effective Bug Fixing
Haiwen Hong, Jingfeng Zhang, Yin Zhang, Yao Wan and Yulei Sui
IndoNLI: A Natural Language Inference Dataset for Indonesian
Rahmad Mahendra, Alham Fikri Aji, Samuel Louvan, Fahrurrozi Rahman and Clara Vania
BARThez: a Skilled Pretrained French Sequence-to-Sequence Model
Moussa Kamal Eddine, Antoine Tixier and Michalis Vazirgiannis
MTAdam: Automatic Balancing of Multiple Training Loss Terms
Itzik Malkiel and Lior Wolf
Aspect Sentiment Quad Prediction as Paraphrase Generation
Wenxuan Zhang, Yang Deng, Xin Li, Yifei Yuan, Lidong Bing and Wai Lam
Cross-lingual Aspect-based Sentiment Analysis with Aspect Term Code-Switching
Wenxuan Zhang, Ruidan He, Haiyun Peng, Lidong Bing and Wai Lam
Layer-wise Model Pruning based on Mutual Information
Chun Fan, Jiwei Li, Tianwei Zhang, Xiang Ao, Fei Wu, Yuxian Meng and Xiaofei Sun
Hierarchical Heterogeneous Graph Representation Learning for Short Text Classification
Yaqing Wang, Song Wang, Quanming Yao and Dejing Dou
Argument Pair Extraction with Mutual Guidance and Inter-sentence Relation Graph
Jianzhu Bao, Bin Liang, Jingyi Sun, Yice Zhang, Min Yang and Ruifeng Xu
ConRPG: Paraphrase Generation using Contexts as Regularizer
Yuxian Meng, Xiang Ao, Qing He, Xiaofei Sun, Qinghong Han, Fei Wu, Chun Fan and Jiwei Li
TEMP: Taxonomy Expansion with Dynamic Margin Loss through Taxonomy-Paths
Zichen Liu, Hongyuan Xu, Yanlong Wen, Ning Jiang, HaiYing Wu and Xiaojie Yuan
Minimal Supervision for Morphological Inflection
Omer Goldman and Reut Tsarfaty
CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild
Yuan Yao, Jiaju Du, Yankai Lin, Peng Li, Zhiyuan Liu, Jie Zhou and Maosong Sun
$k$Folden: $k$-Fold Ensemble for Out-Of-Distribution Detection
Xiaoya Li, Jiwei Li, Xiaofei Sun, Chun Fan, Tianwei Zhang, Fei Wu, Yuxian Meng and Jun Zhang
Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT
Elena Voita, Rico Sennrich and Ivan Titov
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
Wangchunshu Zhou, Tao Ge, Canwen Xu, Ke Xu and Furu Wei
Profanity-Avoiding Training Framework for Seq2seq Models with Certified Robustness
Hengtong Zhang, Tianhang Zheng, Yaliang Li, Jing Gao, Lu Su and Bo Li
Revisiting Tri-training of Dependency Parsers
Joachim Wagner and Jennifer Foster
Knowledge Base Completion Meets Transfer Learning
Vid Kocijan and Thomas Lukasiewicz
A Graph-Based Neural Model for End-to-End Frame Semantic Parsing
ZhiChao Lin, Yueheng Sun and Meishan Zhang
Monitoring geometrical properties of word embeddings for detecting the emergence of new topics.
Clément Christophe, Julien Velcin, Jairo Cugliari, Manel BOUMGHAR and Philippe Suignard
MassiveSumm: a very large-scale, very multilingual, news summarisation dataset
Daniel Varab and Natalie Schluter
Knowledge-Aware Graph-Enhanced GPT-2 for Dialogue State Tracking
Weizhe Lin, Bo-Hsiang Tseng and Bill Byrne
How much pretraining data do language models need to learn syntax?
Laura Pérez-Mayos, Miguel Ballesteros and Leo Wanner
Classification of hierarchical text using geometric deep learning: the case of clinical trials corpus
Sohrab Ferdowsi, Nikolay Borissov, Julien Knafou, Poorya Amini and Douglas Teodoro
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
Sebastian Ruder, Noah Constant, Jan Botha, Aditya Siddhant, Orhan Firat, Jinlan Fu, Pengfei Liu, Junjie Hu, Dan Garrette, Graham Neubig and Melvin Johnson
Adversarial Attack against Cross-lingual Knowledge Graph Alignment
Zeru Zhang, Zijie Zhang, Yang Zhou, Lingfei Wu, Sixing Wu, Xiaoying Han, Dejing Dou, Tianshi Che and Da Yan
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
Simiao Zuo, Chen Liang, Haoming Jiang, Xiaodong Liu, Pengcheng He, Jianfeng Gao, Weizhu Chen and Tuo Zhao
Structured Context and High-Coverage Grammar for Conversational Question Answering over Knowledge Graphs
Pierre Marion, Pawel Nowak and Francesco Piccinno
Fine-grained Factual Consistency Assessment for Abstractive Summarization Models
Sen Zhang, Jianwei Niu and Chuyuan Wei
Leveraging Capsule Routing to Associate Knowledge with Medical Literature Hierarchically
Xin Liu, Qingcai Chen, Junying Chen, Wenxiu Zhou, Tingyu Liu, Xinlan Yang and Weihua Peng
Model Selection for Cross-lingual Transfer
Yang Chen and Alan Ritter
Contextualized Query Embeddings for Conversational Search
Sheng-Chieh Lin, Jheng-Hong Yang and Jimmy Lin
Deep Attention Diffusion Graph Neural Networks for Text Classification
Yonghao Liu, Renchu Guan, fausto giunchiglia, Yanchun Liang and Xiaoyue Feng
Enhanced Language Representation with Label Knowledge for Span Extraction
Pan Yang, Xin Cong, Zhenyu Sun and Xingwu Liu
Effective Fine-Tuning Methods for Cross-lingual Adaptation
Tao Yu and Shafiq Joty
Wasserstein Selective Transfer Learning for Cross-domain Text Mining
Lingyun Feng, Minghui Qiu, Yaliang Li, Haitao Zheng and Ying Shen
Label-Enhanced Hierarchical Contextualized Representation for Sequential Metaphor Identification
Shuqun Li, Liang Yang, Weidong He, Shiqi Zhang, Jingjie Zeng and Hongfei LIN
Can We Improve Model Robustness through Secondary Attribute Counterfactuals?
Ananth Balashankar, Xuezhi Wang, Ben Packer, Nithum Thain, Ed Chi and Alex Beutel
From Alignment to Assignment: Frustratingly Simple Unsupervised Entity Alignment
Xin Mao, wenting wang, Yuanbin Wu and Man Lan
Multilingual Unsupervised Neural Machine Translation with Denoising Adapters
Ahmet Üstün, Alexandre Berard, Laurent Besacier and Matthias Gallé
Point-of-Interest Type Prediction using Text and Images
Danae Sánchez Villegas and Nikolaos Aletras
Foreseeing the Benefits of Incidental Supervision
Hangfeng He, Mingyuan Zhang, Qiang Ning and Dan Roth
AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions
Haoran Ding and Xiao Luo
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval
Kyoung-Rok Jang, Junmo Kang, Giwon Hong, Sung-Hyon Myaeng, Joohee Park, Taewon Yoon and Heecheol Seo
Discretized Integrated Gradients for Explaining Language Models
Soumya Sanyal and Xiang Ren
Neuralizing Regular Expressions for Slot Filling
Chengyue Jiang, Zijian Jin and Kewei Tu
Progressive Self-Training with Discriminator for Aspect Term Extraction
Qianlong Wang, zhiyuan wen, Qin Zhao, Min Yang and Ruifeng Xu
FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations
Lukas Lange, Heike Adel, Jannik Strötgen and Dietrich Klakow
The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers
Róbert Csordás, Kazuki Irie and Juergen Schmidhuber
Self-Supervised Quality Estimation for Machine Translation
Yuanhang Zheng, Zhixing Tan, Meng Zhang, Mieradilijiang Maimaiti, Huanbo Luan, Maosong Sun, Qun Liu and Yang Liu
Come hither or go away? Recognising pre-electoral coalition signals in the news
Ines Rehbein, Simone Paolo Ponzetto, Anna Adendorf, Oke Bahnsen, Lukas Stoetzer and Heiner Stuckenschmidt
Importance Estimation from Multiple Perspectives for Keyphrase Extraction
Mingyang Song, Liping Jing and Lin Xiao
Stepmothers are mean and academics are pretentious: What do pretrained language models learn about you?
Rochelle Choenni, Ekaterina Shutova and Robert van Rooij
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong, Yaliang Li, Ying Shen and Minghui Qiu
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection
Aditi Chaudhary, Kayo Yin, Antonios Anastasopoulos and Graham Neubig
Label Verbalization and Entailment for Effective Zero and Few-Shot Relation Extraction
Oscar Sainz, Oier Lopez de Lacalle, Gorka Labaka, Ander Barrena and Eneko Agirre
A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space
Alexander Jones, William Yang Wang and Kyle Mahowald
Contrastive Code Representation Learning
Paras Jain, Ajay Jain, Tianjun Zhang, Pieter Abbeel, Joseph Gonzalez and Ion Stoica
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis, Juan Antonio Pérez-Ortiz and Felipe Sánchez-Martínez
Building and Evaluating Open-Domain Dialogue Corpora with Clarifying Questions
Mohammad Ailannejadi, Julia Kiseleva, Aleksandr Chuklin, Jeff Dalton and Mikhail Burtsev
End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs
Dinesh Raghu, Shantanu Agarwal, Sachindra Joshi and Mausam -
Open Knowledge Graphs Canonicalization using Variational Autoencoders
Sarthak Dash, Gaetano Rossiello, Nandana Mihindukulasooriya, Sugato Bagchi and Alfio Gliozzo
Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods
Peru Bhardwaj, John Kelleher, Luca Costabello and Declan O’Sullivan
Continual Few-Shot Learning for Text Classification
Ramakanth Pasunuru, Veselin Stoyanov and Mohit Bansal
Disentangling Representations of Text by Masking Transformers
Xiongyi Zhang, Jan-Willem van de Meent and Byron Wallace
Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning
Ruben Branco, António Branco, João António Rodrigues and João Ricardo Silva
Putting Words in BERT’s Mouth: Navigating Contextualized Vector Spaces with Pseudowords
Taelin Karidi, Yichu Zhou, Nathan Schneider, Omri Abend and Vivek Srikumar
Generic resources are what you need: Style transfer tasks without task-specific parallel training data
Huiyuan Lai, Antonio Toral and Malvina Nissim
Finding needles in a haystack: Sampling Structurally-diverse Training Sets from Synthetic Data for Compositional Generalization
Inbar Oren, Jonathan Herzig and Jonathan Berant
Jointly Learning to Repair Code and Generate Commit Message
Jiaqi Bai, Long Zhou, Ambrosio Blanco, Shujie Liu, Furu Wei, Ming Zhou and Zhoujun Li
Journalistic Guidelines Aware News Image Captioning
Xuewen Yang, Svebor Karaman, Joel Tetreault and Alejandro Jaimes
Distilling Linguistic Context for Language Model Compression
Geondo Park, Gyeongman Kim and Eunho Yang
Agreeing to Disagree: Annotating Offensive Language Datasets with Annotators’ Disagreement
Elisa Leonardelli, Stefano Menini, Alessio Palmero Aprosio, Marco Guerini and Sara Tonelli
#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention
Yuji Zhang, Yubo Zhang, Chunpu Xu, Jing Li, Ziyan Jiang and Baolin Peng
Signed Coreference Resolution
Kayo Yin, Kenneth DeHaan and Malihe Alikhani
Wino-X: Multilingual Winograd Schemas for Commonsense Reasoning and Coreference Resolution
Denis Emelin and Rico Sennrich
ReGen: Reinforcement Learning for Text and Knowledge Base Generation using Pretrained Language Models
Pierre Dognin, Inkit Padhi, Igor Melnyk and Payel Das
Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences
Denis Emelin, Ronan Le Bras, Jena D. Hwang, Maxwell Forbes and Yejin Choi
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Yelysei Bondarenko, Markus Nagel and tijmen Blankevoort
GeneSis: A Generative Approach to Substitutes in Context
Caterina Lacerra, Rocco Tripodi and Roberto Navigli
Maximal Clique Based Non-Autoregressive Open Information Extraction
Bowen Yu, Yucheng Wang, Tingwen Liu, Hongsong Zhu, Limin Sun and Bin Wang
Sorting through the noise: Testing robustness of information processing in pre-trained language models
Lalchand Pandia and Allyson Ettinger
The Impact of Positional Encodings on Multilingual Compression
Vinit Ravishankar and Anders Søgaard
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers
Stella Frank, Emanuele Bugliarello and Desmond Elliott
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational Recommendation
Wenchang Ma, Ryuichi Takanobu and Minlie Huang
Asking It All: Generating Contextualized Questions for any Semantic Role
Valentina Pyatkin, Paul Roit, Julian Michael, Yoav Goldberg, Reut Tsarfaty and Ido Dagan
ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization
Alireza Salemi, Emad Kebriaei, Ghazal Neisi Minaei and Azadeh Shakery
Efficient Nearest Neighbor Language Models
Junxian He, Graham Neubig and Taylor Berg-Kirkpatrick
Self-Supervised Detection of Contextual Synonyms in a Multi-Class Setting: Phenotype Annotation Use Case
Jingqing Zhang, Luis Bolanos Trujillo, Tong Li, Ashwani Tanwar, Guilherme Freire, Xian Yang, Julia Ive, Vibhor Gupta and Yike Guo
Integrating Deep Event-Level and Script-Level Information for Script Event Prediction
Long Bai, Saiping Guan, Jiafeng Guo, Zixuan Li, Xiaolong Jin and Xueqi Cheng
Visual News: Benchmark and Challenges in News Image Captioning
Fuxiao Liu, Yinghan Wang, Tianlu Wang and Vicente Ordonez
Models and Datasets for Cross-Lingual Summarisation
Laura Perez-Beltrachini and Mirella Lapata
Cross-lingual Sentence Embedding using Multi-Task Learning
Koustava Goswami, Sourav Dutta, Haytham Assem, Theodorus Fransen and John P. McCrae
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation
Haoran Xu, Benjamin Van Durme and Kenton Murray
Weakly supervised discourse segmentation for multiparty oral conversations
Lila Gravellier, Julie Hunter, Philippe Muller, Thomas Pellegrini and Isabelle Ferrané
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Satwik Kottur, Seungwhan Moon, Alborz Geramifard and Babak Damavandi
Unsupervised Conversation Disentanglement through Co-Training
Hui Liu, Zhan Shi and Xiaodan Zhu
IR like a SIR: Sense-enhanced Information Retrieval for Multiple Languages
Rexhina Blloshmi, Tommaso Pasini, Niccolò Campolungo, Somnath Banerjee, Roberto Navigli and Gabriella Pasi
Artificial Text Detection via Examining the Topology of Attention Maps
Laida Kushnareva, Daniil Cherniavskii, Vladislav Mikhailov, Ekaterina Artemova, Serguei Barannikov, Alexander Bernstein, Irina Piontkovskaya, Dmitri Piontkovski and Evgeny Burnaev
ConSeC: Word Sense Disambiguation as Continuous Sense Comprehension
Edoardo Barba, Luigi Procopio and Roberto Navigli
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints
Sahana Ramnath, Melvin Johnson, Abhirut Gupta and Aravindan Raghuveer
Narrative Embedding: Re-Contextualization Through Attention
Sean Wilner, Daniel Woolridge and Madeleine Glick
Mitigating Language-Dependent Ethnic Bias in BERT
Jaimeen Ahn and Alice Oh
Tribrid: Stance Classification with Neural Inconsistency Detection
Song Yang and Jacopo Urbani
Fast WordPiece Tokenization
Xinying Song, Alex Salcianu, Yang Song, Dave Dopson and Denny Zhou
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
Max Bartolo, Tristan Thrush, Robin Jia, Sebastian Riedel, Pontus Stenetorp and Douwe Kiela
Constructing a Psychometric Testbed for Fair Natural Language Processing
Ahmed Abbasi, David Dobolyi, John P. Lalor, Richard Netemeyer, Kendall Smith and Yi Yang
Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking
Nikita Moghe, Mark Steedman and Alexandra Birch
MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks
Cristian-Paul Bara, Sky CH-Wang and Joyce Chai
Neural Attention-Aware Hierarchical Topic Model
YUAN JIN, He Zhao, Ming Liu, Lan Du and Wray Buntine
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition
Raphael Olivier and Bhiksha Raj
Rationales for Sequential Predictions
Keyon Vafa, Yuntian Deng, David Blei and Alexander Rush
One Source, Two Targets: Challenges and Rewards of Dual Decoding
Jitao Xu and François Yvon
SPECTRA: Sparse Structured Text Rationalization
Nuno M. Guerreiro and André F. T. Martins
Active Learning by Acquiring Contrastive Examples
Katerina Margatina, Giorgos Vernikos, Loïc Barrault and Nikolaos Aletras
Data-to-text Generation by Splicing Together Nearest Neighbors
Sam Wiseman, Arturs Backurs and Karl Stratos
SYSML: StYlometry with Structure and Multitask Learning: Implications for Darknet Forum Migrant Analysis
Pranav Maneriker, Yuntian He and Srinivasan Parthasarathy
Frequency Effects on Syntactic Rule Learning in Transformers
Jason Wei, Dan Garrette, Tal Linzen and Ellie Pavlick
A surprisal—duration trade-off across and within the world’s languages
Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi and Ryan Cotterell
Unsupervised Relation Extraction: A Variational Autoencoder Approach
Chenhan Yuan and Hoda Eldardiry
Aligning Multidimensional Worldviews and Discovering Ideological Differences
Jeremiah Milbauer, Adarsh Mathew and James Evans
Extend, don’t rebuild: Phrasing conditional graph modification as autoregressive sequence labelling
Leon Weber, Jannes Münchmeyer, Samuele Garda and Ulf Leser
Simple Conversational Data Augmentation for Semi-supervised Abstractive Dialogue Summarization
Jiaao Chen and Diyi Yang
You should evaluate your language model on marginal likelihood over tokenisations
Kris Cao and Laura Rimell
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary, Saket Maheshwary and Vikram Pudi
All Bark and No Bite: Rogue Dimensions in Transformer Language Models Obscure Representational Quality
William Timkey and Marten van Schijndel
Revisiting the Uniform Information Density Hypothesis
Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell and Roger Levy
Towards Label-Agnostic Emotion Embeddings
Sven Buechel, Luise Modersohn and Udo Hahn
Conditional Poisson Stochastic Beams
Clara Meister, Afra Amini, Tim Vieira and Ryan Cotterell
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Hosein Mohebbi, Ali Modarressi and Mohammad Taher Pilehvar
(Mis)alignment Between Stance Expressed in Social Media Data and Public Opinion Surveys
Kenneth Joseph, Sarah Shugars, Ryan Gallagher, Jon Green, Alexi Quintana Mathé, zijian an and David Lazer
Inducing Transformer’s Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks
Yichen Jiang and Mohit Bansal
Phrase Retrieval Learns Passage Retrieval, Too
Jinhyuk Lee, Alexander Wettig and Danqi Chen
Controllable Semantic Parsing via Retrieval Augmentation
Panupong Pasupat, Yuan Zhang and Kelvin Guu
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Detection in Conversational AI
Amanda Cercas Curry, Gavin Abercrombie and Verena Rieser
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief
Nora Kassner, Oyvind Tafjord, Hinrich Schütze and Peter Clark
STraTA: Self-Training with Task Augmentation for Better Few-shot Learning
Tu Vu, Minh-Thang Luong, Quoc Le, Grady Simon and Mohit Iyyer
Semi-Supervised Exaggeration Detection of Health Science Press Releases
Dustin Wright and Isabelle Augenstein
Contrastive Explanations for Model Interpretability
Alon Jacovi, Swabha Swayamdipta, Shauli Ravfogel, Yanai Elazar, Yejin Choi and Yoav Goldberg
ValNorm Quantifies Semantics to Reveal Consistent Valence Biases Across Languages and Over Centuries
Autumn Toney and Aylin Caliskan
Data Augmentation for Cross-Domain Named Entity Recognition
Shuguang Chen, Gustavo Aguilar, Leonardo Neves and Thamar Solorio
Improved Latent Tree Induction with Distant Supervision via Span Constraints
Zhiyang Xu, Andrew Drozdov, Jay Yoon Lee, Tim O’Gorman, Subendhu Rongali, Dylan Finkbeiner, Shilpa Suresh, Mohit Iyyer and Andrew McCallum
Incorporating medical knowledge in BERT for clinical relation extraction
Arpita Roy and Shimei Pan
GupShup: Summarizing Open-Domain Code-Switched Conversations
Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle G. Lee, Anish Acharya and Rajiv Ratn Shah
Semantic Novelty Detection in Natural Language Descriptions
Nianzu Ma, Alexander Politowicz, Sahisnu Mazumder, Jiahua Chen, Bing Liu, Eric Robertson and Scott Grigsby
TADPOLE: Task ADapted Pre-Training via AnOmaLy DEtection
Vivek Madan, Ashish Khetan and Zohar Karnin
How Do Neural Sequence Models Generalize? Local and Global Cues for Out-of-Distribution Prediction
D. Anthony Bau and Jacob Andreas
PRIDE: Predicting Relationships in Conversations
Anna Tigunova, Paramita Mirza, Andrew Yates and Gerhard Weikum
ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic Relations
Rujun Han, I-Hung Hsu, Jiao Sun, Julia Baylon, Qiang Ning, Dan Roth and Nanyun Peng
Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference
William Held, Dan Iter and Dan Jurafsky
Robust Retrieval Augmented Generation for Zero-shot Slot Filling
Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury and Alfio Gliozzo
IGA: An Intent-Guided Authoring Assistant
Simeng Sun, Wenlong Zhao, Varun Manjunatha, Rajiv Jain, Vlad Morariu, Franck Dernoncourt, Balaji Vasan Srinivasan and Mohit Iyyer
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo, Alexandre Sablayrolles, Hervé Jégou and Douwe Kiela
Do Long-Range Language Models Actually Use Long-Range Context?
Simeng Sun, Kalpesh Krishna, Andrew Mattarella-Micke and Mohit Iyyer
ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning
Rujun Han, Xiang Ren and Nanyun Peng
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications
Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Lucia Specia and Francisco Guzmán
FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging
Han Guo, Nazneen Fatema Rajani, Peter Hase, Mohit Bansal and Caiming Xiong
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts
Ashutosh Baheti, Maarten Sap, Alan Ritter and Mark Riedl
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Zeqiu Wu, Bo-Ru Lu, Hannaneh Hajishirzi and Mari Ostendorf
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation
Marzena Karpinska, Nader Akoury and Mohit Iyyer
Contrastive Out-of-Distribution Detection for Pretrained Transformers
Wenxuan Zhou, Fangyu Liu and Muhao Chen
Learning from Noisy Labels for Entity-Centric Information Extraction
Wenxuan Zhou and Muhao Chen
Improving Stance Detection with Multi-Dataset Learning and Knowledge Distillation
Yingjie Li, Chenye Zhao and Cornelia Caragea
Few-Shot Emotion Recognition in Conversation with Sequential Prototypical Networks
Gaël Guibon, Matthieu Labeau, Hélène Flamein, Luce Lefeuvre and Chloé Clavel
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin, Christopher Lin, Sam Thomson, Charles Chen, Subhro Roy, Emmanouil Antonios Platanios, Adam Pauls, Dan Klein, Jason Eisner and Benjamin Van Durme
Synthetic Textual Features for the Large-Scale Detection of Basic-level Categories in English and Mandarin
Yiwen Chen and Simone Teufel
Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration
Shufan Wang, Laure Thompson and Mohit Iyyer
Adversarial Scrubbing of Demographic Information for Text Classification
Somnath Basu Roy Chowdhury, Sayan Ghosh, Yiyuan Li, Junier Oliva, Shashank Srivastava and Snigdha Chaturvedi
Comparing Text Representations: A Theory-Driven Approach
Gregory Yauney and David Mimno
The World of an Octopus: How Reporting Bias Influences a Language Model’s Perception of Color
Cory Paik, Stéphane Aroca-Ouellette, Alessandro Roncone and Katharina Kann
WhyAct: Identifying Action Reasons in Lifestyle Vlogs
Oana Ignat, Santiago Castro, Hanwen Miao, Weiji Li and Rada Mihalcea
Efficient Inference for Multilingual Neural Machine Translation
Alexandre Berard, Dain Lee, Stephane Clinchant, Kweonwoo Jung and Vassilina Nikoulina
Hitting your MARQ: Multimodal ARgument Quality Assessment in Long Debate Video
Md Kamrul Hasan, James Spann, Masum Hasan, Md. Saiful Islam, Kurtis Haut, Rada Mihalcea and Ehsan Hoque
Do Transformer Modifications Transfer Across Implementations and Applications?
Sharan Narang, Hyung Won Chung, Yi Tay, Liam Fedus, Thibault Fevry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts and Colin Raffel
Flexible Generation of Natural Language Deductions
Kaj Bostrom, Xinyu Zhao, Swarat Chaudhuri and Greg Durrett
QA-Align: Representing Cross-Text Content Overlap by Aligning Question-Answer Propositions
Daniela Brook Weiss, Paul Roit, Ayal Klein, Ori Ernst and Ido Dagan
Truth-Conditional Captions for Time Series Data
Harsh Jhamtani and Taylor Berg-Kirkpatrick
RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging
Jie Hao, Linfeng Song, Liwei Wang, Kun Xu, Zhaopeng Tu and Dong Yu
Extracting Material Property Measurement Data from Scientific Articles
Gihan Panapitiya, Fred Parks, Jonathan Sepulveda and Emily Saldanha
Learning Opinion Summarizers by Selecting Informative Reviews
Arthur Bražinskas, Mirella Lapata and Ivan Titov
Studying word order through iterative shuffling
Nikolay Malkin, Sameera Lanka, Pranav Goel and Nebojsa Jojic
AESOP: Paraphrase Generation with Adaptive Syntactic Control
Jiao Sun, Xuezhe Ma and Nanyun Peng
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Mahsa Yarmohammadi, Shijie Wu, Marc Marone, Haoran Xu, Seth Ebner, Guanghui Qin, Yunmo Chen, Jialiang Guo, Craig Harman, Kenton Murray, Aaron Steven White, Mark Dredze and Benjamin Van Durme
Block Pruning For Faster Transformers
François Lagunas, Ella Charlaix, Victor Sanh and Alexander Rush
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
Swarnadeep Saha, Prateek Yadav, Lisa Bauer and Mohit Bansal
Math Word Problem Generation with Mathematical Consistency and Problem Context Constraints
Zichao Wang, Andrew Lan and Richard Baraniuk
Controlling Machine Translation for Multiple Attributes with Additive Interventions
Andrea Schioppa, David Vilar, Artem Sokolov and Katja Filippova
Building Adaptive Acceptability Classifiers for Neural NLG
Soumya Batra, Shashank Jain, Peyman Heidari, Ankit Arun, Catharine Youngs, Xintong Li, Pinar Donmez, Shawn Mei, Shiunzu Kuo, Vikas Bhardwaj, Anuj Kumar and Michael White
Explaining Answers with Entailment Trees
Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura and Peter Clark
Modeling Document-Level Context for Event Detection via Important Context Selection
Amir Pouran Ben Veyseh, Minh Van Nguyen, Nghia Ngo Trung, Bonan Min and Thien Huu Nguyen
Paired Examples as Indirect Supervision in Latent Decision Models
Nitish Gupta, Sameer Singh, Matt Gardner and Dan Roth
“Was it “stated” or was it “claimed”?: How linguistic bias affects generative language models
Roma Patel and Ellie Pavlick
Pairwise Supervised Contrastive Learning of Sentence Representations
Dejiao Zhang, Shang-Wen Li, Wei Xiao, Henghui Zhu, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang
Refocusing on Relevance: Personalization in NLG
Shiran Dudy, Steven Bedrick and Bonnie Webber
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing Chen, Xiaodong Liu, Jianfeng Gao, Jian Jiao, Ruofei Zhang and Yangfeng Ji
Muppet: Massive Multi-task Representations with Pre-Finetuning
Armen Aghajanyan, Anchit Gupta, Akshat Shrivastava, Xilun Chen, Luke Zettlemoyer and Sonal Gupta
RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms
Pei Zhou, Rahul Khanna, Seyeon Lee, Bill Yuchen Lin, Daniel Ho, Jay Pujara and Xiang Ren
SituatedQA: Incorporating Extra-Linguistic Contexts into QA
Michael Zhang and Eunsol Choi
Competency Problems: On Finding and Removing Artifacts in Language Data
Matt Gardner, William Merrill, Jesse Dodge, Matthew Peters, Alexis Ross, Sameer Singh and Noah A. Smith
A Generative Framework for Simultaneous Machine Translation
Yishu Miao, Phil Blunsom and Lucia Specia
BiSECT: Learning to Split and Rephrase Sentences with Bitexts
Joongwon Kim, Mounica Maddela, Reno Kriz, Wei Xu and Chris Callison-Burch
When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute
Tao Lei
Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions
Arjun Akula, Spandana Gella, Keze Wang, Song-Chun Zhu and Siva Reddy
Detecting Contact-Induced Semantic Shifts: What Can Embedding-Based Methods Do in Practice?
Filip Miletic, Anne Przewozny-Desriaux and Ludovic Tanguy
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization
Arjun Akula, Soravit Changpinyo, Boqing Gong, Piyush Sharma, Song-Chun Zhu and Radu Soricut
Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation
Ashkan Alinejad, Hassan S. Shavarani and Anoop Sarkar
How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?
Indira Sen, Mattia Samory, Fabian Flöck, Claudia Wagner and Isabelle Augenstein
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus
Jesse Dodge, Maarten Sap, Ana Marasović, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Margaret Mitchell and Matt Gardner
Softmax Tree: An Accurate, Fast Classifier When the Number of Classes Is Large
Arman Zharmagambetov, Magzhan Gabidolla and Miguel Carreira-Perpinan
Finding a Balanced Degree of Automation for Summary Evaluation
Shiyue Zhang and Mohit Bansal
AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages
Machel Reid, Junjie Hu, Graham Neubig and Yutaka Matsuo
Surface Form Competition: Why the Highest Probability Answer Isn’t Always Right
Ari Holtzman, Peter West, Vered Shwartz, Yejin Choi and Luke Zettlemoyer
Weakly-Supervised Visual-Retriever-Reader for Knowledge-based Question Answering
Man Luo, Yankai Zeng, Pratyay Banerjee and Chitta Baral
Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories
Wenlin Yao, Xiaoman Pan, Lifeng Jin, Jianshu Chen, Dian Yu and Dong Yu
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni and Ali Farhadi
Crosslingual Transfer Learning for Relation and Event Extraction via Word Category and Class Alignments
Minh Van Nguyen, Tuan Ngo Nguyen, Bonan Min and Thien Huu Nguyen
NDH-Full: Learning and Evaluating Navigational Agents on Full-Length Dialogue
Hyounghun Kim, Jialu Li and Mohit Bansal
Jump-Starting Item Parameters for Adaptive Language Tests
Arya D. McCarthy, Kevin P. Yancey, Geoff LaFlair, Jesse Egbert, Manqian Liao and Burr Settles
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
Mai ElSherief, Caleb Ziems, David Muchlinski, Vaishnavi Anupindi, Jordyn Seybolt, Munmun De Choudhury and Diyi Yang
Integrating Visuospatial, Linguistic, and Commonsense Structure into Story Visualization
Adyasha Maharana and Mohit Bansal
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Trapit Bansal, Karthick Prasad Gunasekaran, Tong Wang, Tsendsuren Munkhdalai and Andrew McCallum
A Large-Scale Study of Machine Translation in Turkic Languages
Jamshidbek Mirzakhalov, Anoop Babu, Duygu Ataman, Sherzod Kariev, Francis Tyers, Otabek Abduraufov, Mammad Hajili, Sardana Ivanova, Abror Khaytbaev, Antonio Laverghetta Jr., Bekhzodbek Moydinboyev, Esra Onal, Shaxnoza Pulatova, Ahsan Wahab, Orhan Firat and Sriram Chellappan
Navigating the Kaleidoscope of COVID-19 Misinformation Using Deep Learning
Yuanzhi Chen and Mohammad Hasan
CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks
Zixuan Ke, Bing Liu, Hu Xu and Lei Shu
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer and Christoph Feichtenhofer
Pre-train or Annotate? Domain Adaptation with a Constrained Budget
Fan Bai, Alan Ritter and Wei Xu
Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi, Wanhao Zhou, Lingjing Kong, Fengyu Cai, Minlie Huang and Boi Faltings
Human Rationales as Attribution Priors for Explainable Stance Detection
Sahil Jayaram and Emily Allaway
ReasonBERT: Pre-trained to Reason with Distant Supervision
Xiang Deng, Yu Su, Alyssa Lees, You Wu, Cong Yu and Huan Sun
Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation
Mozhdeh Gheini, Xiang Ren and Jonathan May
The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders
Han He and Jinho D. Choi
Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System
Libo Qin, Tianbao Xie, Shijue Huang, Qiguang Chen, Xiao Xu and Wanxiang Che
Continual Learning in Task-Oriented Dialogue Systems
Andrea Madotto, Zhaojiang Lin, Zhenpeng Zhou, Seungwhan Moon, Paul Crook, Bing Liu, Zhou Yu, Eunjoon Cho, Pascale Fung and Zhiguang Wang
Building the Directed Semantic Graph for Coherent Long Text Generation
Ziao Wang, Xiaofeng Zhang and Hongwei Du
Salience-Aware Event Chain Modeling for Narrative Understanding
Xiyang Zhang, Muhao Chen and Jonathan May
Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models
Yiwen Wang, Jennifer Hu, Roger Levy and Peng Qian
StreamHover: Livestream Transcript Summarization and Annotation
Sangwoo Cho, Franck Dernoncourt, Tim Ganter, Trung Bui, Nedim Lipka, Walter Chang, Hailin Jin, Jonathan Brandt, Hassan Foroosh and Fei Liu
Zero-Shot Dialogue State Tracking via Cross-Task Transfer
Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Zhenpeng Zhou, Paul Crook, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba and Pascale Fung
Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction
Xuming Hu, Chenwei Zhang, Yawen Yang, Xiaohe Li, Li Lin, Lijie Wen and Philip S. Yu
Neural Natural Logic Inference for Interpretable Question Answering
Jihao Shi, Xiao Ding, Li Du, Ting Liu and Bing Qin
Linguistic Dependencies and Statistical Dependence
Jacob Hoover, Wenyu Du, Alessandro Sordoni and Timothy J. O’Donnell
Text2Mol: Cross-Modal Molecule Retrieval with Natural Language Queries
Carl Edwards, ChengXiang Zhai and Heng Ji
Analyzing the Surprising Variability in Word Embedding Stability Across Languages
Laura Burdick, Jonathan K. Kummerfeld and Rada Mihalcea
Few-Shot Named Entity Recognition: An Empirical Baseline Study
Jiaxin Huang, Chunyuan Li, Krishan Subudhi, Damien Jose, Shobana Balakrishnan, Weizhu Chen, Baolin Peng, Jianfeng Gao and Jiawei Han
Sequential Cross-Document Coreference Resolution
Emily Allaway, Shuai Wang and Miguel Ballesteros
Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer
Eleftheria Briakou, Sweta Agrawal, Joel Tetreault and Marine Carpuat
Iterative GNN-based Decoder for Question Generation
Zichu Fei, Qi Zhang and Yaqian Zhou
Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection
Thuy-Trang Vu, Xuanli He, Dinh Phung and Gholamreza Haffari
Decision-Focused Summarization
Chao-Chun Hsu and Chenhao Tan
Rule-based Morphological Inflection Improves Neural Terminology Translation
Weijia Xu and Marine Carpuat
Chinese Opinion Role Labeling with Corpus Translation: A Pivot Study
Ranran Zhen, Rui Wang, Guohong Fu, Chengguo Lv and Meishan Zhang
Detecting Health Advice in Medical Research Literature
Yingya Li, Jun Wang and Bei Yu
A Semantic Feature-Wise Transformation Relation Network for Automatic Short Answer Grading
Zhaohui Li, Yajur Tomar and Rebecca J. Passonneau
Structure-aware Fine-tuning of Sequence-to-sequence Transformers for Transition-based AMR Parsing
Jiawei Zhou, Tahira Naseem, Ramón Fernandez Astudillo, Young-Suk Lee, Radu Florian and Salim Roukos
CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization
Shuyang Cao and Lu Wang
NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media
Grace Luo, Trevor Darrell and Anna Rohrbach
Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset
Tianqing Fang, Weiqi Wang, Sehyun Choi, Shibo Hao, Hongming Zhang, Yangqiu Song and Bin He
RankNAS: Efficient Neural Architecture Search by Pairwise Ranking
Chi Hu, Chenglong Wang, Xiangnan Ma, Xia Meng, Yinqiao Li, Tong Xiao, Jingbo Zhu and changliang li
Universal-KD: Attention-based Output-Grounded Intermediate Layer Knowledge Distillation
Yimeng Wu, Mehdi Rezagholizadeh, Abbas Ghaddar, Md Akmal Haidar and Ali Ghodsi
Enriching and Controlling Global Semantics for Text Summarization
Thong Nguyen, Anh Tuan Luu, Truc Lu and Tho Quan

MRF-Chat: Improving Dialogue with Markov Random Fields
Ishaan Grover, Matthew Huggins, Cynthia Breazeal and Hae Won Park
Learning Kernel-Smoothed Machine Translation with Retrieved Examples
Qingnan Jiang, Mingxuan Wang, Jun Cao, Shanbo Cheng, Shujian Huang and Lei Li
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation
Mingkai Deng, Bowen Tan, Zhengzhong Liu, Eric Xing and Zhiting Hu
Think about it! Improving defeasible reasoning by first modeling the question scenario.
Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Peter Clark, Yiming Yang and Eduard Hovy
Rethinking Denoised Auto-Encoding in Language Pre-Training
Fuli Luo, Pengcheng Yang, Shicheng Li, Xuancheng Ren, Xu Sun, Songfang Huang and Fei Huang
Collaborative Learning of Bidirectional Decoders for Unsupervised Text Style Transfer
Yun Ma, Yangbin Chen, xudong mao and Qing Li
Evaluating Scholarly Impact: Towards Content-Aware Bibliometrics
Saurav Manchanda and George Karypis
Low-resource Taxonomy Enrichment with Pretrained Language Models
Kunihiro Takeoka, Kosuke Akimoto and Masafumi Oyamada
Dialogue State Tracking with a Language Model using Schema-Driven Prompting
Chia-Hsuan Lee, Hao Cheng and Mari Ostendorf
PASTE: A Tagging-Free Decoding Framework Using Pointer Networks for Aspect Sentiment Triplet Extraction
Rajdeep Mukherjee, Tapas Nayak, Yash Butala, Sourangshu Bhattacharya and Pawan Goyal
Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi and Kentaro Inui
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution
Zongyi Li, Jianhan Xu, Jiehang Zeng, Linyang Li, Xiaoqing Zheng, Qi Zhang, Kai-Wei Chang and Cho-Jui Hsieh
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre, Kartik Perisetla, Anthony Chen, Nikhil Ramesh, Chris DuBois and Sameer Singh
WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Xingyu Chen, Zihan Zhao, Lu Chen, JiaBao JI, Danyang Zhang, Ao Luo, Yuxuan Xiong and Kai Yu
Transferable Persona-Grounded Dialogues via Grounded Minimal Edits
Chen Henry Wu, Yinhe Zheng, Xiaoxi Mao and Minlie Huang
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
Colin Clement, Shuai Lu, Xiaoyu Liu, Michele Tufano, Dawn Drain, Nan Duan, Neel Sundaresan and Alexey Svyatkovskiy
PDALN: Progressive Domain Adaptation over a Pre-trained Model for Low-Resource Cross-Domain Named Entity Recognition
Tao Zhang, Congying Xia, Philip S. Yu, Zhiwei Liu and Shu Zhao
A Relation-Oriented Clustering Method for Open Relation Extraction
Jun Zhao, Tao Gui, Qi Zhang and Yaqian Zhou
Not All Negatives are Equal: Label-Aware Contrastive Loss for Fine-grained Text Classification
Varsha Suresh and Desmond Ong
Mitigating False-Negative Contexts in Multi-document Question Answering with Retrieval Marginalization
Ansong Ni, Matt Gardner and Pradeep Dasigi
Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage Retrieval
Devang Kulshreshtha, Robert Belfer, Iulian Vlad Serban and Siva Reddy
Text Counterfactuals via Latent Optimization and Shapley-Guided Search
Xiaoli Fern and Quintin Pope
Automated Generation of Accurate & Fluent Medical X-ray Reports
Hoang Nguyen, Dong Nie, Taivanbat Badamdorj, Yujie Liu, yingying zhu, Jason Truong and Li Cheng
Word Reordering for Zero-shot Cross-lingual Structured Prediction
Tao Ji, Yong Jiang, Tao Wang, Zhongqiang Huang, Fei Huang, Yuanbin Wu and Xiaoling Wang
A Unified Encoding of Structures in Transition Systems
Tao Ji, Yong Jiang, Tao Wang, Zhongqiang Huang, Fei Huang, Yuanbin Wu and Xiaoling Wang
Zero-Shot Information Extraction as a Unified Text-to-Triple Translation
Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang and Dawn Song
Enhancing Document Ranking with Task-adaptive Training and Segmented Token Recovery Mechanism
Xingwu Sun, Yanling Cui, Hongyin Tang, Fuzheng Zhang, Beihong Jin and Shi Wang
Re-embedding Difficult Samples via Mutual Information Constrained Semantically Oversampling for Imbalanced Text Classification
Jiachen Tian, Shizhan Chen, Xiaowang Zhang, Zhiyong Feng, Deyi Xiong, Shaojuan Wu and Chunliu Dou
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate
Jongyoon Song, Sungwon Kim and Sungroh Yoon
Smoothing Dialogue States for Open Conversational Machine Reading
Zhuosheng Zhang, Siru Ouyang, Hai Zhao, Masao Utiyama and Eiichiro Sumita
Beyond Text: Incorporating Metadata and Label Structure for Multi-Label Document Classification using Heterogeneous Graphs
Chenchen Ye, Linhai Zhang, Yulan He, Deyu Zhou and Jie Wu
A Scalable Framework for Learning From Implicit User Feedback to Improve Natural Language Understanding in Large-Scale Conversational AI Systems
Sunghyun Park, Han Li, Ameen Patel, Sidharth Mudgal, Sungjin Lee, Young-Bum Kim, Spyros Matsoukas and Ruhi Sarikaya
Improving Federated Learning for Aspect-based Sentiment Analysis via Topic Memories
Han Qin, Guimin Chen, Yuanhe Tian and Yan Song
Perturbation CheckLists for Evaluating NLG Evaluation Metrics
Ananya B. Sai, Tanay Dixit, Dev Sheth, Sreyas Mohan and Mitesh M. Khapra
Learning Logic Rules for Document-Level Relation Extraction
Dongyu Ru, Changzhi Sun, Jiangtao Feng, Lin Qiu, Hao Zhou, Weinan Zhang, Yong Yu and Lei Li
Improving Multilingual Translation by Representation and Gradient Regularization
Yilin Yang, Akiko Eriguchi, Alexandre Muzio, Prasad Tadepalli, Stefan Lee and Hany Hassan
Exploring Methods for Generating Feedback Comments for Writing Learning
Kazuaki Hanawa, Ryo Nagata and Kentaro Inui
Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification
Alexander Spangher, Jonathan May, Sz-Rung Shiang and Lingjia Deng
Modeling Disclosive Transparency in NLP Application Descriptions
Michael Saxon, Sharon Levy, Xinyi Wang, Alon Albalak and William Yang Wang
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models
Ivan Vulić, Pei-Hao Su, Samuel Coope, Daniela Gerz, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić and Tsung-Hsien Wen
Implicit Sentiment Analysis with Event-centered Text Representation
Deyu Zhou, Jianan Wang, Linhai Zhang and Yulan He
MS-Mentions: Consistently Annotating Entity Mentions in Materials Science Procedural Text
Tim O’Gorman, Zach Jensen, Sheshera Mysore, Kevin Huang, Rubayyat Mahbub, Elsa Olivetti and Andrew McCallum
FLiText: A Faster and Lighter Semi-Supervised Text Classification with Convolution Networks
Chen Liu, zhang mengchao, Fu Zhibing, Panpan Hou and Yu Li
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models
Kun Zhou, Wayne Xin Zhao, Sirui Wang, Fuzheng Zhang, Wei Wu and Ji-Rong Wen
EARL: Informative Knowledge-Grounded Conversation Generation with Entity-Agnostic Representation Learning
Hao Zhou, Minlie Huang, Yong Liu, Wei Chen and Xiaoyan Zhu
On the Transferability of Adversarial Attacks against Neural Text Classifier
Liping Yuan, Xiaoqing Zheng, Yi Zhou, Cho-Jui Hsieh and Kai-Wei Chang
Machine Translation Decoding beyond Beam Search
Rémi Leblond, Jean-Baptiste Alayrac, laurent sifre, Miruna Pislar, Lespiau Jean-Baptiste, Ioannis Antonoglou, Karen Simonyan and Oriol Vinyals
AUTOSUMM: Automatic Model Creation for Text Summarization
Sharmila Reddy Nangi, Atharv Tyagi, Jay Mundra, Sagnik Mukherjee, Raj Snehal, Niyati Chhaya and Aparna Garimella
Natural Language Processing Meets Quantum Physics: A Survey and Categorization
Sixuan Wu, Jian Li, Peng Zhang and Yue Zhang
Revisiting Self-training for Few-shot Learning of Language Model
Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng and Haizhou Li
Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer
Fanchao Qi, Yangyi Chen, Xurui Zhang, Mukai Li, Zhiyuan Liu and Maosong Sun
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
Yue Zhang, Zhang Bo, Rui Wang, Junjie Cao, Chen Li and Zuyi Bao
How to leverage the multimodal EHR data for better medical prediction?
Bo Yang and Lijun Wu
CATE: A Contrastive Pre-trained Model for Metaphor Detection with Semi-supervised Learning
Zhenxi Lin, Qianli Ma, Jiangyue Yan and JIEYU CHEN
Role of Language Relatedness in Multilingual Fine­-tuning of Language Models: A Case Study in Indo-­Aryan Languages
Tejas Dhamecha, Rudra Murthy, Samarth Bharadwaj, Karthik Sankaranarayanan and Pushpak Bhattacharyya
Asking Questions Like Educational Experts: Automatically Generating Question-Answer Pairs on Real-World Examination Data
Fanyi Qu, Xin Jia and Yunfang Wu
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Ahmad Rashid, Vasileios Lioutas, Abbas Ghaddar and Mehdi Rezagholizadeh
Synchronous Dual Network with Cross-Type Attention for Joint Entity and Relation Extraction
Hui Wu and xiaodong shi
HypMix: Hyperbolic Interpolative Data Augmentation
Ramit Sawhney, Megh Thakkar, Shivam Agarwal, Di Jin, Diyi Yang and Lucie Flek
Knowledge Graph Representation Learning using Ordinary Differential Equations
Mojtaba Nayyeri, Chengjin Xu, Franca Hoffmann, Mirza Mohtashim Alam, Jens Lehmann and sahar vahdati
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification
Daria Pylypenko, Kwabena Amponsah-Kaakyire, Koel Dutta Chowdhury, Josef van Genabith and Cristina España-Bonet
WinoLogic: A Zero-Shot Logic-based Diagnostic Dataset for Winograd Schema Challenge
Weinan He, Canming Huang, Yongmei Liu and Xiaodan Zhu
Uncertainty Measures in Neural Belief Tracking and the Effects on Dialogue Policy Performance
Carel van Niekerk, Andrey Malinin, Christian Geishauser, Michael Heck, Hsien-chin Lin, Nurul Lubis, Shutong Feng and Milica Gasic
DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings
Che Liu, Rui Wang, jinghua liu, Jian Sun, Fei Huang and Luo Si
Contrasting Human- and Machine-Generated Word-Level Adversarial Examples for Text Classification
Maximilian Mozes, Max Bartolo, Pontus Stenetorp, Bennett Kleinberg and Lewis Griffin
Syntactically-Informed Unsupervised Paraphrasing with Non-Parallel Data
Erguang Yang, Mingtong Liu, Deyi Xiong, YUJIE ZHANG, Yao Meng, Changjian Hu, Jinan Xu and Yufeng Chen
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation
Ivan Provilkov and Andrey Malinin
Comparative Opinion Quintuple Extraction from Product Reviews
Ziheng Liu, Rui Xia and Jianfei Yu
Reinforced Counterfactual Data Augmentation for Dual Sentiment Classification
Hao Chen, Rui Xia and Jianfei Yu
FinQA: A Dataset of Numerical Reasoning over Financial Data
Zhiyu Chen, Wenhu Chen, Charese Smiley, Sameena Shah, Iana Borova, Dylan Langdon, Reema Moussa, Matt Beane, Ting-Hao Huang, Bryan Routledge and William Yang Wang
Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias
Jannis Vamvas and Rico Sennrich
OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings
Sunipa Dev, Tao Li, Jeff M Phillips and Vivek Srikumar
SELFEXPLAIN: A Self-Explaining Architecture for Neural Text Classifiers
Dheeraj Rajagopal, Vidhisha Balachandran, Eduard Hovy and Yulia Tsvetkov
FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation
Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Wen-tau Yih, Yashar Mehdad and Srinivasan Iyer

Short Papers

Enhancing the Context Representation in Similarity-based Word Sense Disambiguation
Ming Wang, Jianzhang Zhang and Yinglin Wang
Reconstruction Attack on Instance Encoding for Language Understanding
Shangyu Xie and Yuan Hong
Language Models are Few-Shot Butlers
Vincent Micheli and Francois Fleuret
Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language
Avia Efrat, Uri Shaham, Dan Kilman and Omer Levy
Toward Deconfounding the Influence of Entity Demographics for Question Answering Accuracy
Maharshi Gor, Kellie Webster and Jordan Boyd-Graber
Considering Nested Tree Structure in Sentence Extractive Summarization with Pre-trained Transformer
Jingun Kwon, Naoki Kobayashi, Hidetaka Kamigaito and Manabu Okumura
Generating Datasets with Pretrained Language Models
Timo Schick and Hinrich Schütze
Exploring Strategies for Generalizable Commonsense Reasoning with Pre-trained Models
Kaixin Ma, Filip Ilievski, Jonathan Francis, Satoru Ozaki, Eric Nyberg and Alessandro Oltramari
Improving and Simplifying Pattern Exploiting Training
Derek Tam, Rakesh Radhakrishnan Menon, Mohit Bansal, Shashank Srivastava and Colin Raffel
Knowledge-Aware Meta-learning for Low-Resource Text Classification
Huaxiu Yao, Ying-xin Wu, Maruan Al-Shedivat and Eric Xing
Reducing Discontinuous to Continuous Parsing with Pointer Network Reordering
Daniel Fernández-González and Carlos Gómez-Rodríguez
Sociolectal Analysis of Pretrained Language Models
Sheng Zhang, Xin Zhang, Weiming Zhang and Anders Søgaard
Frame Semantic-Enhanced Sentence Modeling for Sentence-level Extractive Text Summarization
Yong Guan, Shaoru Guo, Ru Li, Xiaoli Li and Hongye Tan
Explore Better Relative Position Embeddings from Encoding Perspective for Transformer Models
Anlin Qu, Jianwei Niu and Shasha Mo
CAPE: Context-Aware Private Embeddings for Private Language Learning
Richard Plant, Dimitra Gkatzia and Valerio Giuffrida
Generation and Extraction Combined Dialogue State Tracking with Hierarchical Ontology Integration
Xinmeng Li, Qian Li, Wansen Wu and Quanjun Yin
Good-Enough Example Extrapolation
Jason Wei
Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric
Md Asadul Islam and Enrico Magnani
Augmenting BERT-style Models with Predictive Coding to Improve Discourse-level Representations
Vladimir Araujo, Andrés Felipe Villa, Marcelo Mendoza, Marie-Francine Moens and Alvaro M. Soto
Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings
Weixuan Wang, Wei Peng, Meng Zhang and Qun Liu
Cost-effective End-to-end Information Extraction for Semi-structured Document Images
Wonseok Hwang, Hyunji Lee, Jinyeong Yim, Geewook Kim and Minjoon Seo
An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction
Samuel Mensah, Kai Sun and Nikolaos Aletras
AdapterDrop: On the Efficiency of Adapters in Transformers
Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers and Iryna Gurevych
We Need to Talk About train-dev-test Splits
Rob van der Goot
Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience
George Chrysostomou and Nikolaos Aletras
On Classifying whether Two Texts are on the Same Side of an Argument
Erik Körner, Gregor Wiedemann, Ahmad Dawar Hakimi, Gerhard Heyer and Martin Potthast
Highly Parallel Autoregressive Entity Linking with Discriminative Correction
Nicola De Cao, Wilker Aziz and Ivan Titov
Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems
Potsawee Manakul and Mark Gales
When differential privacy meets NLP: The devil is in the detail
Ivan Habernal
Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
Alberto Testoni and Raffaella Bernardi
Word-Level Coreference Resolution
Vladimir Dobrovolskii
Is Information Density Uniform in Task-Oriented Dialogues?
Mario Giulianelli, Arabella Sinclair and Raquel Fernández
Unsupervised Multi-View Post-OCR Error Correction With Language Models
Harsh Gupta, Luciano Del Corro, Samuel Broscheit, Johannes Hoffart and Eliot Brenner
On Homophony and Rényi Entropy
Tiago Pimentel, Clara Meister, Simone Teufel and Ryan Cotterell
Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis
Ting-Wei Hsu, Chung-Chi Chen, Hen-Hsen Huang and Hsin-Hsi Chen
Students Who Study Together Learn Better: On the Importance of Collective Knowledge Distillation for Domain Transfer in Fact Verification
Mitch Paul Mithun, Sandeep Suntwal and Mihai Surdeanu
An Information-Theoretic Characterization of Morphological Fusion
Neil Rathi, Michael Hahn and Richard Futrell
Aligning Cross-lingual Sentence Representations with Dual Momentum Contrast
Liang Wang, Wei Zhao and Jingming Liu
Multiplex Graph Neural Network for Extractive Text Summarization
Baoyu Jing, Zeyu You, Tao Yang, Wei Fan and Hanghang Tong
MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
Xinyin Ma, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Weiming Lu
A Bag of Tricks for Dialogue Summarization
Muhammad Khalifa, Miguel Ballesteros and Kathleen McKeown
A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations
Ziyi Yang, Yinfei Yang, Daniel Cer and Eric Darve
Value-aware Approximate Attention
Ankit Gupta and Jonathan Berant
Injecting Entity Types into Entity-Guided Text Generation
Xiangyu Dong, Wenhao Yu, Chenguang Zhu and Meng Jiang
RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition Models
Bill Yuchen Lin, Wenyang Gao, Jun Yan, Ryan Moreno and Xiang Ren
Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation
Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang and Iryna Gurevych
Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention
Jiawei Chen, Hongyu Lin, Xianpei Han and Le Sun
Learning Prototype Representations Across Few-Shot Tasks for Event Detection
Viet Lai, Franck Dernoncourt and Thien Huu Nguyen
Integrating Semantic Scenario and Word Relations for Abstractive Sentence Summarization
Yong Guan, Shaoru Guo, Ru Li, Xiaoli Li and Hu Zhang
Finnish Dialect Identification: The Effect of Audio and Text
Mika Hämäläinen, Khalid Alnajjar, Niko Partanen and Jack Rueter
Continuous Entailment Patterns for Lexical Inference in Context
Martin Schmitt and Hinrich Schütze
Improving Neural Machine Translation by Bidirectional Training
Liang Ding, Di Wu and Dacheng Tao
Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT
Zaiqiao Meng, Fangyu Liu, Thomas Clark, Ehsan Shareghi and Nigel Collier
BERT-Beta: A Proactive Probabilistic Approach to Text Moderation
FEI TAN, Yifan Hu, Kevin Yen and Changwei Hu
Open-domain clarification question generation without question examples
Julia White, Gabriel Poesia, Robert Hawkins, Dorsa Sadigh and Noah Goodman
We’ve had this conversation before: A Novel Approach to Measuring Dialog Similarity
Ofer Lavi, Ella Rabinovich, Segev Shlomov, David Boaz, Inbal Ronen and Ateret Anaby Tavor
Mutual-Learning Improves End-to-End Speech Translation
Jiawei Zhao, Wei Luo, Boxing Chen and Andrew Gilman
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
Prasetya Utama, Nafise Sadat Moosavi, Victor Sanh and Iryna Gurevych
What’s Hidden in a One-layer Randomly Weighted Transformer?
Sheng Shen, Zhewei Yao, Douwe Kiela, Kurt Keutzer and Michael Mahoney
Visual Goal-Step Inference using wikiHow
Yue Yang, Artemis Panagopoulou, Qing Lyu, Li Zhang, Mark Yatskar and Chris Callison-Burch
Voice Query Auto Completion
Raphael Tang, Karun Kumar, Kendra Chalkley, Ji Xin, Liming Zhang, Wenyan Li, Gefei Yang, Yajie Mao, Junho Shin, Geoffrey Murray and Jimmy Lin
A New Representation for Span-based CCG Parsing
Yoshihide Kato and Shigeki Matsubara
CSAGN: Conversational Structure Aware Graph Network for Conversational Semantic Role Labeling
Han Wu, Kun Xu and Linqi Song
What’s in a Name? Answer Equivalence For Open-Domain Question Answering
Chenglei Si, Chen Zhao and Jordan Boyd-Graber
Generative Context Pair Selection for Multi-hop Question Answering
Dheeru Dua, Cicero Nogueira dos Santos, Patrick Ng, Ben Athiwaratkun, Bing Xiang, Matt Gardner and Sameer Singh
HETFORMER: Heterogeneous Transformer with Sparse Attention for Long-Text Extractive Summarization
Ye Liu, Jianguo Zhang, Yao Wan, Congying Xia, Lifang He and Philip Yu
Context-Aware Interaction Network for Question Matching
Zhe Hu, Zuohui Fu, Yu Yin and Gerard de Melo
Implicit Premise Generation with Discourse-aware Commonsense Knowledge Models
Tuhin Chakrabarty, Aadit Trivedi and Smaranda Muresan
Transformer-based Lexically Constrained Headline Generation
Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki Okazaki, Kentaro Inui and Koichi Takeda
Improving Pre-trained Vision-and-Language Embeddings for Phrase Grounding
Zi-Yi Dou and Nanyun Peng
Exploring Underexplored Limitations of Cross-Domain Text-to-SQL Generalization
Yujian Gan, Xinyun Chen and Matthew Purver
Zero-Shot Dialogue Disentanglement by Self-Supervised Entangled Response Selection
Ta-Chung Chi and alexander rudnicky
BPM_MT: Enhanced Backchannel Prediction Model using Multi-Task Learning
Jin Yea Jang, san kim, Minyoung Jung, Saim Shin and Gahgene Gweon
It Is Not As Good As You Think! Evaluating Simultaneous Machine Translation on Interpretation Data
Jinming Zhao, Philip Arthur, Gholamreza Haffari, Trevor Cohn and Ehsan Shareghi
Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition
Rui Wang and Ricardo Henao
Neuro-Symbolic Approaches for Text-Based Policy Learning
Subhajit Chaudhury, Prithviraj Sen, Masaki Ono, Daiki Kimura, Michiaki Tatsubori and Asim Munawar
Inducing Stereotypical Character Roles from Plot Structure
Labiba Jahan, Rahul Mittal and Mark Finlayson
SHAPE: Shifted Absolute Position Embedding for Transformers
Shun Kiyono, Sosuke Kobayashi, Jun Suzuki and Kentaro Inui
Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning
Seonghyeon Ye, Jiseon Kim and Alice Oh
Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
David Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas Kleinbauer and Dietrich Klakow
Fairness-aware Class Imbalanced Learning
Shivashankar Subramanian, Afshin Rahimi, Timothy Baldwin, Trevor Cohn and Lea Frermann
Evaluating Debiasing Techniques for Intersectional Biases
Shivashankar Subramanian, Xudong Han, Timothy Baldwin, Trevor Cohn and Lea Frermann
Paraphrasing Compound Nominalizations
John Lee, Ho Hung Lim and Carol Webster
Have You Seen That Number? Investigating Extrapolation in Question Answering Models
Jeonghwan Kim, Giwon Hong, Kyung-min Kim, Junmo Kang and Sung-Hyon Myaeng
BERT4GCN: Using BERT Intermediate Layers to Augment GCN for Aspect-based Sentiment Classification
Zeguan Xiao, Jiarun Wu, Qingliang Chen and Congjian Deng
Speechformer: Reducing Information Loss in Direct Speech Translation
Sara Papi, Marco Gaido, Matteo Negri and Marco Turchi
An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation
Alessandro Raganato, Raúl Vázquez, Mathias Creutz and Jörg Tiedemann
Self-Supervised Curriculum Learning for Spelling Error Correction
Zifa Gan, Hongfei Xu and Hongying Zan
Does Social Pressure Drive Persuasion in Online Fora?
Ayush Jain and Shashank Srivastava
Improving the Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation
Eva Hasler, Tobias Domhan, Jonay Trenous, Ke Tran, Bill Byrne and Felix Hieber
Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation
Clement Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten and patrick Gallinari
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models
Tassilo Klein and Moin Nabi
Expanding End-to-End Question Answering on Differentiable Knowledge Graphs with Intersection
Priyanka Sen, Armin Oliya and Amir Saffari
Is “moby dick” a Whale or a Bird? Named Entities and Terminology in Speech Translation
Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli and Marco Turchi
Neuro-Symbolic Reinforcement Learning with First-Order Logic
Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar and Alexander Gray
Towards Incremental Transformers: An Empirical Analysis of Transformer Models for Incremental NLU
Patrick Kahardipraja, Brielen Madureira and David Schlangen
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
Luisa März, Ehsaneddin Asgari, Fabienne Braune, Franziska Zimmermann and Benjamin Roth
An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model
Kijong Han, Seojin Lee and Dong-hun Lee
Dealing with Typos for BERT-based Passage Retrieval and Ranking
Shengyao Zhuang and Guido Zuccon
Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling
Liwen Wang, Xuefeng Li, Jiachi Liu, Keqing He, Yuanmeng Yan and Weiran Xu
Biomedical Concept Normalization by Leveraging Hypernyms
Cheng Yan, Yuanzhe Zhang, Kang Liu, Jun Zhao, Yafei Shi and Shengping Liu
What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think
David M. Howcroft and Verena Rieser
Caption Enriched Samples for Improving Hateful Memes Detection
Efrat Blaier, Itzik Malkiel and Lior Wolf
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
Atsuki Yamaguchi, George Chrysostomou, Katerina Margatina and Nikolaos Aletras
AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain
Jimin Hong, TaeHee Kim, Hyesu Lim and Jaegul Choo
A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition
Huimin Wang and Kam-Fai Wong
Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes
Tomasz Limisiewicz and David Mareček
Don’t Search for a Search Method —- Simple Heuristics Suffice for Adversarial Text Attacks
Nathaniel Berger, Stefan Riezler, Sebastian Ebert and Artem Sokolov
To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning
Lukas Lange, Jannik Strötgen, Heike Adel and Dietrich Klakow
Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution
Yi Huang, Buse Giledereli, Abdullatif Köksal, Arzucan Özgür and Elif Ozkirimli
Dynamic Forecasting of Conversation Derailment
Yova Kementchedjhieva and Anders Søgaard
Feedback Attribution for Counterfactual Bandit Learning in Multi-Domain Spoken Language Understanding
Tobias Falke and Patrick Lehnen
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
Sonia Raychaudhuri, Saim Wani, Shivansh Patel, Unnat Jain and Angel Chang
Integrating Personalized PageRank into Neural Word Sense Disambiguation
Ahmed El Sheikh, Michele Bevilacqua and Roberto Navigli
SWEAT: Scoring Polarization of Topics across Different Corpora
Federico Bianchi, Marco Marelli, Paolo Nicoli and Matteo Palmonari
Frustratingly Simple but Surprisingly Strong: Using Language-Independent Features for Zero-shot Cross-lingual Semantic Parsing
Jingfeng Yang, Federico Fancellu, Bonnie Webber and Diyi Yang
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
Fanchao Qi, Yangyi Chen, Mukai Li, Yuan Yao, Zhiyuan Liu and Maosong Sun
End-to-End Entity Resolution and Question Answering Using Differentiable Knowledge Graphs
Amir Saffari, Armin Oliya, Priyanka Sen and Tom Ayoola
Exploiting Twitter as Source of Large Corpora of Weakly Similar Pairs for Semantic Sentence Embeddings
Marco Di Giovanni and Marco Brambilla
“So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy
Anirudh Mittal, Pranav Jeevan P, Prerak Gandhi, Diptesh Kanojia and Pushpak Bhattacharyya
Locke’s Holiday: Belief Bias in Machine Reading
Anders Søgaard
How to Train BERT with an Academic Budget
Peter Izsak, Moshe Berchansky and Omer Levy
Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language Models
Yuanmeng Yan, Rumei Li, Sirui Wang, Hongzhi Zhang, Zan Daoguang, Fuzheng Zhang, Wei Wu and Weiran Xu
NeuTral Rewriter: A Rule-Based and Neural Approach to Automatic Rewriting into Gender Neutral Alternatives
Eva Vanmassenhove, Chris Emmery and Dimitar Shterionov
Sequence Length is a Domain: Length-based Overfitting in Transformer Models
Dusan Varis and Ondřej Bojar
The Effect of Round-Trip Translation on Fairness in Sentiment Analysis
Jonathan Christiansen, Mathias Gammelgaard and Anders Søgaard
XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment
Ahmed El-Kishky, Adithya Renduchintala, James Cross, Francisco Guzmán and Philipp Koehn
Open Aspect Target Sentiment Classification with Natural Language Prompts
Ronald Seoh, Ian Birle, Mrinal Tak, Haw-Shiuan Chang, Brian Pinette and Alfred Hough
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation
Long Doan, Linh The Nguyen, Nguyen Luong Tran, Thai Hoang and Dat Quoc Nguyen
Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings
Junkun Chen, Renjie Zheng, Atsuhito Kita, Mingbo Ma and Liang Huang
Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression
Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian McAuley and Furu Wei
A Simple Geometric Method for Cross-Lingual Linguistic Transformations with Pre-trained Autoencoders
Maarten De Raedt, Fréderic Godin, Pieter Buteneers, Chris Develder and Thomas Demeester
Multilingual and Cross-Lingual Intent Detection from Spoken Data
Daniela Gerz, Pei-Hao Su, Razvan Kusztos, Avishek Mondal, Michał Lis, Eshan Singhal, Nikola Mrkšić, Tsung-Hsien Wen and Ivan Vulić
Improving Query Graph Generation for Complex Question Answering over Knowledge Base
Kechen Qin, Cheng Li, Virgil Pavlu and Javed Aslam
Towards Realistic Few-Shot Relation Extraction
Sam Brody, Sichao Wu and Adrian Benton
The Effect of Efficient Messaging and Input Variability on Neural-Agent Iterated Language Learning
Yuchen Lian, Arianna Bisazza and Tessa Verhoef
Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement
Bingzhi Li, Guillaume Wisniewski and Benoit Crabbé
Contextual Rephrase Detection for Reducing Friction in Dialogue Systems
Zhuoyi Wang, Saurabh Gupta, Jie Hao, Xing Fan, Dingcheng Li, Alexander Hanbo Li and Chenlei Guo
Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning
Jianguo Zhang, Trung Bui, Seunghyun Yoon, Xiang Chen, Zhiwei Liu, Congying Xia, Quan Hung Tran, Walter Chang and Philip Yu
A Secure and Efficient Federated Learning Framework for NLP
CHENGHONG Wang, Jieren Deng, Xianrui Meng, Yijue Wang, Ji Li, Sheng Lin, Shuo Han, Fei Miao, Sanguthevar Rajasekaran and Caiwen Ding
Discrete and Soft Prompting for Multilingual Models
Mengjie Zhao and Hinrich Schütze
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
Torsten Scholak, Nathan Schucher and Dzmitry Bahdanau
Sentence Bottleneck Autoencoders from Transformer Language Models
Ivan Montero, Nikolaos Pappas and Noah A. Smith
CHoRaL: Collecting Humor Reaction Labels from Millions of Social Media Users
Zixiaofan Yang, Shayan Hooshmand and Julia Hirschberg
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models
Jiaoda Li, Duygu Ataman and Rico Sennrich
Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica
Shirley Anugrah Hayati, Dongyeop Kang and Lyle Ungar
Evaluation Paradigms in Question Answering
Pedro Rodriguez and Jordan Boyd-Graber
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval
Xueguang Ma, Minghan Li, Kai Sun, Ji Xin and Jimmy Lin
Investigating Robustness of Dialog Models to Popular Figurative Language Constructs
Harsh Jhamtani, Varun Gangal, Eduard Hovy and Taylor Berg-Kirkpatrick
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization
Fajri Koto, Jey Han Lau and Timothy Baldwin
Perhaps PTLMs Should Go to School – A Task to Assess Open Book and Closed Book QA
Manuel Ciosici, Joe Cecil, Dong-Ho Lee, Alex Hedges, Marjorie Freedman and Ralph Weischedel
CoPHE: A Count-Preserving Hierarchical Evaluation Metric in Large-Scale Multi-Label Text Classification
Matúš Falis, Hang Dong, Alexandra Birch and Beatrice Alex
“It doesn’t look good for a date”: Transforming Critiques into Preferences for Conversational Recommendation Systems
Victor Bursztyn, Jennifer Healey, Nedim Lipka, Eunyee Koh, Doug Downey and Larry Birnbaum
Systematic Generalization on gSCAN: What is Nearly Solved and What is Next?
Linlu Qiu, Hexiang Hu, Bowen Zhang, Peter Shaw and Fei Sha
Multi-Vector Attention Models for Deep Re-ranking
Giulio Zhou and Jacob Devlin
NB-MLM: Efficient Domain Adaptation of Masked Language Models for Sentiment Analysis
Nikolay Arefyev, Dmitrii Kharchev and Artem Shelmanov
Nearest Neighbour Few-Shot Learning for Cross-lingual Classification
M Saiful Bari, Batool Haider and Saab Mansour
Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation
Liyan Xu, Xuchao Zhang, Xujiang Zhao, Haifeng Chen, Feng Chen and Jinho D. Choi
Extracting Fine-Grained Knowledge Graphs of Scientific Claims: Dataset and Transformer-Based Results
Ian Magnusson and Scott Friedman
Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language Models
Taichi Iki and Akiko Aizawa
Learning Universal Authorship Representations
Rafael Rivera-Soto, Olivia Miano, Juanita Ordonez, Barry Chen, Aleem Khan, Marcus Bishop and Nicholas Andrews
Guilt by Association: Emotion Intensities in Lexical Representations
Shahab Raji and Gerard de Melo
Single-dataset Experts for Multi-dataset Question Answering
Dan Friedman, Ben Dodge and Danqi Chen
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
Xinliang Frederick Zhang, Heming Sun, Xiang Yue, Simon Lin and Huan Sun
Simple Entity-Centric Questions Challenge Dense Retrievers
Christopher Sciavolino, Zexuan Zhong, Jinhyuk Lee and Danqi Chen
Emotion Inference in Multi-Turn Conversations with Addressee-Aware Module and Ensemble Strategy
Dayu Li, Xiaodan Zhu, Yang Li, Suge Wang, Deyu Li, Jian Liao and Jianxing Zheng
Utilizing Relative Event Time to Enhance Event-Event Temporal Relation Extraction
Haoyang Wen and Heng Ji
Levenshtein Training for Word-level Quality Estimation
Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post and Philipp Koehn
CRYPTOGRU: Low Latency Privacy-Preserving Text Analysis With GRU
Bo Feng, Qian Lou, Lei Jiang and Geoffrey Fox
SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check
Tuo Ji, Hang Yan and Xipeng Qiu
Numeracy enhances the Literacy of Language Models
Avijit Thawani, Jay Pujara and Filip Ilievski
Lying Through One’s Teeth: A Study on Verbal Leakage Cues
Min-Hsuan Yeh and Lun-Wei Ku
Exploring Non-Autoregressive Text Style Transfer
Yun Ma and Qing Li
Effective Sequence-to-Sequence Dialogue State Tracking
Jeffrey Zhao, Mahdis Mahdieh, Ye Zhang, Yuan Cao and Yonghui Wu
Can Language Models be Biomedical Knowledge Bases?
Mujeen Sung, Jinhyuk Lee, Sean Yi, Minji Jeon, Sungdong Kim and Jaewoo Kang
Learning Compact Metrics for MT
Amy Pu, Hyung Won Chung, Ankur Parikh, Sebastian Gehrmann and Thibault Sellam
“Average” Approximates “First Principal Component”? An Empirical Analysis on Representations from Neural Language Models
Zihan Wang, Chengyu Dong and Jingbo Shang
Abstract, Rationale, Stance: A Joint Model for Scientific Claim Verification
Zhiwei Zhang, Jiyi Li, Fumiyo Fukumoto and Yanming Ye
Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization
Zhiyuan Zeng, Jiaze Chen, Weiran Xu and Lei Li
Lifelong Explainer for Lifelong Learners
Xuelin Situ, Sameen Maruf, Ingrid Zukerman, Cecile Paris and Gholamreza Haffari
Relation Extraction with Word Graphs from N-grams
Han Qin, Yuanhe Tian and Yan Song
Modeling Human Sentence Processing with Left-Corner Recurrent Neural Network Grammars
Ryo Yoshida, Hiroshi Noji and Yohei Oseki
Numerical reasoning in machine reading comprehension tasks: are we there yet?
Hadeel Al-Negheimish, Pranava Madhyastha and Alessandra Russo
Separating Retention from Extraction in the Evaluation of End-to-end Relation Extraction
Bruno Taillé, Vincent Guigue, Geoffrey Scoutheeten and patrick Gallinari
Chinese WPLC: A Chinese Dataset for Evaluating Pretrained Language Models on Word Prediction Given Long-Range Context
Huibin Ge, Chenxi Sun, Deyi Xiong and Qun Liu
ClauseRec: A Clause Recommendation Framework for AI-aided Contract Authoring
Vinay Aggarwal, Aparna Garimella, Balaji Vasan Srinivasan, Anandhavelu N and Rajiv Jain
A Thorough Evaluation of Task-Specific Pretraining for Summarization
Sascha Rothe, Joshua Maynez and Shashi Narayan
Conditional probing: measuring usable information beyond a baseline
John Hewitt, Kawin Ethayarajh, Percy Liang and Christopher Manning
Bridging Perception, Memory, and Inference through Semantic Relations
Johanna Björklund, Adam Dahlgren Lindström and Frank Drewes
Data Collection vs. Knowledge Graph Completion: What is Needed to Improve Coverage?
Kenneth Church and Yuchen Bian
Data and Parameter Scaling Laws for Neural Machine Translation
Mitchell A Gordon, Kevin Duh and Jared Kaplan

System demonstration papers

MiSS: An Assistant for Multi-Style Simultaneous Translation
Zuchao Li, Kevin Parnow, Masao Utiyama, Eiichiro Sumita and Hai Zhao
Automatic Construction of Enterprise Knowledge Base
Junyi Chai, Yujie He, Homa Hashemi, Bing Li, Daraksha Parveen, Ranganath Kondapally and Wenjin Xu
LightTag: Text Annotation Platform
Tal Perry
TransIns: Document Translation with Markup Reinsertion
Jörg Steffen and Josef van Genabith
ET: A Workstation for Querying, Editing and Evaluating Annotated Corpora
Elvis de Souza and Cláudia Freitas
N-LTP: An Open-source Neural Language Technology Platform for Chinese
Wanxiang Che, Yunlong Feng, Libo Qin and Ting Liu
COMBO: State-of-the-Art Morphosyntactic Analysis
Mateusz Klimaszewski and Alina Wróblewska
ExcavatorCovid: Extracting Events and Relations from Text Corpora for Temporal and Causal Analysis for COVID-19
Bonan Min, Benjamin Rozonoyer, Haoling Qiu, Alexander Zamanian, Nianwen Xue and Jessica MacBride
KOAS: Korean Text Offensiveness Analysis System
San-Hee Park, Kang-Min Kim, Seonhee Cho, Jun-Hyung Park, Hyuntae Park, Hyuna Kim, Seongwon Chung and SangKeun Lee
RepGraph: Visualising and Analysing Meaning Representation Graphs
Jaron Cohen, Roy Cohen, Edan Toledo and Jan Buys
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus, Robert Schwarzenberg and Sebastian Möller
LMdiff: A Visual Diff Tool to Compare Language Models
Hendrik Strobelt, Benjamin Hoover, Arvind Satyanaryan and Sebastian Gehrmann
Semantic Context Path Labeling for Semantic Exploration of User Reviews
Salah Aït-Mokhtar, Caroline Brun, Yves Hoppenot and Agnes Sandor
Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking
Dirk Väth, Pascal Tilli and Ngoc Thang Vu
Athena 2.0: Contextualized Dialogue Management for an Alexa Prize SocialBot
Marilyn Walker, Vrindavan Harrison, Juraj Juraska, Lena Reed, Kevin Bowden, Wen Cui, Omkar Patil and Adwait Ratnaparkhi
SPRING Goes Online: End-to-End AMR Parsing and Generation
Rexhina Blloshmi, Michele Bevilacqua, Edoardo Fabiano, Valentina Caruso and Roberto Navigli
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu and Juan Pino
Press Freedom Monitor: Detection of Reported Press and Media Freedom Violations in Twitter and News Articles
Tariq Yousef, Antje Schlaf, Janos Borst, Andreas Niekler and Gerhard Heyer
UMR-Writer: A Web Application for Annotating Uniform Meaning Representations
Jin Zhao, Nianwen Xue, Jens Van Gysel and Jinho D. D. Choi
TranslateLocally: Blazing-fast translation running on the local CPU
Nikolay Bogoychev, Jelmer Van der Linde and Kenneth Heafield
Datasets: A Community Library for Natural Language Processing
Quentin lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Šaško, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander Rush and Thomas Wolf
Summary Explorer: Visualizing the State of the Art in Text Summarization
Shahbaz Syed, Tariq Yousef, Khalid Al Khatib, Stefan Jänicke and Martin Potthast
MeetDot: Videoconferencing with Live Translation Captions
Arkady Arkhangorodsky, Christopher Chu, Scot Fang, Yiqi Huang, Denglin Jiang, Ajay Nagesh, Boliang Zhang and Kevin Knight
Box Embeddings: An open-source library for representation learning using geometric structures
Tejas Chheda, Purujit Goyal, Trang Tran, Dhruvesh Patel, Michael Boratko, Shib Sankar Dasgupta and Andrew McCallum
LexiClean: An annotation tool for rapid multi-task lexical normalisation
Tyler Bikaun, Tim French, Melinda Hodkiewicz, Michael Stewart and Wei Liu
T3-Vis: visual analytic for Training and fine-Tuning Transformers in NLP
Raymond Li, Wen Xiao, Lanjun Wang, Hyeju Jang and Giuseppe Carenini
DomiKnowS: A Library for Integration of Symbolic Domain Knowledge in Deep Learning
Hossein Rajaby Faghihi, Quan Guo, Andrzej Uszok, Aliakbar Nafar and Parisa Kordjamshidi
OpenFraming: Open-sourced Tool for Computational Framing Analysis of Multilingual Data
Vibhu Bhatia, Vidya Prasad Akavoor, Sejin Paik, Lei Guo, Mona Jalal, Alyssa Smith, David Assefa Tofu, Edward Edberg Halim, Yimeng Sun, Margrit Betke, Prakash Ishwar and Derry Tanti Wijaya
IrEne-viz: Visualizing Energy Consumption of Transformer Models
Yash Kumar Lal, Reetu Singh, Harsh Trivedi, Qingqing Cao, Aruna Balasubramanian and Niranjan Balasubramanian
Open-Domain Question-Answering for COVID-19 and Other Emergent Domains
Sharon Levy, Kevin Mo, Wenhan Xiong and William Yang Wang
Project Debater APIs: Decomposing the AI Grand Challenge
Roy Bar-Haim, Yoav Kantor, Elad Venezian, Yoav Katz and Noam Slonim
CroAno : A Crowd Annotation Platform for Improving Label Consistency of Chinese NER Dataset
Baoli Zhang, zhucong li, Zhen Gan, Yubo Chen, Jing Wan, Kang Liu, Jun Zhao, Shengping Liu and Yafei Shi
iFᴀᴄᴇᴛSᴜᴍ: Coreference-based Interactive Faceted Summarization for Multi-Document Exploration
Eran Hirsch, Alon Eirew, Ori Shapira, Avi Caciularu, Arie Cattan, Ori Ernst, Ramakanth Pasunuru, Hadar Ronen, Mohit Bansal and Ido Dagan
AMuSE-WSD: An All-in-one Multilingual System for Easy Word Sense Disambiguation
Riccardo Orlando, Simone Conia, Fabrizio Brignone, Francesco Cecconi and Roberto Navigli
SeqAttack: On Adversarial Attacks for Named Entity Recognition
Walter Simoncini and Gerasimos Spanakis
InVeRo-XL: Making Cross-Lingual Semantic Role Labeling Accessible with Intelligible Verbs and Roles
Simone Conia, Riccardo Orlando, Fabrizio Brignone, Francesco Cecconi and Roberto Navigli
SummerTime: Text Summarization Toolkit for Non-experts
Ansong Ni, Zhangir Azerbayev, Mutethia Mutuma, Troy Feng, Yusen Zhang, Tao Yu, Ahmed Hassan Awadallah and Dragomir Radev
Chandler: An Explainable Sarcastic Response Generator
Silviu Oprea, Steven Wilson and Walid Magdy
TabPert : An Effective Platform for Tabular Perturbation
Nupur Jain, Vivek Gupta, Anshul Rai and Gaurav Kumar
DRIFT: A Toolkit for Diachronic Analysis of Scientific Literature
Abheesht Sharma, Gunjan Chhablani, Harshit Pandey and Rajaswa Patil
FAST: Fast Annotation tool for SmarT devices
Shunyo Kawamoto, Yu Sawai, Kohei Wakimoto and Peinan Zhang
deepQuest-py: Large and Distilled Models for Quality Estimation
Fernando Alva-Manchego, Abiola Obamuyide, Amit Gajbhiye, Frédéric Blain, Marina Fomicheva and Lucia Specia

Findings of EMNLP

Long Papers

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce
Song Xu, Haoran Li, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, Ying Liu and Bowen Zhou
Extracting Topics with Simultaneous Word Co-occurrence and Semantic Correlation Graphs: Neural Topic Modeling for Short Texts
Yiming Wang, Ximing Li, Xiaotang Zhou and Jihong Ouyang
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Chenyu You, Nuo Chen and Yuexian Zou
Neural News Recommendation with Collaborative News Encoding and Structural User Encoding
Zhiming Mao, Xingshan Zeng and Kam-Fai Wong
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data
Dian Yu, Kai Sun, Dong Yu and Claire Cardie
Joint Multimedia Event Extraction from Video and Article
Brian Chen, Xudong Lin, Christopher Thomas, Manling Li, Shoya Yoshida, Lovish Chum, Heng Ji and Shih-Fu Chang
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang, Wengang Zhou and Houqiang Li
Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation
Yuexiang Xie, Fei Sun, Yang Deng, Yaliang Li and Bolin Ding
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Shir Gur, Natalia Neverova, Chris Stauffer, Ser-Nam Lim, Douwe Kiela and Austin Reiter
HiTRANS: A Hierarchical Transformer Network for Nested Named Entity Recognition
Zhiwei Yang, Jing Ma, Hechang Chen, Yunke Zhang and Yi Chang
Improving Embedding-based Large-scale Retrieval via Label Enhancement
Peiyang Liu, Xi Wang, Sen Wang, Wei Ye, Xiangyu Xi and Shikun Zhang
Improving Privacy Guarantee and Efficiency of Latent Dirichlet Allocation Model Training Under Differential Privacy
Tao Huang and Hong Chen
Generating Mammography Reports from Multi-view Mammograms with BERT
Alexander Yalunin, Elena Sokolova, Ilya Burenko, Alexander Ponomarchuk, Olga Puchkova and Dmitriy Umerenkov
Decomposing Complex Questions Makes Multi-Hop QA Easier and More Interpretable
RuiLiu Fu, Han Wang, xuejun zhang, Jun Zhou and yonghong yan
Dense Hierarchical Retrieval for Open-domain Question Answering
Ye Liu, Kazuma Hashimoto, Yingbo Zhou, Semih Yavuz, Caiming Xiong and Philip Yu
Visually Grounded Concept Composition
Bowen Zhang, Hexiang Hu, Linlu Qiu, Peter Shaw and Fei Sha
Compositional Networks Enable Systematic Generalization for Grounded Language Understanding
Yen-Ling Kuo, Boris Katz and Andrei Barbu
An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages
Xinyu Lu, Jipeng Qiang, Yun Li, Yunhao Yuan and Yi Zhu
TWEETSUMM - A Dialog Summarization Dataset for Customer Service
Guy Feigenblat, Chulaka Gunasekara, Benjamin Sznajder, Sachindra Joshi, David Konopnicki and Ranit Aharonov
Discourse-Based Sentence Splitting
Liam Cripwell, Joël Legrand and Claire Gardent
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering
Minghan Li, Ming Li, Kun Xiong and Jimmy Lin
Mining the Cause of Political Decision-Making from Social Media: A Case Study of COVID-19 Policies across the US States
Zhijing Jin, Zeyu Peng, Tejas Vaidhya, Bernhard Schoelkopf and Rada Mihalcea
Self-Attention Graph Residual Convolutional Networks for Event Detection with dependency relations
Anan Liu, Ning Xu and Haozhe Liu
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
Gi-Cheon Kang, Junseok Park, Hwaran Lee, Byoung-Tak Zhang and Jin-Hwa Kim
Exploring Sentence Community for Document-Level Event Extraction
Yusheng Huang and Weijia Jia
A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems
san kim, Jin Yea Jang, minyoung jung and saim shin
WHOSe Heritage: Classification of UNESCO World Heritage Statements of “Outstanding Universal Value” with Soft Labels
Nan Bai, Renqian Luo, Pirouz Nourian and Ana Pereira Roders
P-INT: A Path-based Interaction Model for Few-shot Knowledge Graph Completion
Jingwen Xu, Jing Zhang, Xirui Ke, Yuxiao Dong, Hong Chen, Cuiping Li and Yongbin Liu
Cartography Active Learning
Mike Zhang and Barbara Plank
Beyond Reptile: Meta-Learned Dot-Product Maximization between Gradients for Improved Single-Task Regularization
Akhil Kedia, Sai Chetan Chinthakindi and Wonho Ryu
GooAQ: Open Question Answering with Diverse Answer Types
Daniel Khashabi, Amos Ng, Tushar Khot, Ashish Sabharwal, Hannaneh Hajishirzi and Chris Callison-Burch
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions
Javier Ferrando and Marta R. Costa-jussà
BFClass: A Backdoor-free Text Classification Framework
Zichao Li, Dheeraj Mekala, Chengyu Dong and Jingbo Shang
Multilingual Chart-based Constituency Parse Extraction from Pre-trained Language Models
Taeuk Kim, Bowen Li and Sang-goo Lee
Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings
Kai Wang, Yu Liu, Dan Lin and Michael Sheng
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Lei Li, Yankai Lin, Deli Chen, Shuhuai Ren, Peng Li, Jie Zhou and Xu Sun
Semi-supervised Relation Extraction via Incremental Meta Self-Training
Xuming Hu, Chenwei Zhang, Fukun Ma, Chenyao Liu, Lijie Wen and Philip S. Yu
Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning
Yichao Luo, Yige Xu, Jiacheng Ye, Xipeng Qiu and Qi Zhang
Improving Knowledge Graph Embedding Using Affine Transformations of Entities Corresponding to Each Relation
Jinfa Yang, Yongjie Shi, Xin Tong, Robin Wang, Taiyan Chen and Xianghua Ying
Distilling Word Meaning in Context from Pre-trained Language Models
Yuki Arase and Tomoyuki Kajiwara
Bidirectional Hierarchical Attention Networks based on Document-level Context for Emotion Cause Extraction
Guimin Hu, Guangming Lu and Yi Zhao
Distantly Supervised Relation Extraction in Federated Settings
Dianbo Sui, Yubo Chen, Kang Liu and Jun Zhao
Saliency-based Multi-View Mixed Language Training for Zero-shot Cross-lingual Classification
Siyu Lai, Hui Huang, Dong Jing, Yufeng Chen, Jinan Xu and Jian Liu
Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society
Firoj Alam, Shaden Shaar, Fahim Dalvi, Hassan Sajjad, Alex Nikolov, Hamdy Mubarak, Giovanni Da San Martino, Ahmed Abdelali, Nadir Durrani, Kareem Darwish, Abdulaziz Al-Homaid, Wajdi Zaghouani, Tommaso Caselli, Gijs Danoe, Friso Stolk, Britt Bruntink and Preslav Nakov
FANATIC: FAst Noise-Aware TopIc Clustering
Ari Silburt, Anja Subasic, Evan Thompson, Carmeline Dsilva and Tarec Fares
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoderfor Unsupervised Sentence Embedding Learning
Kexin Wang, Nils Reimers and Iryna Gurevych
How Suitable Are Subword Segmentation Strategies for Translating Non-Concatenative Morphology?
Chantal Amrhein and Rico Sennrich
Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning
Xisen Jin, Bill Yuchen Lin, Mohammad Rostami and Xiang Ren
An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces
Kelly Marchisio, Youngser Park, Ali Saad-Eldin, Anton Alyakin, Kevin Duh, carey priebe and Philipp Koehn
How to Select One Among All ? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding
Tianda Li, Ahmad Rashid, Aref Jafari, Pranav Sharma, Ali Ghodsi and Mehdi Rezagholizadeh
Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction
Zeyu Li, Wei Cheng, Reema Kshetramade, John Houser, Haifeng Chen and Wei Wang
Recall and Learn: A Memory-augmented Solver for Math Word Problems
Shifeng Huang, Jiawei Wang, Jiao Xu, Da Cao and Ming Yang
An Uncertainty-Aware Encoder for Aspect Detection
Nhung Nguyen, Kiem-Hieu Nguyen, Young-In Song and Tuan Cao
Improving Empathetic Response Generation by Recognizing Emotion Cause in Conversations
Jun Gao, Yuhan Liu, Haolin Deng, Wei Wang, Yu Cao, Jiachen Du and Ruifeng Xu
Probing Across Time: What Does RoBERTa Know and When?
Zeyu Liu, Yizhong Wang, Jungo Kasai, Hannaneh Hajishirzi and Noah A. Smith
Knowledge-Guided Paraphrase Identification
Haoyu Wang, Fenglong Ma, Yaqing Wang and Jing Gao
R2-D2: A Modular Baseline for Open-Domain Question Answering
Martin Fajcik, Martin Docekal, Karel Ondrej and Pavel Smrz
What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability
Yaochen Liu, Yazhou Zhang, Qiuchi Li, Benyou Wang and Dawei Song
Discovering Representation Sprachbund For Multilingual Pre-Training
Yimin Fan, Yaobo Liang, Alexandre Muzio, Hany Hassan, Houqiang Li, Ming Zhou and Nan Duan
Plan-then-Generate: Controlled Data-to-Text Generation via Planning
Yixuan Su, David Vandyke, Sihui Wang, Yimai Fang and Nigel Collier
Exploiting Curriculum Learning in Unsupervised Neural Machine Translation
Jinliang Lu and Jiajun Zhang
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo and Yanjun Qi
To Protect and To Serve? Analyzing Entity-Centric Framing of Police Violence
Caleb Ziems and Diyi Yang
When Retriever-Reader Meets Scenario-Based Multiple-Choice Questions
ZiXian Huang, Ao Wu, Yulin Shen, Gong Cheng and Yuzhong Qu
Structured abbreviation expansion in context
Kyle Gorman, Christo Kirov, Brian Roark and Richard Sproat
Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding
Shiyang Li, Semih Yavuz, Wenhu Chen and Xifeng Yan
Compositional Generalization via Semantic Tagging
Hao Zheng and Mirella Lapata
Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering
Zhe Lin, Yitao Cai and Xiaojun Wan
Diversity and Consistency: Exploring Visual Question-Answer Pair Generation
Sen Yang, Qingyu Zhou, dawei feng, yang liu, Chao Li, Yunbo Cao and Dongsheng Li
Entity-level Cross-modal Learning Improves Multi-modal Machine Translation
Xin Huang, Jiajun Zhang and Chengqing Zong
Learning to Ground Visual Objects for Visual Dialog
Feilong Chen, Xiuyi Chen, Can Xu and Daxin Jiang
KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals
Jun Zhang, Yan Yang, Chencai Chen, liang he and Zhou Yu
Less Is More: Domain Adaptation with Lottery Ticket for Reading Comprehension
Haichao Zhu, Zekun Wang, Heng Zhang, Ming Liu, Sendong Zhao and Bing Qin
Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic Segment
Qi MengNan, liu hao, yuzhuo fu and Ting Liu
Learning to Answer Psychological Questionnaire for Personality Detection
Feifan Yang, Tao Yang, Xiaojun Quan and Qinliang Su
Exploiting Reasoning Chains for Multi-hop Science Question Answering
Weiwen Xu, Yang Deng, Huihui Zhang, Deng Cai and Wai Lam
Neural Media Bias Detection Using Distant Supervision With BABE - Bias Annotations By Experts
Timo Spinde, Manuel Plank, Jan-David Krieger, Terry Ruas, Bela Gipp and Akiko Aizawa
Learning and Evaluating a Differentially Private Pre-trained Language Model
Shlomo Hoory, Amir Feder, Avichai Tendler, Sofia Erell, Alon Peled-Cohen, Itay Laish, Hootan Nakhost, Uri Stemmer, Ayelet Benjamini, Avinatan Hassidim and Yossi Matias
Simulated Chats for Building Dialog Systems: Learning to Generate Conversations from Instructions
Biswesh Mohapatra, Gaurav Pandey, Danish Contractor and Sachindra Joshi
Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Knowledge
Jiangnan Li, Zheng Lin, Peng Fu and Weiping Wang
An unsupervised framework for tracing textual sources of moral change
Aida Ramezani, Zining Zhu, Frank Rudzicz and Yang Xu
Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization
Junpeng Liu, Yanyan Zou, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Caixia Yuan and Xiaojie WANG
TWT: Table with Written Text for Controlled Data-to-Text Generation
Tongliang Li, Lei Fang, Jian-Guang LOU and Zhoujun Li
Which is Making the Contribution: Modulating Unimodal and Cross-modal Dynamics for Multimodal Sentiment Analysis
Ying Zeng, Sijie Mai and Haifeng Hu
Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation
Qingqing Zhu, Xiuying Chen, Pengfei Wu, JunFei Liu and Dongyan Zhao
Multilingual Neural Machine Translation: Can Linguistic Hierarchies Help?
Fahimeh Saleh, Wray Buntine, Gholamreza Haffari and Lan Du
Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading Comprehension
Guoxin Yu, Jiwei Li, Ling Luo, Yuxian Meng, Xiang Ao and Qing He
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu, Meng Fang, Ling Chen, Yali Du and Chengqi Zhang
A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization
Yuejie Lei, Fujia Zheng, Yuanmeng Yan, Keqing He and Weiran Xu
Constructing contrastive samples via summarization for text classification with limited annotations
Yangkai Du, Tengfei Ma, Lingfei Wu, Fangli Xu, Xuhong Zhang, Bo Long and Shouling Ji
End-to-end Neural Information Status Classification
Yufang Hou
EventKE: Event-Enhanced Knowledge Graph Embedding
Zixuan Zhang, Hongwei Wang, Han Zhao, Hanghang Tong and Heng Ji
Modeling Concentrated Cross-Attention for Neural Machine Translation with Gaussian Mixture Model
Shaolei Zhang and Yang Feng
Inconsistency Matters: A Knowledge-guided Dual-inconsistency Network for Multi-modal Rumor Detection
Mengzhu Sun, Xi Zhang, Jianqiang Ma and Yazheng Liu
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation
Chenhe Dong, Guangrun Wang, Hang XU, jiefeng Peng, Xiaozhe REN and Xiaodan Liang
Uni-FedRec: A Unified Privacy-Preserving News Recommendation Framework for Model Training and Online Serving
Tao Qi, Fangzhao Wu, Chuhan Wu, Yongfeng Huang and Xing Xie
Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement Learning
Sayan Ghosh and Shashank Srivastava
Topic-Guided Abstractive Multi-Document Summarization
Peng Cui and Le Hu
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving
Qinzhuo Wu, Qi Zhang and Zhongyu Wei
SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation
Hong Chen, Hiroya Takamura and Hideki Nakayama
Don’t Miss the Potential Customers! Retrieving Similar Ads to Improve User Targeting
Yi Feng, Ting Wang, Chuanyi Li, Vincent Ng, Jidong Ge, Bin Luo, Yucheng Hu and Xiaopeng Zhang
Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph
Nuttapong Chairatanakul, Noppayut Sriwatanasakdi, Nontawat Charoenphakdee, XIN LIU and Tsuyoshi Murata
Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
Xinghua Zhang, Bowen Yu, Tingwen Liu, Zhenyu Zhang, Jiawei Sheng, Xue Mengge and Hongbo Xu
Entity-Based Semantic Adequacy for Data-to-Text Generation
Juliette Faille, Albert Gatt and Claire Gardent
MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization
Xinnuo Xu, Ondřej Dušek, Shashi Narayan, Verena Rieser and Ioannis Konstas
A Conditional Generative Matching Model for Multi-lingual Reply Suggestion
Budhaditya Deb, Guoqing Zheng, Milad Shokouhi and Ahmed Hassan Awadallah
Rethinking Sentiment Style Transfer
Ping Yu, Yang Zhao, Chunyuan Li and Changyou Chen
HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge
Yufei Tian, Arvind krishna Sridhar and Nanyun Peng
Profiling News Discourse Structure Using Explicit Subtopic Structures Guided Critics
Prafulla Kumar Choubey and Ruihong Huang
ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection
Iftitahu Nimah, Meng Fang, Vlado Menkovski and Mykola Pechenizkiy
Learning from Language Description: Low-shot Named Entity Recognition via Decomposed Framework
Yaqing Wang, Haoda Chu, Chao Zhang and Jing Gao
Char2Subword: Extending the Subword Embedding Space Using Robust Character Compositionality
Gustavo Aguilar, Bryan McCann, Tong Niu, Nazneen Fatema Rajani, Nitish Shirish Keskar and Thamar Solorio
Conical Classification For Efficient One-Class Topic Determination
Sameer Khanna
Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data Augmentation
Jarana Manotumruksa, Jeff Dalton, Edgar Meij and Emine Yilmaz
Optimal Neural Program Synthesis from Multimodal Specifications
Xi Ye, Qiaochu Chen, Isil Dillig and Greg Durrett
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
Shifeng Liu, Yifang Sun, Bing Li, Wei Wang, Florence Bourgeois and Adam Dunn
APGN: Adversarial and Parameter Generation Networks for Multi-Source Cross-Domain Dependency Parsing
Ying Li, Meishan Zhang, Zhenghua Li, Min Zhang, Zhefeng Wang, baoxing Huai and Nicholas Jing Yuan
“Let Your Characters Tell Their Story’’: A Dataset for Character-Centric Narrative Understanding
Faeze Brahman, Meng Huang, Oyvind Tafjord, Chao Zhao, Mrinmaya Sachan and Snigdha Chaturvedi
Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation
Humair Raj Khan, Deepak Gupta and Asif Ekbal
An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment Analysis
Yunlong Liang, Fandong Meng, Jinchao Zhang, Yufeng Chen, Jinan Xu and Jie Zhou
Semantic Alignment with Calibrated Similarity for Multilingual Sentence Embedding
Jiyeon Ham and Eun-Sol Kim
WIKIBIAS: Detecting Multi-Span Subjective Biases in Language
Yang Zhong, Jingfeng Yang, Wei Xu and Diyi Yang
UnClE: Explicitly Leveraging Semantic Similarity to Reduce the Parameters of Word Embeddings
Zhi Li, Yuchen Zhai, Chengyu Wang, Minghui Qiu, Kailiang Li and Yin Zhang
Grounded Graph Decoding improves Compositional Generalization in Question Answering
Yu Gai, Paras Jain, Wendi Zhang, Joseph Gonzalez, Dawn Song and Ion Stoica
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser
Duo Zheng, Zipeng Xu, Fandong Meng, Xiaojie WANG, Jiaan Wang and Jie Zhou
A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base
Yu Feng, Jing Zhang, Gaole He, Wayne Xin Zhao, Lemao Liu, Quan Liu, Cuiping Li and Hong Chen
RoR: Read-over-Read for Long Document Machine Reading Comprehension
Jing Zhao, Junwei Bao, Yifan Wang, Yongwei Zhou, Youzheng Wu, Xiaodong He and Bowen Zhou
Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing
Akshat Shrivastava, Pierce Chuang, Arun Babu, Shrey Desai, Abhinav Arora, Alexander Zotov and Ahmed Aly
Language Resource Efficient Learning for Captioning
Jia Chen, Yike Wu, Shiwan Zhao and Qin Jin
Translation as Cross-Domain Knowledge: Attention Augmentation for Unsupervised Cross-Domain Segmenting and Labeling Tasks
Ruixuan Luo, Yi Zhang, Sishuo Chen and Xu Sun
ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts
Yuta Koreeda and Christopher Manning
Japanese Zero Anaphora Resolution Can Benefit from Parallel Texts Through Neural Transfer Learning
Masato Umakoshi, Yugo Murawaki and Sadao Kurohashi
Grouped-Attention for Content-Selection and Content-Plan Generation
Bayu Distiawan Trisedya, Xiaojie Wang, Jianzhong Qi, Rui Zhang and Qingjun Cui
An Explicit-Joint and Supervised-Contrastive Learning Framework for Few-Shot Intent Classification and Slot Filling
Han Liu, Feng Zhang, Xiaotong Zhang, Siyang Zhao and Xianchao Zhang
Retrieve, Discriminate and Rewrite: A Simple and Effective Framework for Obtaining Affective Response in Retrieval-Based Chatbots
Xin Lu, Yijian Tian, Yanyan Zhao and Bing Qin
Span Fine-tuning for Pre-trained Language Models
Rongzhou Bao, Zhuosheng Zhang and Hai Zhao
DIRECT: Direct and Indirect Responses in Conversational Text Corpus
Junya Takayama, Tomoyuki Kajiwara and Yuki Arase
Retrieval, Analogy, and Composition: A framework for Compositional Generalization in Image Captioning
Zhan Shi, Hui Liu, Martin Renqiang Min, Christopher Malon, Li Erran Li and Xiaodan Zhu
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation
Adaku Uchendu, Zeyu Ma, Thai Le, Rui Zhang and Dongwon Lee
Say `YES’ to Positivity: Detecting Toxic Language in Workplace Communications
Meghana Moorthy Bhat, saghar Hosseini, Ahmed Hassan Awadallah, Paul Bennett and Weisheng Li
Natural SQL: Making SQL Easier to Infer from Natural Language Specifications
Yujian Gan, Xinyun Chen, Jinxia Xie, Matthew Purver, John R. Woodward, John Drake and qiaofu zhang
Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Reading Comprehension
Yiyang Li and Hai Zhao
Few-Shot Novel Concept Learning for Semantic Parsing
Soham Dan, Osbert Bastani and Dan Roth
Are Factuality Checkers Reliable? Adversarial Meta-evaluation of Factuality in Summarization
Yiran Chen, Pengfei Liu and Xipeng Qiu
Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings
Zihao He, Negar Mokhberian, António Câmara, Andres Abeliuk and Kristina Lerman
Re-entry Prediction for Online Conversations via Self-Supervised Learning
Lingzhi Wang, Xingshan Zeng, Huang Hu, Kam-Fai Wong and Daxin Jiang
proScript: Partially Ordered Scripts Generation
Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark and Yejin Choi
Unsupervised Domain Adaptation Method with Semantic-Structural Alignment for Dependency Parsing
Boda Lin, Mingzheng Li, Si Li and Yong Luo
SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks
Wanyu Du and Yangfeng Ji
Leveraging Bidding Graphs for Advertiser-Aware Relevance Modeling in Sponsored Search
Shuxian Bi, Chaozhuo Li, Xiao Han, Zheng Liu, Xing Xie, Haizhen Huang and Zengxuan Wen
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
Kang Min Yoo, Dongju Park, Jaewook Kang, Sang-Woo Lee and Woomyoung Park
Context-aware Entity Typing in Knowledge Graphs
Weiran Pan, Wei Wei and Xian-Ling Mao
Attribute Alignment: Controlling Text Generation from Pre-trained Language Models
Dian Yu, Zhou Yu and Kenji Sagae
Generate & Rank: A Multi-task Framework for Math Word Problems
Jianhao Shen, Yichun Yin, Lin Li, Lifeng Shang, Xin Jiang, Ming Zhang and Qun Liu
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering
Junjie Wang, Yatai Ji, Jiaqi Sun, Yujiu Yang and Tetsuya Sakai
UniteD-SRL: A Unified Dataset for Span- and Dependency-Based Multilingual and Cross-Lingual Semantic Role Labeling
Rocco Tripodi, Simone Conia and Roberto Navigli
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval
Yanmeng Wang, Jun Bai, Ye Wang, Jianfei Zhang, Wenge Rong, Zongcheng Ji, Shaojun Wang and Jing Xiao
GiBERT: Enhancing BERT with Linguistic Information using a Lightweight Gated Injection Method
Nicole Peinelt, Marek Rei and Maria Liakata
RollingLDA: An Update Algorithm of Latent Dirichlet Allocation to Construct Consistent Time Series from Textual Data
Jonas Rieger, Carsten Jentsch and Jörg Rahnenführer
What If Sentence-hood is Hard to Define: A Case Study in Chinese Reading Comprehension
Jiawei Wang, Hai Zhao, Yinggong Zhao and Libin Shen
Refining BERT Embeddings for Document Hashing via Mutual Information Maximization
Zijing Ou, Qinliang Su, Jianxing Yu, Ruihui Zhao, Yefeng Zheng and Bang Liu
REBEL: Relation Extraction By End-to-end Language generation
Pere-Lluís Huguet Cabot and Roberto Navigli
Wine is not v i n. On the Compatibility of Tokenizations across Languages
Antonis Maronikolakis, Philipp Dufter and Hinrich Schütze
Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media
Paul Röttger and Janet Pierrehumbert
Skim-Attention: Learning to Focus via Document Layout
Laura Nguyen, Thomas Scialom, Jacopo Staiano and Benjamin Piwowarski
Give the Truth: Incorporate Semantic Slot into Abstractive Dialogue Summarization
Lulu Zhao, Weihao Zeng, Weiran Xu and Jun Guo
Challenges in Detoxifying Language Models
Johannes Welbl, Amelia Glaese, Jonathan Uesato, Sumanth Dathathri, John Mellor, Lisa Anne Hendricks, Kirsty Anderson, Pushmeet Kohli, Ben Coppin and Po-Sen Huang
Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation
shahar levy, Koren Lazar and Gabriel Stanovsky
Competence-based Curriculum Learning for Multilingual Machine Translation
Mingliang Zhang, Fandong Meng, Yunhai Tong and Jie Zhou
Informed Sampling for Diversity in Concept-to-Text NLG
Giulio Zhou and Gerasimos Lampouras
Novel Natural Language Summarization of Program Code via Leveraging Multiple Input Representations
Fuxiang Chen, Mijung Kim and Jaegul Choo
WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER
Simone Tedeschi, Valentino Maiorca, Niccolò Campolungo, Francesco Cecconi and Roberto Navigli
Classification and Geotemporal Analysis of Quality-of-Life Issues in Tenant Reviews
Adam Haber and Zeev Waks
Uncovering the Limits of Text-based Emotion Detection
Nurudin Alvarez-Gonzalez, Andreas Kaltenbrunner and Vicenç Gómez
Named Entity Recognition for Entity Linking: What Works and What’s Next
Simone Tedeschi, Simone Conia, Francesco Cecconi and Roberto Navigli
Weakly Supervised Semantic Parsing by Learning from Mistakes
Jiaqi Guo, Jian-Guang LOU, Ting Liu and Dongmei Zhang
CodeQA: A Question Answering Dataset for Source Code Comprehension
Chenxiao Liu and Xiaojun Wan
Subword Mapping and Anchoring across Languages
Giorgos Vernikos and Andrei Popescu-Belis
CDLM: Cross-Document Language Modeling
Avi Caciularu, Arman Cohan, Iz Beltagy, Matthew Peters, Arie Cattan and Ido Dagan
Patterns of Polysemy and Homonymy in Contextualised Language Models
Janosch Haber and Massimo Poesio
Controlled Neural Sentence-Level Reframing of News Articles
Wei-Fan Chen, Khalid Al Khatib, Benno Stein and Henning Wachsmuth
DialogueTRM: Exploring Multi-Modal Emotional Dynamics in a Conversation
yuzhao mao, Guang Liu, Xiaojie WANG, Weiguo Gao and Xuan Li
Retrieval Augmented Code Generation and Summarization
Md Rizwan Parvez, Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun, Mingxuan Wang and Lei Li
A Comprehensive Comparison of Word Embeddings in Event & Entity Coreference Resolution.
Judicael POUMAY and Ashwin Ittoo
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang and Liang Lin
Multilingual AMR Parsing with Noisy Knowledge Distillation
Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing and Wai Lam
Open-Domain Contextual Link Prediction and its Complementarity with Entailment Graphs
Mohammad Javad Hosseini, Shay B. Cohen, Mark Johnson and Mark Steedman
Counter-Interference Adapter for Multilingual Machine Translation
Yaoming ZHU, Jiangtao Feng, Chengqi Zhao, Mingxuan Wang and Lei Li
“Be nice to your wife! The restaurants are closed”: Can Gender Stereotype Detection Improve Sexism Classification?
Patricia Chiril, Farah Benamara and Véronique MORICEAU
Automatic Discrimination between Inherited and Borrowed Latin Words in Romance Languages
Alina Maria Cristea, Liviu P. Dinu, Simona Georgescu, Mihnea-Lucian Mihai and Ana Sabina Uban
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
Ruiqi Zhong, Kristy Lee, Zheng Zhang and Dan Klein
Knowledge-Interactive Network with Sentiment Polarity Intensity-Aware Multi-Task Learning for Emotion Recognition in Conversations
Yunhe Xie, Kailai Yang, Chengjie Sun, Bingquan Liu and zhenzhou Ji
Minimizing Annotation Effort via Max-Volume Spectral Sampling
Ariadna Quattoni and Xavier Carreras
Lexicon-Based Graph Convolutional Network for Chinese Word Segmentation
Kaiyu Huang, Hao Yu, Junpeng Liu, Wei Liu, Jingxiang Cao and Degen Huang
KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning
Haonan Li, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin and Nan Duan
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus
Daniela Trotta, Raffaele Guarasci, Elisa Leonardelli and Sara Tonelli
A Discourse-Aware Graph Neural Network for Emotion Recognition in Multi-Party Conversation
Yang Sun, Nan Yu and Guohong Fu
Extract, Integrate, Compete: Towards Verification Style Reading Comprehension
Chen Zhang, Yuxuan Lai, Yansong Feng and Dongyan Zhao
Comparing learnability of two dependency schemes: ‘semantic’ (UD) and ‘syntactic’ (SUD)
Ryszard Tuora, Adam Przepiórkowski and Aleksander Leczkowski
Eliminating Sentiment Bias for Aspect-Level Sentiment Classification with Unsupervised Opinion Extraction
Bo Wang, Tao Shen, Guodong Long, Tianyi Zhou and Yi Chang
Data Efficient Masked Language Modeling for Vision and Language
Yonatan Bitton, Michael Elhadad, Gabriel Stanovsky and Roy Schwartz
Improving Multilingual Neural Machine Translation with Auxiliary Source Languages
Weijia Xu, Yuwei Yin, Shuming Ma, Dongdong Zhang and Haoyang Huang
Locality Preserving Sentence Encoding
Changrong Min, yonghe chu, Liang Yang, Bo Xu and Hongfei LIN
Knowledge Representation Learning with Contrastive Completion Coding
Bo Ouyang, Wenbing Huang, Runfa Chen, Zhixing Tan, Yang Liu, Maosong Sun and Jihong Zhu
Knowledge-Enhanced Evidence Retrieval for Counterargument Generation
Yohan Jo, Haneul Yoo, JinYeong Bak, Alice Oh, Chris Reed and Eduard Hovy
Modeling Mathematical Notation Semantics in Academic Papers
Hwiyeol Jo, Dongyeop Kang, Andrew Head and Marti A. Hearst
Constructing Emotional Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation
Lei Shen, Jinchao Zhang, Jiao Ou, Xiaofang Zhao and Jie Zhou
Automatic rule generation for time expression normalization
Wentao Ding, Jianhao Chen, Jinmao Li and Yuzhong Qu
Visual Cues and Error Correction for Translation Robustness
Zhenhao Li, Marek Rei and Lucia Specia
Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer, David Vilar and Artem Sokolov
Sometimes We Want Ungrammatical Translations
Prasanna Parthasarathi, Koustuv Sinha, Joelle Pineau and Adina Williams
An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog
Xingyao Wang and David Jurgens
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data
Massimo Nicosia, Zhongdi Qu and Yasemin Altun
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application
Chuhan Wu, Fangzhao Wu, Yang Yu, Tao Qi, Yongfeng Huang and Qi Liu
SD-QA: Spoken Dialectal Question Answering for the Real World
Fahim Faisal, Sharlina Keshava, Md Mahfuz Ibn Alam and Antonios Anastasopoulos
The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation
Orevaoghene Ahia, Julia Kreutzer and Sara Hooker
Self-Supervised Neural Topic Modeling
Seyed Ali Bahrainian, Martin Jaggi and Carsten Eickhoff
Distilling the Knowledge of Large-scale Generative Models into Retrieval Models for Efficient Open-domain Conversation
Beomsu Kim, Seokjun Seo, Seungju Han, Enkhbayar Erdenee and Buru Chang
Modeling Users and Online Communities for Abuse Detection: A Position on Ethics and Explainability
Pushkar Mishra, Helen Yannakoudakis and Ekaterina Shutova
Detecting Community Sensitive Norm Violations in Online Conversations
Chan Young Park, Julia Mendelsohn, Karthik Radhakrishnan, Kinjal Jain, Tushar Kanakagiri, David Jurgens and Yulia Tsvetkov
mDAPT: Multilingual Domain Adaptive Pretraining in a Single Model
Rasmus Kær Jørgensen, Mareike Hartmann, Xiang Dai and Desmond Elliott
COSMic: A Coherence-Aware Generation Metric for Image Descriptions
Mert Inan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone and Malihe Alikhani
Relation-Guided Pre-Training for Open-Domain Question Answering
Ziniu Hu, Yizhou Sun and Kai-Wei Chang
MURAL: Multimodal, Multitask Representations Across Languages
Aashi Jain, Mandy Guo, Krishna Srinivasan, Ting Chen, Sneha Kudugunta, Chao Jia, Yinfei Yang and Jason Baldridge
AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models
Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton and Aline Villavicencio
Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration
Weiyan Shi, Yu Li, Saurav Sahay and Zhou Yu
Evidence-based Fact-Checking of Health-related Claims
Mourad Sarrouti, Asma Ben Abacha, Yassine Mrabet and Dina Demner-Fushman
Learning and Analyzing Generation Order for Undirected Sequence Models
Yichen Jiang and Mohit Bansal
Automatic Bilingual Markup Transfer
Thomas Zenkel, Joern Wuebker and John DeNero
Exploring a Unified Sequence-To-Sequence Transformer for Medical Product Safety Monitoring in Social Media
Shivam Raval, Hooman Sedghamiz, Enrico Santus, Tuka Alhanai, Mohammad Ghassemi and Emmanuele Chersoni
Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders
Giangiacomo Mercatali and André Freitas
MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding
Woojeong Jin, Maziar Sanjabi, Shaoliang Nie, Liang Tan, Xiang Ren and Hamed Firooz
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta, Yanping Huang, Ankur Bapna, Maxim Krikun, Dmitry Lepikhin, Minh-Thang Luong and Orhan Firat
TAG: Gradient Attack on Transformer-based Language Models
Jieren Deng, Yijue Wang, Ji Li, CHENGHONG Wang, Chao Shang, Hang Liu, Sanguthevar Rajasekaran and Caiwen Ding
Generating Realistic Natural Language Counterfactuals
Marcel Robeer, Floris Bex and Ad Feelders
Gated Transformer for Robust De-noised Sequence-to-Sequence Modelling
Ayan Sengupta, Amit Kumar, Sourabh Kumar Bhattacharjee and Suman Roy
Token-wise Curriculum Learning for Neural Machine Translation
Chen Liang, Haoming Jiang, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao and Tuo Zhao
RelDiff: Enriching Knowledge Graph Relation Representations for Sensitivity Classification
Hitarth Narvala, Graham McDonald and Iadh Ounis
Post-Editing Extractive Summaries by Definiteness Prediction
Jad Kabbara and Jackie Chi Kit Cheung
Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations
Longxiang Zhang, Renato Negrinho, Arindam Ghosh, Vasudevan Jagannathan, Hamid Reza Hassanzadeh, Thomas Schaaf and Matthew R. Gormley
Distilling Knowledge for Empathy Detection
Mahshid Hosseini and Cornelia Caragea
Retrieval Augmentation Reduces Hallucination in Conversation
Kurt Shuster, Spencer Poff, Moya Chen, Douwe Kiela and Jason Weston
Searching for More Efficient Dynamic Programs
Tim Vieira, Ryan Cotterell and Jason Eisner
Revisiting Robust Neural Machine Translation: A Transformer Case Study
Peyman Passban, Puneeth Saladi and Qun Liu
Can NLI Models Verify QA Systems’ Predictions?
Jifan Chen, Eunsol Choi and Greg Durrett
Parameter-Efficient Domain Knowledge Integration from Multiple Sources for Biomedical Pre-trained Language Models
Qiuhao Lu, Dejing Dou and Thien Huu Nguyen
Contrastive Document Representation Learning with Graph Attention Networks
Peng Xu, Xinchi Chen, Xiaofei Ma, zhiheng huang and Bing Xiang
Convex Aggregation for Opinion Summarization
Hayate Iso, Xiaolan Wang, Yoshihiko Suhara, Stefanos Angelidis and Wang-Chiew Tan
Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings
Sawsan Alqahtani, Garima Lalwani, Yi Zhang, Salvatore Romeo and Saab Mansour
Uncertainty-Aware Machine Translation Evaluation
Taisiya Glushkova, Chrysoula Zerva, Ricardo Rei and André F. T. Martins
Neural Unification for Logic Reasoning over Natural Language
Gabriele Picco, Thanh Lam Hoang, Marco Luca Sbodio and Vanessa Lopez
Benchmarking Meta-embeddings: What Works and What Does Not
Iker García, Rodrigo Agerri and German Rigau
A Plug-and-Play Method for Controlled Text Generation
Damian Pascual, Beni Egressy, Clara Meister, Ryan Cotterell and Roger Wattenhofer
A Corpus-based Syntactic Analysis of Two-termed Unlike Coordination
Julie Kallini and Christiane Fellbaum
Table-based Fact Verification With Salience-aware Learning
Fei Wang, Kexuan Sun, Jay Pujara, Pedro Szekely and Muhao Chen
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage
Isidora Tourni, Lei Guo, Taufiq Daryanto, Fabian Zhafransyah, Edward Edberg Halim, Mona Jalal, Boqi Chen, Sha Lai, Hengchang Hu, Margrit Betke, Prakash Ishwar and Derry Tanti Wijaya
Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots
Wenting Zhao, Ye Liu, Yao Wan and Philip Yu
ARCH: Efficient Adversarial Regularized Training with Caching
Simiao Zuo, Chen Liang, Haoming Jiang, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen and Tuo Zhao
Probing Commonsense Explanation in Dialogue Response Generation
Pei Zhou, Pegah Jandaghi, Hyundong Cho, Bill Yuchen Lin, Jay Pujara and Xiang Ren
NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset
Qiyuan Zhang, Lei Wang, SICHENG YU, Shuohang Wang, Yang Wang, Jing Jiang and Ee-Peng Lim
Textual Time Travel: A Temporally Informed Approach to Theory of Mind
Akshatha Arodi and Jackie Chi Kit Cheung
HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning
Mingyu Derek Ma, Muhao Chen, Te-Lin Wu and Nanyun Peng
Want To Reduce Labeling Cost? GPT-3 Can Help
Shuohang Wang, Yang Liu, Yichong Xu, Chenguang Zhu and Michael Zeng
Written Justifications are Key to Aggregate Crowdsourced Forecasts
Saketh Kotamraju and Eduardo Blanco
Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts
Allen Kim, Charuta Pethe, Naoya Inoue and Steve Skiena
The Topic Confusion Task: A Novel Evaluation Scenario for Authorship Attribution
Malik Altakrori, Jackie Chi Kit Cheung and Benjamin C. M. Fung
Micromodels for Efficient, Explainable, and Reusable Systems: A Case Study on Mental Health
Andrew Lee, Jonathan K. Kummerfeld, Larry An and Rada Mihalcea
Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models
Jaromir Savelka and Kevin Ashley
FCM: A Fine-grained Comparison Model for Multi-turn Dialogue Reasoning
Xu Wang, Hainan Zhang, Shuai Zhao, Yanyan Zou, Hongshen Chen, Zhuoye Ding, Bo Cheng and Yanyan Lan
A Deep Decomposable Model for Disentangling Syntax and Semantics in Sentence Representation
Dingcheng Li, Hongliang Fei, Shaogang Ren and Ping Li
Improved Word Sense Disambiguation with Enhanced Sense Representations
Yang Song, Xin Cai Ong, Hwee Tou Ng and Qian Lin
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Xiang-Yang Li, Tao Qin, Edward Lin and Tie-Yan Liu
Task-Oriented Clustering for Dialogues
chenxu lv, Hengtong Lu, Shuyu Lei, Huixing Jiang, Wei Wu, Caixia Yuan and Xiaojie WANG
Character-based PCFG Induction for Modeling the Syntactic Acquisition of Morphologically Rich Languages
Lifeng Jin, Byung-Doh Oh and William Schuler
Block-wise Word Embedding Compression Revisited: Better Weighting and Structuring
Jong-Ryul Lee, Yong-Ju Lee and Yong-Hyuk Moon
Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven Updates
Xiaochuang Han and Yulia Tsvetkov
Competing Independent Modules for Knowledge Integration and Optimization
Parsa Bagherzadeh and Sabine Bergler
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets
Shraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md. Shad Akhtar, Preslav Nakov and Tanmoy Chakraborty
NICE: Neural Image Commenting with Empathy
Kezhen Chen, Qiuyuan Huang, Daniel McDuff, Xiang Gao, Hamid Palangi, Jianfeng Wang, Kenneth Forbus and Jianfeng Gao
HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks
Xuye Liu, Dakuo Wang, April Wang, Yufang Hou and Lingfei Wu
A multilabel approach to morphosyntactic probing
Naomi Shapiro, Amandalynne Paullada and Shane Steinert-Threlkeld
Co-Teaching Student-Model through Submission Results of Shared Task
Kouta Nakayama, Shuhei Kurita, Akio Kobayashi, Yukino Baba and Satoshi Sekine
Active Learning for Rumor Identification on Social Media
Parsa Farinneya, Mohammad Mahdi Abdollah Pour, Sardar Hamidian and Mona Diab
Aspect-based Sentiment Analysis in Question Answering Forums
Wenxuan Zhang, Yang Deng, Xin Li, Lidong Bing and Wai Lam
Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework
Abhilash Nandy, Soumya Sharma, Shubham Maddhashiya, Kapil Sachdeva, Pawan Goyal and NIloy Ganguly
Comprehensive Punctuation Restoration for English and Polish
Michał Pogoda and Tomasz Walkowiak
Syntactically Diverse Adversarial Network for Knowledge-Grounded Conversation Generation
Fuwei Cui, Hui Di, Hongjie Ren, Kazushige Ouchi, Ze Liu and Jinan Xu
Simple or Complex? Complexity-controllable Question Generation with Soft Templates and Deep Mixture of Experts Model
Sheng Bi, Xiya Cheng, Yuan-Fang Li, Lizhen Qu, Shirong Shen, Guilin Qi, Lu Pan and Yinlin Jiang
Predicting Anti-Asian Hateful Users on Twitter during COVID-19
Jisun An, Haewoon Kwak, Claire Seungeun Lee, Bogang Jun and Yong-Yeol Ahn
Fine-grained Typing of Emerging Entities in Microblogs
Satoshi Akasaki, Naoki Yoshinaga and Masashi Toyoda
Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation
Ke Wang, Yangbin Shi, Jiayi Wang, Yuqi Zhang, Yu Zhao and Xiaolin Zheng
Stacked AMR Parsing with Silver Data
Qingrong Xia, Zhenghua Li, Rui Wang and Min Zhang
MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer
Alan Ansell, Edoardo Maria Ponti, Jonas Pfeiffer, Sebastian Ruder, Goran Glavaš, Ivan Vulić and Anna Korhonen
Sustainable Modular Debiasing of Language Models
Anne Lauscher, Tobias Lueken and Goran Glavaš
A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question Answering
Deyu Zhou, Yanzheng Xiang, Linhai Zhang, Chenchen Ye, Qian-Wen Zhang and Yunbo Cao
Counterfactual Adversarial Learning with Representation Interpolation
Wei Wang, Boxin Wang, Ning Shi, Jinfeng Li, Bingyu Zhu, Xiangyu Liu and Rong Zhang
‘Just What do You Think You’re Doing, Dave?’ A Checklist for Responsible Data Use in NLP
Anna Rogers, Timothy Baldwin and kobi leins
Incorporating Circumstances into Narrative Event Prediction
Shichao Wang, Xiangrui Cai, HongBin Wang and Xiaojie Yuan
HOTTER: Hierarchical Optimal Topic Transport with Explanatory Context Representations
Sabine Wehnert, Christian Scheel, Simona Szakács-Behling, Maret Nieländer, Patrick Mielke and Ernesto William De Luca
Improving Unsupervised Commonsense Reasoning Using Knowledge-Enabled Natural Language Inference
Canming Huang, Weinan He and Yongmei Liu
Does Putting a Linguist in the Loop Improve NLU Data Collection
Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen and Samuel R. Bowman
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
Shane Storks, Qiaozi Gao, Yichi Zhang and Joyce Chai
Making Heads and Tails of Models with Marginal Calibration for Sparse Tagsets
Michael Kranzlein, Nelson F. Liu and Nathan Schneider
GeDi: Generative Discriminator Guided Decoding for Faster Controllable Sequence Generation
Ben Krause, Akhilesh Deepak Gotmare, Bryan McCann, Nitish Shirish Keskar, Shafiq Joty, Richard Socher and Nazneen Fatema Rajani

Short Papers

Language Clustering for Multilingual Named Entity Recognition
Kyle Shaffer
A Web Scale Entity Extraction System
Xuanting Cai, Quanbin Ma, Jianyu Liu, Pan Li, Qi Zeng, Zhengkan Yang and pushkar tripathi
Euphemistic Phrase Detection by Masked Language Model
Wanzheng Zhu and Suma Bhat
Segmenting Natural Language Sentences via Lexical Unit Analysis
Yangming Li, Lemao Liu and Shuming Shi
WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach
Junjie Huang, Duyu Tang, Wanjun Zhong, Shuai Lu, Linjun Shou, Ming Gong, Daxin Jiang and Nan Duan
Mixup Decoding for Diverse Machine Translation
Jicheng Li, Pengzhi Gao, Xuanfu Wu, Yang Feng, Zhongjun He, Hua Wu and Haifeng Wang
An Alignment-Agnostic Model for Chinese Text Error Correction
Liying Zheng, Yue Deng, Weishun Song, Liang Xu and Jing Xiao
Using Question Answering Rewards to Improve Abstractive Summarization
Chulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Ranit Aharonov and Sachindra Joshi
Effect Generation Based on Causal Reasoning
Feiteng Mu, Wenjie Li and Zhipeng Xie
Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation
Xin Huang, Jung-Jae Kim and Bowei Zou
Casting the Same Sentiment Classification Problem
Erik Körner, Ahmad Dawar Hakimi, Gerhard Heyer and Martin Potthast
Detecting Compositionally Out-of-Distribution Examples in Semantic Parsing
Denis Lukovnikov, Sina Daubener and Asja Fischer
Stream-level Latency Evaluation for Simultaneous Machine Translation
Javier Iranzo-Sánchez, Jorge Civera Saiz and Alfons Juan
Rethinking Why Intermediate-Task Fine-Tuning Works
Ting-Yun Chang and Chi-Jen Lu
Efficient Test Time Adapter Ensembling for Low-resource Language Varieties
Xinyi Wang, Yulia Tsvetkov, Sebastian Ruder and Graham Neubig
Learning Hard Retrieval Decoder Attention for Transformers
Hongfei Xu, Qiuhui Liu, Josef van Genabith and Deyi Xiong
Few-Shot Table-to-Text Generation with Prototype Memory
Yixuan Su, Zaiqiao Meng, Simon Baker and Nigel Collier
Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation
Hua Zheng, Lei Li, Damai Dai, Deli Chen, Tianyu Liu, Xu Sun and Yang Liu
Robust Fragment-Based Framework for Cross-lingual Sentence Retrieval
Nattapol Trijakwanich, Peerat Limkonchotiwat, Raheem Sarwar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich and Sarana Nutanong
Calibrate your listeners! Robust communication-based training for pragmatic speakers
Rose Wang, Julia White, Jesse Mu and Noah Goodman
CNNBiF: CNN-based Bigram Features for Named Entity Recognition
Chul Sung, Vaibhava Goel, Etienne Marcheret, Steven Rennie and David Nahamoo
Exploring Decomposition for Table-based Fact Verification
Xiaoyu Yang and Xiaodan Zhu
Effectiveness of Pre-training for Few-shot Intent Classification
Haode Zhang, Yuwei Zhang, Li-Ming Zhan, Jiaxin Chen, Guangyuan SHI, Xiao-Ming Wu and Albert Y.S. Lam
Winnowing Knowledge for Multi-choice Question Answering
Yeqiu Li, Bowei Zou, Zhifeng Li, Ai Ti Aw, Yu Hong and Qiaoming Zhu
ArabicTransformer: Efficient Large Arabic Language Model with Funnel Transformer and ELECTRA Objective
Sultan Alrowili and Vijay Shanker
CVAE-based Re-anchoring for Implicit Discourse Relation Classification
Zujun Dou, Yu Hong, Yu Sun and Guodong Zhou
Improving End-to-End Task-Oriented Dialog System with A Simple Auxiliary Task
Yohan Lee
EDTC: A Corpus for Discourse-Level Topic Chain Parsing
Longyin Zhang, Xin Tan, Fang Kong and Guodong Zhou
BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks
Tuan Lai, Heng Ji and ChengXiang Zhai
Exploring Multitask Learning for Low-Resource Abstractive Summarization
Ahmed Magooda, Diane Litman and Mohamed Elaraby
TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling
Huiyuan Xie, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu and Ann Copestake
When in Doubt: Improving Classification Performance with Alternating Normalization
Menglin Jia, Austin Reiter, Ser-Nam Lim, Yoav Artzi and Claire Cardie
fBERT: A Neural Transformer for Identifying Offensive Content
Diptanu Sarkar, Marcos Zampieri, Tharindu Ranasinghe and Alexander Ororbia
Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization
Ahmed Magooda and Diane Litman
Compositional Data and Task Augmentation for Instruction Following
Soham Dan, Xinran Han and Dan Roth
On the Effects of Transformer Size on In- and Out-of-Domain Calibration
Soham Dan and Dan Roth
GenerativeRE: Incorporating a Novel Copy Mechanism and Pretrained Model for Joint Entity and Relation Extraction
Jiarun Cao and Sophia Ananiadou
Speaker Turn Modeling for Dialogue Act Classification
Zihao He, Leili Tavabi, Kristina Lerman and Mohammad Soleymani
Devil’s Advocate: Novel Boosting Ensemble Method from Psychological Findings for Text Classification
Hwiyeol Jo, Jaeseo Lim and Byoung-Tak Zhang
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models’ Transferability
Wei-Tsung Kao and Hung-yi Lee
Geo-BERT Pre-training Model for Query Rewriting in POI Search
xiao liu, Juan Hu, Qi Shen and Huan Chen
A Neural Graph-based Local Coherence Model
Mohsen Mesgar, Leonardo F. R. Ribeiro and Iryna Gurevych
Attention-based Contrastive Learning for Winograd Schemas
Tassilo Klein and Moin Nabi
Beyond Grammatical Error Correction: Improving L1-influenced research writing in English using pre-trained encoder-decoder models
Gustavo Zomer and Ana Frankenberg-Garcia
Probing Pre-trained Language Models for Semantic Attributes and their Values
Meriem Beloucif and Chris Biemann
Learning Numeracy: A Simple Yet Effective Number Embedding Approach Using Knowledge Graph
Hanyu Duan, Yi Yang and Kar Yan Tam
Cross-Lingual Leveled Reading Based on Language-Invariant Features
Simin Rao, Hua Zheng and Sujian Li
Adversarial Examples for Evaluating Math Word Problem Solvers
Vivek Kumar, Rishabh Maheshwary and Vikram Pudi
Improving Numerical Reasoning Skills in the Modular Approach for Complex Question Answering on Text
Xiao-Yu Guo, Yuan-Fang Li and Gholamreza Haffari
AEDA: An Easier Data Augmentation Technique for Text Classification
Akbar Karimi, Leonardo Rossi and Andrea Prati
Analysis of Language Change in Collaborative Instruction Following
Anna Effenberger, Rhia Singh, Eva Yan, Alane Suhr and Yoav Artzi
Progressive Transformer-Based Generation of Radiology Reports
Farhad Nooralahzadeh, Nicolas Perez Gonzalez, Thomas Frauenfelder, Koji Fujimoto and Michael Krauthammer
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation
Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi and Zhaopeng Tu
Hyperbolic Hierarchy-Aware Knowledge Graph Embedding for Link Prediction
Zhe Pan and Peng Wang
MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection
Matthew Matero, Nikita Soni, Niranjan Balasubramanian and H. Andrew Schwartz
LMSOC: An Approach for Socially Sensitive Pretraining
Vivek Kulkarni, Shubhanshu Mishra and Aria Haghighi
Argumentation-Driven Evidence Association in Criminal Cases
Yefei Teng and WenHan Chao
How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on Isotropy
Sara Rajaee and Mohammad Taher Pilehvar
Investigating Numeracy Learning Ability of a Text-to-Text Transfer Model
Kuntal Kumar Pal and Chitta Baral
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens
Saad Hassan, Matt Huenerfauth and Cecilia Ovesdotter Alm
RW-KD: Sample-wise Loss Terms Re-Weighting for Knowledge Distillation
Peng Lu, Abbas Ghaddar, Ahmad Rashid, Mehdi Rezagholizadeh, Ali Ghodsi and Philippe Langlais
Beyond the Tip of the Iceberg: Assessing Coherence of Text Classifiers
Shane Storks and Joyce Chai
Does Pretraining for Summarization Require Knowledge Transfer?
Kundan Krishna, Jeffrey Bigham and Zachary C. Lipton
SciCap: Generating Captions for Scientific Figures
Ting-Yao Hsu, C Lee Giles and Ting-Hao Huang
SentNoB: A Dataset for Analysing Sentiment on Noisy Bangla Texts
Khondoker Ittehadul Islam, Sudipta Kar, Md Saiful Islam and Mohammad Ruhul Amin
Transformer over Pre-trained Transformer for Neural Text Segmentation with Enhanced Topic Coherence
Kelvin Lo, YUAN JIN, Weicong Tan, Ming Liu, Lan Du and Wray Buntine
Coreference-aware Surprisal Predicts Brain Response
Evan Jaffe, Byung-Doh Oh and William Schuler
SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations
Hooman Sedghamiz, Shivam Raval, Enrico Santus, Tuka Alhanai and Mohammad Ghassemi
A Computational Exploration of Pejorative Language in Social Media
Liviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban and Marcos Zampieri
Do UD Trees Match Mention Spans in Coreference Annotations?
Martin Popel, Zdeněk Žabokrtský, Anna Nedoluzhko, Michal Novák and Daniel Zeman
Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach
Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin and Lili Mou
Model-based analysis of brain activity reveals the hierarchy of language in 305 subjects
charlotte caucheteux, Alexandre Gramfort and Jean-Remi King
Adapting Entities across Languages and Cultures
Denis Peskov, Viktor Hangya, Jordan Boyd-Graber and Alexander Fraser
ODIST: Open World Classification via Distributionally Shifted Instances
Lei Shu, Yassine Benajiba, Saab Mansour and Yi Zhang
LAMAD: A Linguistic Attentional Model for Arabic Text Diacritization
Raeed AL-SABRI and Jianliang Gao
Sequence-to-Lattice Models for Fast Translation
Yuntian Deng and Alexander Rush
Towards Realistic Single-Task Continuous Learning Research for NER
Justin Payan, Yuval Merhav, He Xie, Satyapriya Krishna, Anil Ramakrishna, Mukund Sridhar and Rahul Gupta
Towards Automatic Bias Detection in Knowledge Graphs
Daphna Keidar, Mian Zhong, Ce Zhang, Yash Shrestha and Bibek Paudel
Uncovering Implicit Gender Bias in Narratives through Commonsense Inference
Tenghao Huang, Faeze Brahman, Vered Shwartz and Snigdha Chaturvedi
From None to Severe: Predicting Severity in Movie Scripts
Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez and Thamar Solorio
Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation
An Yan, Zexue He, Xing Lu, Jiang Du, Eric Chang, Amilcare Gentili, Julian McAuley and Chun-Nan Hsu
NUANCED: Natural Utterance Annotation for Nuanced Conversation with Estimated Distributions
Zhiyu Chen, Honglei Liu, Hu Xu, Seungwhan Moon, Hao Zhou and Bing Liu
Multi-task Learning to Enable Location Mention Identification in the Early Hours of a Crisis Event
Sarthak Khanal and Doina Caragea
Graph-Based Decoding for Task Oriented Semantic Parsing
Jeremy R. Cole, Nanjiang Jiang, Panupong Pasupat, Luheng He and Peter Shaw
Expected Validation Performance and Estimation of a Random Variable’s Maximum
Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz and Noah A. Smith
How May I Help You? Using Neural Text Simplification to Improve Downstream NLP Tasks
Hoang Van, Zheng Tang and Mihai Surdeanu
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers
Machel Reid, Edison Marrese-Taylor and Yutaka Matsuo
Leveraging Information Bottleneck for Scientific Document Summarization
Jiaxin Ju, Ming Liu, Huan Yee Koh, YUAN JIN, Lan Du and Shirui Pan
Reconsidering the Past: Optimizing Hidden States in Language Models
Davis Yoshida and Kevin Gimpel
Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based Decoding
Zexue He, Bodhisattwa Prasad Majumder and Julian McAuley
Bag of Tricks for Optimizing Transformer Efficiency
Ye Lin, Yanyang Li, Tong Xiao and Jingbo Zhu
Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation
Xin Zheng, Zhirui Zhang, Shujian Huang, Boxing Chen, Jun Xie, Weihua Luo and Jiajun CHEN
Reference-based Weak Supervision for Answer Sentence Selection using Web Data
Vivek Krishnamurthy, Thuy Vu and Alessandro Moschitti
Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables
Weizhi Wang, Zhirui Zhang, Yichao Du, Boxing Chen, Jun Xie and Weihua Luo
Mitigating Data Poisoning in Text Classification with Differential Privacy
Chang Xu, Jun Wang, Francisco Guzmán, Benjamin Rubinstein and Trevor Cohn
Does Vision-and-Language Pretraining Improve Lexical Grounding?
Tian Yun, Chen Sun and Ellie Pavlick
Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching
Parul Chopra, Sai Krishna Rallabandi, Alan W Black and Khyathi Raghavi Chandu
Learning Task Sampling Policy for Multitask Learning
Dhanasekar Sundararaman, Henry Tsai, Kuang-Huei Lee, Iulia Turc and Lawrence Carin
An Exploratory Study on Long Dialogue Summarization: What Works and What’s Next
Yusen Zhang, Ansong Ni, Tao Yu, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah and Dragomir Radev
Improving Text Auto-Completion with Next Phrase Prediction
Dong-Ho Lee, Zhiqiang Hu and Roy Ka-Wei Lee
KLMo: Knowledge Graph Enhanced Pretrained Language Model with Fine-Grained Relationships
Lei He, Suncong Zheng, Tao Yang and Feng Zhang
Do We Know What We Don’t Know? Studying Unanswerable Questions beyond SQuAD 2.0
Elior Sulem, Jamaal Hay and Dan Roth
Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe Prediction
Boer Lyu, Lu Chen and Kai Yu
Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text
Maya Varma, Laurel Orr, Sen Wu, Megan Leszczynski, Xiao Ling and Christopher Ré
Self-Training using Rules of Grammar for Few-Shot NLU
Joonghyuk Hahn, Hyunjoon Cheon, Kyuyeol Han, Cheongjae Lee, Junseok Kim and Yo-Sub Han
ForumSum: A Multi-Speaker Conversation Summarization Dataset
Misha Khalman, Yao Zhao and Mohammad Saleh
QACE: Asking Questions to Evaluate an Image Caption
Hwanhee Lee, Thomas Scialom, Seunghyun Yoon, Franck Dernoncourt and Kyomin Jung
Secoco: Self-Correcting Encoding for Neural Machine Translation
Tao Wang, Chengqi Zhao, Mingxuan Wang, Lei Li, Hang Li and Deyi Xiong
Data-Efficient Language Shaped Few-shot Image Classification
Zhenwen Liang and Xiangliang Zhang
Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Tomer Wullach, Amir Adler and Einat Minkov
AutoEQA: Auto-Encoding Questions for Extractive Question Answering
Stalin Varanasi, Saadullah Amin and Guenter Neumann
A Multi-label Multi-hop Relation Detection Model based on Relation-aware Sequence Generation
Linhai Zhang, Deyu Zhou, Chao Lin and Yulan He
Don’t Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation Techniques
Hossein Amirkhani and Mohammad Taher Pilehvar
Speculative Sampling in Variational Autoencoders for Dialogue Response Generation
Shoetsu Sato, Naoki Yoshinaga, Masashi Toyoda and Masaru Kitsuregawa
Perceived and Intended Sarcasm Detection with Graph Attention Networks
Joan Plepi and Lucie Flek
Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation
Haoran Yang, Wai Lam and Piji Li
Counter-Contrastive Learning for Language GANs
Yekun Chai, Haidong Zhang, Qiyue Yin and Junge Zhang
MultiFix: Learning to Repair Multiple Errors by Optimal Alignment Learning
HyeonTae Seo, Yo-Sub Han and Sang-Ki Ko
Grammatical Error Correction with Contrastive Learning in Low Error Density Domains
Hannan Cao, Wenmian Yang and Hwee Tou Ng