Main Conference Accepted Papers

Long Papers

A Human Subject Study of Named Entity Recognition in Conversational Music Recommendation Queries
Elena V. Epure and Romain Hennequin

A Hybrid Detection and Generation Framework with Separate Encoders for Event Extraction
Ge Shi, Yunyue Su, Yongliang Ma and Ming Zhou

A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches
Annemarie Friedrich, Nianwen Xue and Alexis Palmer

A Psycholinguistic Analysis of BERT’s Representations of Compounds
Lars Buijtelaar and Sandro Pezzelle

A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing
Sophie Henning, William Beluch, Alexander Fraser and Annemarie Friedrich

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods
Zhihan Zhang, Wenhao Yu, Mengxia Yu, Zhichun Guo and Meng Jiang

A Systematic Search for Compound Semantics in Pretrained BERT Architectures
Filip Miletic and Sabine Schulte im Walde

A Two-Sided Discussion of Preregistration of NLP Research
Anders Søgaard, Daniel Hershcovich and Miryam de Lhoneux

A User-Centered, Interactive, Human-in-the-Loop Topic Modelling System
Zheng Fang, Lama Abdulrahman Alqazlan, Du Liu, Yulan He and Rob Procter

A weakly supervised textual entailment approach to zero-shot text classification
Marc Pàmies, Joan Llop, Francesco Multari, Nicolau Duran-Silva, César Parra-Rojas, Aitor Gonzalez-Agirre, Francesco Alessandro Massucci and Marta Villegas

AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature
Melissa Roemmele, Kyle Shaffer, Katrina R. Olsen, Yiyi Wang and Steve DeNeefe

Adding Instructions during Pretraining: Effective way of Controlling Toxicity in Language Models
Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi and Bryan Catanzaro

Aggregating Crowdsourced and Automatic Judgments to Scale Up a Corpus of Anaphoric Reference for Fiction and Wikipedia Texts
Juntao Yu, Silviu Paun, Maris Camilleri, Paloma Carretero Garcia, Jon Chamberlain, Udo Kruschwitz and Massimo Poesio

An Empirical Study of Clinical Note Generation from Doctor-Patient Encounters
Asma Ben Abacha, Wen-wai Yim, Yadan Fan and Thomas Lin

An In-depth Analysis of Implicit and Subtle Hate Speech Messages
Nicolas Benjamin Ocampo, Ekaterina Sviridova, Elena Cabrio and Serena Villata

Analyzing Challenges in Neural Machine Translation for Software Localization
Sai Koneru, Matthias Huck, Miriam Exel and Jan Niehues

Assessing Out-of-Domain Language Model Performance from Few Examples
Prasann Singhal, Jarad M. Forristal, Xi Ye and Greg Durrett

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering
Wenhu Chen, Pat Verga, Michiel de Jong, John Wieting and William Cohen

Automatic Evaluation and Analysis of Idioms in Neural Machine Translation
Christos Baziotis, Prashant Mathur and Eva Hasler

AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction
Dong-Ho Lee, Ravi Kiran Selvam, Sheikh muhammad Sarwar, Bill Yuchen Lin, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan and Xiang Ren

BERT Is Not The Count: Learning to Match Mathematical Statements with Proofs
Weixian Waylon Li, Yftah Ziser, Maximin Coavoux and Shay B. Cohen

BERT Shows Garden Path Effects
Tovah Irwin, Kyra Wilson and Alec Marantz

BLM-AgrF: A New French Benchmark to Investigate Generalization of Agreement in Neural Networks
Aixiu An, Chunyang Jiang, Maria A. Rodriguez, Vivi Nastase and Paola Merlo

Bootstrapping Multilingual Semantic Parsers using Large Language Models
Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi and Partha Talukdar

Bridging the Gap Between BabelNet and HowNet: Unsupervised Sense Alignment and Sememe Prediction
Xiang Zhang, Ning Shi, Bradley M. Hauer and Grzegorz Kondrak

Can Pretrained Language Models (Yet) Reason Deductively?
Zhangdie Yuan, Songbo Hu, Ivan Vulić, Anna Korhonen and Zaiqiao Meng

Can Synthetic Text Help Clinical Named Entity Recognition? A Study of Electronic Health Records in French
Nicolas Hiebel, Olivier Ferret, Karen Fort and Aurélie Névéol

Characterizing the Entities in Harmful Memes: Who is the Hero, the Villain, the Victim?
Shivam Sharma, Atharva Kulkarni, Tharun Suresh, Himanshi Mathur, Preslav Nakov, Md. Shad Akhtar and Tanmoy Chakraborty

CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models
Steven Y. Feng, Vivek Khetan, Bogdan Sacaleanu, Anatole Gershman and Eduard H. Hovy

CLICK: Contrastive Learning for Injecting Contextual Knowledge to Conversational Recommender System
Hyeongjun Yang, Heesoo Won, Youbin Ahn and Kyong-Ho Lee

Closed-book Question Generation via Contrastive Learning
Xiangjue Dong, Jiaying Lu, Jianling Wang and James Caverlee

Cluster-Guided Label Generation in Extreme Multi-Label Classification
Taehee Jung, Joo-Kyung Kim, Sungjin Lee and Dongyeop Kang

Combining Parameter-efficient Modules for Task-level Generalisation
Edoardo Maria Ponti, Alessandro Sordoni, Yoshua Bengio and Siva Reddy

COMBO: A Complete Benchmark for Open KG Canonicalization
Chengyue Jiang, Yong Jiang, Weiqi Wu, Yuting Zheng, Pengjun Xie and Kewei Tu

Compositional Generalisation with Structured Reordering and Fertility Layers
Matthias Lindemann, Alexander Koller and Ivan Titov

COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models
Kanishka Misra, Julia Rayz and Allyson Ettinger

ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision
Qianying Liu, Wenyu Guan, Jianhao Shen, Fei Cheng and Sadao Kurohashi

Concept-based Persona Expansion for Improving Diversity of Persona-Grounded Dialogue
Donghyun Kim, Youbin Ahn, Chanhee Lee, Wongyu Kim, Kyong-Ho Lee, DongHoon Shin and Yeonsoo Lee

Conclusion-based Counter-Argument Generation
Milad Alshomary and Henning Wachsmuth

ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining
Haoran Zhang, Aysa Xuemo Fan and Rui Zhang

Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues
Mehrad Moradshahi, Victoria C. Tsai, Giovanni Campagna and Monica S. Lam

Contrastive Learning with Keyword-based Data Augmentation for Code Search and Code Question Answering
Shinwoo Park, Youngwook Kim and Yo-Sub Han

Conversational Emotion-Cause Pair Extraction with Guided Mixture of Experts
DongJin Jeong and JinYeong Bak

Conversational Tree Search: A New Hybrid Dialog Task
Dirk Väth, Lindsey Vanderlyn and Ngoc Thang Vu

Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Zhongbin Xie, Vid Kocijan, Thomas Lukasiewicz and Oana-Maria Camburu

COVID-VTS: Fact Extraction and Verification on Short Video Platforms
Fuxiao Liu, Yaser Yacoob and ABHINAV SHRIVASTAVA

Creation and evaluation of timelines for longitudinal user posts
Anthony R. Hills, Adam Tsakalidis, Federico Nanni, Ioannis Zachos and Maria Liakata

CTC Alignments Improve Autoregressive Translation
Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W Black and Shinji Watanabe

CylE: Cylinder Embeddings for Multi-hop Reasoning over Knowledge Graphs
Chau Duc Minh Nguyen, Tim N. French, Wei Liu and Michael Stewart

DeepMaven: Deep Question Answering on Long-Distance Movie/TV Show Videos with Multimedia Knowledge Extraction and Synthesis
Yi Fung, Han Wang, Tong Wang, Ali Kebarighotbi, Mohit Bansal, Heng Ji and Prem Natarajan

DiffQG: Generating Questions to Summarize Factual Changes
Jeremy R. Cole, Palak Jain, Julian Martin Eisenschlos, Michael J.Q. Zhang, Eunsol Choi and Bhuwan Dhingra

DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
Wei Zhao, Michael Strube and Steffen Eger

DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar, Soujanya Abbaraju, Sandipan Dandapat, Sunayana Sitaram and Monojit Choudhury

Do dialogue representations align with perception? An empirical study
Sarenne Carrol Wallbridge, Peter Bell and Catherine Lai

Do we need Label Regularization to Fine-tune Pre-trained Language Models?
Ivan Kobyzev, Aref Jafari, Mehdi Rezagholizadeh, Tianda Li, Alan Do-Omri, Peng Lu, Pascal Poupart and Ali Ghodsi

Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation
Minghao Wu, George Foster, Lizhen Qu and Gholamreza Haffari

Document-Level Planning for Text Simplification
Liam Cripwell, Joël Legrand and Claire Gardent

Don’t Mess with Mister-in-Between: Improved Negative Search for Knowledge Graph Completion
Fan Jiang, Tom Drummond and Trevor Cohn

DREEAM: Guiding Attention with Evidence for Improving Document-Level Relation Extraction
Youmi Ma, An Wang and Naoaki Okazaki

DyLoRA: Parameter-Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Mojtaba Valipour, Mehdi Rezagholizadeh, Ivan Kobyzev and Ali Ghodsi

Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views
Katerina Margatina, Shuai Wang, Yogarshi Vyas, Neha Anna John, Yassine Benajiba and Miguel Ballesteros

Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation
Biao Zhang, Barry Haddow and Rico Sennrich

Efficient Encoders for Streaming Sequence Tagging
Ayush Kaushal, Aditya Gupta, Shyam Upadhyay and Manaal Faruqui

Efficient Hybrid Generation Framework for Aspect-Based Sentiment Analysis
Haoran Lv, Junyi Liu, Henan Wang, Yaoming Wang, Jixiang Luo and Yaxiao Liu

Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
Simeng Sun, Maha Elbayad, Anna Sun and James Cross

End-to-end Case-Based Reasoning for Commonsense Knowledge Base Completion
Zonglin Yang, Xinya Du, Erik Cambria and Claire Cardie

Enhancing Dialogue Summarization with Topic-Aware Global- and Local- Level Centrality
Xinnian Liang, Shuangzhi Wu, Chenhao Cui, Jiaqi Bai, Chao Bian and Zhoujun Li

Enhancing Multi-Document Summarization with Cross-Document Graph-based Information Extraction
Zixuan Zhang, Heba Elfardy, Markus Dreyer, Kevin Small, Heng Ji and Mohit Bansal

Enriching Biomedical Knowledge for Low-resource Language Through Large-scale Translation
Long Phan, Tai Dang, Hieu Tran, Trieu H. Trinh, Vy Phan, Lam Duc Chau and Minh-Thang Luong

Evaluating and Improving the Coreference Capabilities of Machine Translation Models
Asaf Yehudai, Arie Cattan, Omri Abend and Gabriel Stanovsky

Evaluating the Robustness of Discrete Prompts
Yoichi Ishibashi, Danushka Bollegala, Katsuhito Sudoh and Satoshi Nakamura

Event Linking: Grounding Event Mentions to Wikipedia
Xiaodong Yu, Wenpeng Yin, Nitish Gupta and Dan Roth

Event Temporal Relation Extraction with Bayesian Translational Model
Xingwei Tan, Gabriele Pergola and Yulan He

Exploiting Summarization Data to Help Text Simplification
Renliang Sun, Zhixian Yang and Xiaojun Wan

Exploring Category Structure with Contextual Language Models and Lexical Semantic Networks
Joseph Renner, Pascal Denis, Remi Gilleron and Ang�le Brunelli�re

Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Marwa Gaser, Manuel Mager, Injy Hamed, Nizar Habash, Slim Abdennadher and Ngoc Thang Vu

External Knowledge Acquisition for End-to-End Document-Oriented Dialog Systems
Tuan M. Lai, Giuseppe Castellucci, Saar Kuzi, Heng Ji and Oleg Rokhlenko

Extracting or Guessing? Improving Faithfulness of Event Temporal Relation Extraction
Haoyu Wang, Hongming Zhang, Yuqian Deng, Jacob Gardner, Dan Roth and Muhao Chen

Extracting Victim Counts from Text
Mian Zhong, Shehzaad Dhuliawala and Niklas Stoehr

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
Xudong Han, Timothy Baldwin and Trevor Cohn

Faithfulness-Aware Decoding Strategies for Abstractive Summarization
David Wan, Mengwen Liu, Kathleen McKeown, Markus Dreyer and Mohit Bansal

FastKASSIM: A Fast Tree Kernel-Based Syntactic Similarity Metric
Maximillian Chen, Caitlyn K. Chen, Xiao Yu and Zhou Yu

Fiction-Writing Mode: An Effective Control for Human-Machine Collaborative Writing
Wenjie Zhong, Jason Naradowsky, Hiroya Takamura, Ichiro Kobayashi and Yusuke Miyao

Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model
Fei Xia, Yixuan Weng, Shizhu He, Kang Liu and Jun Zhao

Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis, Gijs van Dijck and Gerasimos Spanakis

Friend-training: Learning from Models of Different but Related Tasks
Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Xiabing Zhou and Dong Yu

Generation-Based Data Augmentation for Offensive Language Detection: Is It Worth It?
Camilla Casula and Sara Tonelli

Generative Replay Inspired by Hippocampal Memory Indexing for Continual Language Learning
Aru Maekawa, Hidetaka Kamigaito, Kotaro Funakoshi and Manabu Okumura

GLADIS: A General and Large Acronym Disambiguation Benchmark
Lihu Chen, Gael Varoquaux and Fabian Suchanek

Gold Doesn’t Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information
Shun Shao, Yftah Ziser and Shay B. Cohen

GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models
Archiki Prasad, Peter Hase, Xiang Zhou and Mohit Bansal

How Far Can It Go? On Intrinsic Gender Bias Mitigation for Text Classification
Ewoenam Kwaku Tokpo, Pieter Delobelle, Bettina Berendt and Toon Calders

How people talk about each other: Modeling Generalized Intergroup Bias and Emotion
Venkata Subrahmanyan Govindarajan, Katherine Atwell, Barea Sinno, Malihe Alikhani, David I. Beaver and Junyi Jessy Li

Identifying the limits of transformers when performing model-checking with natural language
Tharindu Madusanka, Riza Batista-Navarro and Ian Pratt-Hartmann

Improving Cross-modal Alignment for Text-Guided Image Inpainting
Yucheng Zhou and Guodong Long

Improving the Generalizability of Collaborative Dialogue Analysis With Multi-Feature Embeddings
Ayesha Enayet and Gita Sukthankar

Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective
Zijian Zhang, Chang Shu, Ya Xiao, Yuan Shen, Di Zhu, Youxin Chen, Jing Xiao, Jey Han Lau, Qian Zhang and Zheng Lu

In-Depth Look at Word Filling Societal Bias Measures
Matúš Pikuliak, Ivana Beňová and Viktor Bachratý

Incorporating Context into Subword Vocabularies
Shaked Yehezkel and Yuval Pinter

Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection
Daniel Deutsch and Dan Roth

Incorporating Task-Specific Concept Knowledge into Script Learning
Chenkai Sun, Tie Xu, ChengXiang Zhai and Heng Ji

Instruction Clarification Requests in Multimodal Collaborative Dialogue Games: Tasks, and an Analysis of the CoDraw Dataset
Brielen Madureira and David Schlangen

Integrating Translation Memories into Non-Autoregressive Machine Translation
Jitao Xu, Josep Crego and François Yvon

Investigating Multi-source Active Learning for Natural Language Inference
Ard Snijders, Douwe Kiela and Katerina Margatina

Investigating UD Treebanks via Dataset Difficulty Measures
Artur Kulmizev and Joakim Nivre

Iterative Document-level Information Extraction via Imitation Learning
Yunmo Chen, William Gantt, Weiwei Gu, Tongfei Chen, Aaron Steven White and Benjamin Van Durme

K-hop neighbourhood regularization for few-shot learning on graphs: A case study of text classification
Niels van der Heijden, Ekaterina Shutova and Helen Yannakoudakis

KGVL-BART: Knowledge Graph Augmented Visual Language BART for Radiology Report Generation
Kaveri Kale, Pushpak Bhattacharyya, Milind Gune, Aditya Shetty and Rustom Lawyer

Know your audience: specializing grounded language models with listener subtraction
Aaditya K. Singh, David Ding, Andrew M. Saxe, Felix Hill and Andrew Kyle Lampinen

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
Sachin Kumar, Vidhisha Balachandran, Lucille Njoo, Antonios Anastasopoulos and Yulia Tsvetkov

Large Scale Multi-Lingual Multi-Modal Summarization Dataset
Yash Verma, Anubhav Jangra, Raghvendra Verma and Sriparna Saha

Learning the Legibility of Visual Text Perturbations
Dev Seth, Rickard Stureborg, Danish Pruthi and Bhuwan Dhingra

Learning to Ignore Adversarial Attacks
Yiming Zhang, Yangqiaoyu Zhou, Samuel Carton and Chenhao Tan

Lessons Learned from a Citizen Science Project for Natural Language Processing
Jan-Christoph Klie, Ji-Ung Lee, Kevin Stowe, Gözde Gül Şahin, Nafise Sadat Moosavi, Luke Bates, Dominic Petrak, Richard Eckart de Castilho and Iryna Gurevych

Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo and James Glass

LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization
Kalpesh Krishna, Erin Bransom, Bailey E. Kuehl, Mohit Iyyer, Pradeep Dasigi, Arman Cohan and Kyle Lo

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation
Nuno M. Guerreiro, Elena Voita and André Martins

LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Kim-Anh Nguyen, Thomas Scialom, Benjamin Piwowarski and Jacopo Staiano

Low-Resource Compositional Semantic Parsing with Concept Pretraining
Subendhu Rongali, Mukund Sridhar, Haidar Khan, Konstantine Arkoudas, Wael Hamza and Andrew McCallum

Made of Steel? Learning Plausible Materials for Components in the Vehicle Repair Domain
Annerose Eichel, Helena Schlipf and Sabine Schulte im Walde

MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting
Oscar Mañas, Pau Rodriguez Lopez, Saba Ahmadi, Aida Nematzadeh, Yash Goyal and Aishwarya Agrawal

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, John E. Ortega, Luis Chiruzzo, Gustavo A. Giménez-Lugo, Rolando Coto-Solano and Katharina Kann

Memory-efficient Temporal Moment Localization in Long Videos
Cristian Rodriguez, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura and Qi Wu

Meta Self-Refinement for Robust Learning with Weak Supervision
Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich and Dietrich Klakow

MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto, Gözde Gül Şahin and Iryna Gurevych

Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models
Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal and Srinivasan Iyer

Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models
Zdeněk Kasner, Ioannis Konstas and Ondrej Dusek

MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers
Mohammadmahdi Nouriborji, Omid Rohanian, Samaneh Kouchaki and David A. Clifton

Mitigating Exposure Bias in Grammatical Error Correction with Data Augmentation and Reweighting
Hannan Cao, Wenmian Yang and Hwee Tou Ng

Modeling Complex Event Scenarios via Simple Entity-focused Questions
Mahnaz Koupaee, Greg Durrett, Nathanael Chambers and Niranjan Balasubramanian

Modelling Temporal Document Sequences for Clinical ICD Coding
Boon Liang Clarence Ng, Diogo Santos and Marek Rei

Models Teaching Models: Improving Model Accuracy with Slingshot Learning
Lachlan S. O’Neill, Nandini Anantharama, Satya Borgohain and Simon D. Angus

MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff, Nouamane Tazi, Loic Magne and Nils Reimers

Multi-Modal Bias: Introducing a Framework for Stereotypical Bias Assessment beyond Gender and Race in Vision–Language Models
Sepehr Janghorbani and Gerard de Melo

Multi2Claim: Generating Scientific Claims from Multi-Choice Questions for Scientific Fact-Checking
Neset Ozkan TAN, Trung Nguyen, Josh Bensemann, Alex Peng, Qiming Bao, Yang Chen, Mark Gahegan and Michael Witbrock

Multilingual Content Moderation: A Case Study on Reddit
Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran and Malihe Alikhani

Multilingual Representation Distillation with Contrastive Learning
Weiting Tan, Kevin Heffernan, Holger Schwenk and Philipp Koehn

Multimodal Event Transformer for Image-guided Story Ending Generation
Yucheng Zhou and Guodong Long

Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He and Xin Eric Wang

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Maulana Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich and Sebastian Ruder

On Evaluation of Document Classifiers using RVL-CDIP
Stefan Larson, Gordon Lim and Kevin Leach

On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex
Terry Yue Zhuo, Zhuang Li, Yujin Huang, Fatemeh Shiri, Weiqing Wang, Gholamreza Haffari and Yuan-Fang Li

Opportunities and Challenges in Neural Dialog Tutoring
Jakub Macina, Nico Daheim, Lingzhi Wang, Tanmay Sinha, Manu Kapur, Iryna Gurevych and Mrinmaya Sachan

PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
Sedrick Scott Keh, Steven Y. Feng, Varun Gangal, Malihe Alikhani and Eduard H. Hovy

Parameter-efficient Modularised Bias Mitigation via AdapterFusion
Deepak Kumar, Oleg Lesota, George Zerveas, Daniel Cohen, Carsten Eickhoff, Markus Schedl and Navid Rekabsaz

Paraphrase Acquisition from Image Captions
Marcel Gohsen, Matthias Hagen, Martin Potthast and Benno Stein

Path Spuriousness-aware Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning
Chunyang Jiang, Tianchen Zhu, Haoyi Zhou, Chang Liu, Ting Deng, Chunming Hu and Jianxin Li

Patient Outcome and Zero-shot Diagnosis Prediction with Hypernetwork-guided Multitask Learning
Shaoxiong Ji and Pekka Marttinen

PCC: Paraphrasing with Bottom-k Sampling and Cyclic Learning for Curriculum Data Augmentation
Hongyuan Lu and Wai Lam

PECO: Examining Single Sentence Label Leakage in Natural Language Inference Datasets through Progressive Evaluation of Cluster Outliers
Michael S. Saxon, Xinyi Wang, Wenda Xu and William Yang Wang

Penguins Don’t Fly: Reasoning about Generics through Instantiations and Exceptions
Emily Allaway, Jena D. Hwang, Chandra Bhagavatula, Kathleen McKeown, Doug Downey and Yejin Choi

Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples
Philipp Sadler and David Schlangen

Performance Prediction via Bayesian Matrix Factorisation for Multilingual Natural Language Processing Tasks
Viktoria Schram, Daniel Beck and Trevor Cohn

Persona Expansion with Commonsense Knowledge for Diverse and Consistent Response Generation
Donghyun Kim, Youbin Ahn, Wongyu Kim, Chanhee Lee, KyungChan Lee, Kyong-Ho Lee, jeonguk kim, DongHoon Shin and Yeonsoo Lee

PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search
Thang Minh Pham, Seunghyun Yoon, Trung Bui and Anh Nguyen

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Wenliang Dai, Zihan Liu, Ziwei Ji, Dan Su and Pascale Fung

Policy-based Reinforcement Learning for Generalisation in Interactive Text-based Environments
Edan Toledo, Jan Buys and Jonathan Shock

Poor Man’s Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang and Mrinmaya Sachan

Probabilistic Robustness for Data Filtering
Yu Yu, Abdul Rafae Khan, Shahram Khadivi and Jia Xu

Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Ivan Vulić, Goran Glavaš, Fangyu Liu, Nigel Collier, Edoardo Maria Ponti and Anna Korhonen

Probing Power by Prompting: Harnessing Pre-trained Language Models for Power Connotation Framing
Shima Khanehzar, Trevor Cohn, Gosia Mikolajczak and Lea Frermann

Prompt Tuning with Contradictory Intentions for Sarcasm Recognition
Yiyi Liu, Ruqing Zhang, Yixing Fan, Jiafeng Guo and Xueqi Cheng

PromptDA: Label-guided Data Augmentation for Prompt-based Few Shot Learners
Canyu Chen and Kai Shu

Quantifying Context Mixing in Transformers
Hosein Mohebbi, Willem Zuidema, Grzegorz Chrupała and Afra Alishahi

Question Generation Using Sequence-to-Sequence Model with Semantic Role Labels
Alireza Naeiji, Aijun An, Heidar Davoudi, Marjan Delpisheh and Muath Alzghool

Question-Answer Sentence Graph for Joint Modeling Answer Selection
Roshni G. Iyer, Thuy Vu, Alessandro Moschitti and Yizhou Sun

Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow
Anjana Arunkumar, Swaroop Mishra, Bhavdeep Singh Sachdeva, Chitta Baral and Chris Bryan

Realistic Conversational Question Answering with Answer Selection based on Calibrated Confidence and Uncertainty Measurement
Soyeong Jeong, Jinheon Baek, Sung Ju Hwang and Jong Park

Reinforced Sequence Training based Subjective Bias Correction
Karthic Madanagopal and James Caverlee

Representation biases in sentence transformers
Dmitry Nikolaev and Sebastian Padó

Retrieval-augmented Image Captioning
Rita Parada Ramos, Desmond Elliott and Bruno Martins

Retrieve-and-Fill for Scenario-based Task-Oriented Semantic Parsing
Akshat Shrivastava, Shrey Desai, Anchit Gupta, Ali Elkahky, Aleksandr Livshits, Alexander Zotov and Ahmed Aly

RevUp: Revise and Update Information Bottleneck for Event Representation
Mehdi Rezaee and Francis Ferraro

Robustification of Multilingual Language Models to Real-world Noise in Crosslingual Zero-shot Settings with Robust Contrastive Pretraining
Asa Cooper Stickland, Sailik Sengupta, Jason Krone, Saab Mansour and He He

Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding
Mengnan Du, Subhabrata Mukherjee, Yu Cheng, Milad Shokouhi, Xia Hu and Ahmed Hassan Awadallah

RPTCS: A Reinforced Persona-aware Topic-guiding Conversational System
Zishan Ahmad, Kshitij Mishra, Asif Ekbal and Pushpak Bhattacharyya

Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation
Jinhui Ye, Wenxiang Jiao, Xing Wang and Zhaopeng Tu

Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information
Yen Ting Lin, Alexandros Papangelis, Seokhwan Kim, Sungjin Lee, Devamanyu Hazarika, Mahdi Namazifar, Di Jin, Yang Liu and Dilek Hakkani-Tur

Self-Adapted Utterance Selection for Suicidal Ideation Detection in Lifeline Conversations
Zhong-Ling Wang, Po-Hsien Huang, Wen-Yau Hsu and Hen-Hsen Huang

Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Kosuke Nishida, Naoki Yoshinaga and Kyosuke Nishida

Self-imitation Learning for Action Generation in Text-based Games
Zijing Shi, Yunqiu Xu, Meng Fang and Ling Chen

Self-training Reduces Flicker in Retranslation-based Simultaneous Translation
Sukanta Sen, Rico Sennrich, Biao Zhang and Barry Haddow

Semantic Frame Induction with Deep Metric Learning
Kosuke Yamada, Ryohei Sasano and Koichi Takeda

Semantic Parsing for Conversational Question Answering over Knowledge Graphs
Laura Perez-Beltrachini, Parag Jain, Emilio Monti and Mirella Lapata

Semantic Specialization for Knowledge-based Word Sense Disambiguation
Sakae Mizuki and Naoaki Okazaki

Semi-supervised New Event Type Induction and Description via Contrastive Loss-Enforced Batch Attention
Carl Edwards and Heng Ji

Semi-supervised Relation Extraction via Data Augmentation and Consistency-training
Komal Teru

Sentiment as an Ordinal Latent Variable
Niklas Stoehr, Ryan Cotterell and Aaron Schein

Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers
William Held and Diyi Yang

Shironaam: Bengali News Headline Generation using Auxiliary Information
Abu Ubaida Akash, Mir Tafseer Nayeem, Faisal Tareque Shohan and Tanvir Islam

Shortcomings of Question Answering Based Factuality Frameworks for Error Localization
Ryo Kamoi, Tanya Goyal and Greg Durrett

Shorten the Long Tail for Rare Entity and Event Extraction
Pengfei Yu and Heng Ji

Should You Mask 15% in Masked Language Modeling?
Alexander Wettig, Tianyu Gao, Zexuan Zhong and Danqi Chen

Social Commonsense for Explanation and Cultural Bias Discovery
Lisa Bauer, Hanna Leth Tischer and Mohit Bansal

Social Influence Dialogue Systems: A Survey of Datasets and Models For Social Influence Tasks
Kushal Chawla, Weiyan Shi, Jingwen Zhang, Gale Lucas, Zhou Yu and Jonathan Gratch

Socratic Question Generation: A Novel Dataset, Models, and Evaluation
Beng Heng Ang, Sujatha Das Gollapalli and See-Kiong Ng

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Haozhe An, Zongxia Li, Jieyu Zhao and Rachel Rudinger

Span-based Named Entity Recognition by Generating and Compressing Information
Nhung Nguyen, Makoto Miwa and Sophia Ananiadou

StyLEx: Explaining Style Using Human Lexical Annotations
Shirley Anugrah Hayati, Kyumin Park, Dheeraj Rajagopal, Lyle Ungar and Dongyeop Kang

Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages
Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang

Synthesizing Human Gaze Feedback for Improved NLP Performance
Varun Khurana, Yaman Kumar, Nora Hollenstein, Rajesh Kumar and Balaji Krishnamurthy

Task and Sentiment Adaptation for Appraisal Tagging
Lin Tian, Xiuzhen Zhang, Myung Hee Kim and Jennifer Biggs

Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers
Minsoo Kim, Kyuhong Shim, Seongmin Park, Wonyong Sung and Jungwook Choi

The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification
Mohsen Mesgar, Thy Thy Tran, Goran Glavaš and Iryna Gurevych

The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models
Son Q. Tran, Phong Nguyen-Thuan Do, Uyen Phuong Le and Matt Kretchmar

The NLP Task Effectiveness of Long-Range Transformers
Guanghui Qin, Yukun Feng and Benjamin Van Durme

The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents
Xing Han Lu, Siva Reddy and Harm de Vries

Towards Integration of Discriminability and Robustness for Document-Level Relation Extraction
Jia Guo, Stanley Kok and Lidong Bing

TraVLR: Now You See It, Now You Don’t! A Bimodal Dataset for Evaluating Visio-Linguistic Reasoning
Keng Ji Chow, Samson Tan and Min-Yen Kan

Triple-Hybrid Energy-based Model Makes Better Calibrated Natural Language Understanding Models
haotian xu and Yingying Zhang

TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs
Aditya Sharma, Apoorv Saxena, Chitrank Gupta, Seyed Mehran Kazemi, Partha Talukdar and Soumen Chakrabarti

UDAPTER - Efficient Domain Adaptation Using Adapters
Bhavitvya Malik, Abhinav Ramesh Kashyap, Min-Yen Kan and Soujanya Poria

Uncovering Implicit Inferences for Improved Relational Argument Mining
Ameer Hassan Saadat-Yazdi, Jeff Z. Pan and Nadin Kokciyan

Understanding Transformer Memorization Recall Through Idioms
Adi Haviv, Ido Cohen, Jacob Gidron, Roei Schuster, Yoav Goldberg and Mor Geva

UnifEE: Unified Evidence Extraction for Fact Verification
Nan Hu, Zirui Wu, Yuxuan Lai, Chen Zhang and Yansong Feng

Unified Neural Topic Model via Contrastive Learning and Term Weighting
Sungwon Han, Mingi Shin, Sungkyu Park, Changwook Jung and Meeyoung Cha

Unsupervised Anomaly Detection in Multi-Topic Short-Text Corpora
Mira Ait-Saada and Mohamed Nadif

UScore: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation
Jonas Belouadi and Steffen Eger

ViHOS: Hate Speech Spans Detection for Vietnamese
Phu Gia Hoang, Canh Duc Luu, Khanh Quoc Tran, Kiet Van Nguyen and Ngan Nguyen

Vote’n’Rank: Revision of Benchmarking with Social Choice Theory
Mark Rofin, Vladislav Mikhailov, Mikhail Florinsky, Andrey Kravchenko, Tatiana Shavrina, Elena Tutubalina, Daniel Karabekyan and Ekaterina Artemova

Weakly-Supervised Questions for Zero-Shot Relation Extraction
Saeed Najafi and Alona Fyshe

What Clued the AI Doctor In? On the Influence of Data Source and Quality for Transformer-Based Medical Self-Disclosure Detection
Mina Valizadeh, Xing XQ Qian, Pardis Ranjbar-Noiey, Cornelia Caragea and Natalie Parde

What Did You Learn To Hate? A Topic-Oriented Analysis of Generalization in Hate Speech Detection
Tom Bourgeade, Patricia Chiril, Farah Benamara and Véronique MORICEAU

What happens before and after: Multi-Event Commonsense in Event Coreference Resolution
Sahithya Ravi, Chris Tanner, Raymond Ng and Vered Shwartz

What Makes Sentences Semantically Related? A Textual Relatedness Dataset and Empirical Study
Mohamed Abdalla, Krishnapriya Vishnubhotla and Saif M. Mohammad

What’s New? Summarizing Contributions in Scientific Literature
Hiroaki Hayashi, Wojciech Kryscinski, Bryan McCann, Nazneen Rajani and Caiming Xiong

When Do Pre-Training Biases Propagate to Downstream Tasks? A Case Study in Text Summarization
Faisal Ladhak, Esin Durmus, Mirac Suzgun, Tianyi Zhang, Dan Jurafsky, Kathleen McKeown and Tatsunori Hashimoto

Why Can’t Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
Yang Janet Liu and Amir Zeldes

Why Don’t You Do It Right? Analysing Annotators’ Disagreement in Subjective Tasks
Marta Sandri, Elisa Leonardelli, Sara Tonelli and Elisabetta Jezek

ZELDA: A Comprehensive Benchmark for Supervised Entity Disambiguation
Marcel Milich and Alan Akbik

Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation
Mehrad Moradshahi, Sina Semnani and Monica S. Lam

Short Papers

“John is 50 years old, can his son be 65?” Evaluating NLP Models’ Understanding of Feasibility
Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Joseph Scaria, Siddharth Goyal and Chitta Baral

A Discerning Several Thousand Judgments: GPT-3 Rates the Article + Adjective + Numeral + Noun Construction
Kyle Mahowald

A Federated Approach for Hate Speech Detection
Jay Gala, Deep Rajesh Gandhi, Jash Jayesh Mehta and Zeerak Talat

A simple but effective model for attachment in discourse parsing with multi-task learning for relation labeling
Zineb Bennis, Julie Hunter and Nicholas Asher

Assistive Recipe Editing through Critiquing
Diego Antognini, Shuyang Li, Boi Faltings and Julian McAuley

Behavior Cloned Transformers are Neurosymbolic Reasoners
Ruoyao Wang, Peter Jansen, Marc-Alexandre Cote and Prithviraj Ammanabrolu

Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples
Masahiro Kaneko, Danushka Bollegala and Naoaki Okazaki

Contextual Dynamic Prompting for Response Generation in Task-oriented Dialog Systems
Sandesh Swamy, Narges Tabari, Chacha Chen and Rashmi Gangadharaiah

Detecting Lexical Borrowings from Dominant Languages in Multilingual Wordlists
John Edward Miller and Johann-Mattis List

Do Deep Neural Networks Capture Compositionality in Arithmetic Reasoning?
Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi, Ana Brassard, Masashi Yoshikawa, Keisuke Sakaguchi and Kentaro Inui

Do Neural Topic Models Really Need Dropout? Analysis of the Effect of Dropout in Topic Modeling
Suman Adhya, Avishek Lahiri and Debarshi Kumar Sanyal

Do Pretrained Contextual Language Models Distinguish between Hebrew Homograph Analyses?
Avi Shmidman, Cheyn Shmuel Shmidman, Dan Bareket, Moshe Koppel and Reut Tsarfaty

Don’t Blame the Annotator: Bias Already Starts in the Annotation Instructions
Mihir Parmar, Swaroop Mishra, Mor Geva and Chitta Baral

Empathy Identification Systems are not Accurately Accounting for Context
Andrew Lee, Jonathan K. Kummerfeld, Larry An and Rada Mihalcea

Entity Disambiguation with Entity Definitions
Luigi Procopio, Simone Conia, Edoardo Barba and Roberto Navigli

Entity Tracking via Effective Use of Multi-Task Learning Model and Mention-guided Decoding
Janvijay Singh, Fan Bai and Zhen Wang

Exploring Paracrawl for Document-level Neural Machine Translation
Yusser Al Ghussin, Jingyi Zhang and Josef van Genabith

FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning
Yucheng Li, Shun Wang, Chenghua Lin, Frank Guerin and Loic Barrault

Guide the Learner: Controlling Product of Experts Debiasing Method Based on Token Attribution Similarities
Ali Modarressi, Hossein Amirkhani and Mohammad Taher Pilehvar

How do Words Contribute to Sentence Semantics? Revisiting Sentence Embeddings with a Perturbation Method
Wenlin Yao, Lifeng Jin, Hongming Zhang, Xiaoman Pan, Kaiqiang Song, Dian Yu, Dong Yu and Jianshu Chen

How Many and Which Training Points Would Need to be Removed to Flip this Prediction?
Jinghan Yang, Sarthak Jain and Byron C. Wallace

Improving Sign Recognition with Phonology
Lee Kezar, Jesse Thomason and Zed Sevcikova Sehyr

Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation
Zoey Liu, Justin Spence and Emily Prud’hommeaux

Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural Adapters
Sebastien Montella, Alexis Nasr, Johannes Heinecke, Frederic Bechet and Lina M. Rojas Barahona

IRMA: the 335-million-word Italian coRpus for studying MisinformAtion
Fabio Carrella, Alessandro Miani and Stephan Lewandowsky

LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation
Zhuoyuan Mao and Tetsuji Nakagawa

Leveraging Task Dependency and Contrastive Learning for Case Outcome Classification on European Court of Human Rights Cases
Santosh T.Y.S.S, Marcel Leon Perez San Blas, Phillip Kemper and Matthias Grabmair

LingMess: Linguistically Informed Multi Expert Scorers for Coreference Resolution
Shon Otmazgin, Arie Cattan and Yoav Goldberg

LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Yilun Zhao, Zhenting Qi, Linyong Nan, Lorenzo Jaime Yu Flores and Dragomir Radev

Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb, Lilja Øvrelid and Erik Velldal

Metaphor Detection with Effective Context Denoising
Shun Wang, Yucheng Li, Chenghua Lin, Loic Barrault and Frank Guerin

Multilingual Normalization of Temporal Expressions with Masked Language Models
Lukas Lange, Jannik Strötgen, Heike Adel and Dietrich Klakow

Nationality Bias in Text Generation
Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao Kenneth Huang and Shomir Wilson

On the inconsistency of separable losses for structured prediction
Caio Corro

On the Intersection of Context-Free and Regular Languages
Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner and Ryan Cotterell

Parameter-Efficient Korean Character-Level Language Modeling
Marco Cognetta, Sangwhan Moon, Lawrence Wolf-Sonkin and Naoaki Okazaki

Parameter-Efficient Tuning with Special Token Adaptation
Xiaocong Yang, James Y. Huang, Wenxuan Zhou and Muhao Chen

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies
Md Rizwan Parvez, Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian and Kai-Wei Chang

Salient Span Masking for Temporal Understanding
Jeremy R. Cole, Aditi Chaudhary, Bhuwan Dhingra and Partha Talukdar

Step by Step Loss Goes Very Far: Multi-Step Quantization for Adversarial Text Attacks
Piotr Gaiński and Klaudia Patrycja Bałazy

SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains
Koustava Goswami, Lukas Lange, Jun Araki and Heike Adel

Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings
Rohan Pandey

Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing
Jivnesh Sandhan, Laxmidhar Behera and Pawan Goyal

The Functional Relevance of Probed Information: A Case Study
Michael Hanna, Roberto Zamparelli and David Mareček

Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model
Mayank Kulkarni, Daniel Preotiuc-Pietro, Karthik Radhakrishnan, Genta Indra Winata, Shijie Wu, Lingjue Xie and Shaohua Yang

Towards More Efficient Insertion Transformer with Fractional Positional Encoding
Zhisong Zhang, Yizhe Zhang and Bill Dolan

Towards preserving word order importance through Forced Invalidation
Hadeel Al-Negheimish, Pranava Madhyastha and Alessandra Russo

Unsupervised Improvement of Factual Knowledge in Language Models
Nafis Sadeq, Byungkyu Kang, Prarit Lamba and Julian McAuley

WinoDict: Probing language models for in-context word acquisition
Julian Martin Eisenschlos, Jeremy R. Cole, Fangyu Liu and William Cohen