Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Fine-Grained Controllable Text Generation Using Non-Residual Prompting
Show others and affiliations
2022 (English)In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: (Volume 1: Long Papers), Association for Computational Linguistics, 2022, p. 6837-6857Conference paper, Published paper (Refereed)
Abstract [en]

The introduction of immensely large Causal Language Models (CLMs) has rejuvenated the interest in open-ended text generation. However, controlling the generative process for these Transformer-based models is at large an unsolved problem. Earlier work has explored either plug-and-play decoding strategies, or more powerful but blunt approaches such as prompting. There hence currently exists a trade-off between fine-grained control, and the capability for more expressive high-level instructions. To alleviate this trade-off, we propose an encoder-decoder architecture that enables intermediate text prompts at arbitrary time steps. We propose a resource-efficient method for converting a pre-trained CLM into this architecture, and demonstrate its potential on various experiments, including the novel task of contextualized word inclusion. Our method provides strong results on multiple experimental settings, proving itself to be both expressive and versatile.

Place, publisher, year, edition, pages
Association for Computational Linguistics, 2022. p. 6837-6857
National Category
Natural Language Processing
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-492063DOI: 10.18653/v1/2022.acl-long.471ISI: 000828702306067ISBN: 978-1-955917-21-6 (print)OAI: oai:DiVA.org:uu-492063DiVA, id: diva2:1722921
Conference
60th Annual Meeting of the Association for Computational Linguistics, 22-27 May, 2022, Dublin, Ireland
Funder
Vinnova, 2019-02996Available from: 2023-01-01 Created: 2023-01-01 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Nivre, Joakim

Search in DiVA

By author/editor
Nivre, Joakim
Natural Language Processing

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 44 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf