Finding Structural Knowledge in Multimodal-BERT
Katholieke Univ Leuven, Dept Comp Sci, Leuven, Belgium.
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology; Katholieke Univ Leuven, Dept Comp Sci, Leuven, Belgium; Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark. ORCID iD: 0000-0001-8844-2126
Katholieke Univ Leuven, Dept Comp Sci, Leuven, Belgium.
2022 (English). In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1: Long Papers, Association for Computational Linguistics, 2022, p. 5658-5671. Conference paper, Published paper (Refereed)
Abstract [en]

In this work, we investigate the knowledge learned in the embeddings of multimodal-BERT models. More specifically, we probe their ability to store the grammatical structure of linguistic data and the structure learned over objects in visual data. To reach that goal, we first make the inherent structure of language and visuals explicit by a dependency parse of the sentences that describe the image and by the dependencies between the object regions in the image, respectively. We call this explicit visual structure the scene tree, which is based on the dependency tree of the language description. Extensive probing experiments show that the multimodal-BERT models do not encode these scene trees.

Place, publisher, year, edition, pages
Association for Computational Linguistics, 2022. p. 5658-5671
National Category
General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-484791
ISI: 000828702305053
ISBN: 978-1-955917-21-6 (print)
OAI: oai:DiVA.org:uu-484791
DiVA, id: diva2:1696782
Conference
60th Annual Meeting of the Association for Computational Linguistics (ACL), May 22-27, 2022, Dublin, Ireland
Funder
EU, European Research Council, 788506
Swedish Research Council, 2020-00437

Available from: 2022-09-19 Created: 2022-09-19 Last updated: 2024-01-15 Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

de Lhoneux, Miryam
