Higher-order Comparisons of Sentence Encoder RepresentationsShow others and affiliations
2019 (English)In: 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, ASSOC COMPUTATIONAL LINGUISTICS-ACL , 2019, p. 5838-5845Conference paper, Published paper (Refereed)
Abstract [en]
Representational Similarity Analysis (RSA) is a technique developed by neuroscientists for comparing activity patterns of different measurement modalities (e.g., fMRI, electrophysiology, behavior). As a framework, RSA has several advantages over existing approaches to interpretation of language encoders based on probing or diagnostic classification: namely, it does not require large training samples, is not prone to overfitting, and it enables a more transparent comparison between the representational geometries of different models and modalities. We demonstrate the utility of RSA by establishing a previously unknown correspondence between widely-employed pre-trained language encoders and human processing difficulty via eye-tracking data, showcasing its potential in the interpretability toolbox for neural models.
Place, publisher, year, edition, pages
ASSOC COMPUTATIONAL LINGUISTICS-ACL , 2019. p. 5838-5845
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:uu:diva-494310ISI: 000854193306004ISBN: 978-1-950737-90-1 (print)OAI: oai:DiVA.org:uu-494310DiVA, id: diva2:1728012
Conference
Conference on Empirical Methods in Natural Language Processing / 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), NOV 03-07, 2019, Hong Kong, HONG KONG
2023-01-172023-01-172023-01-17Bibliographically approved