Interpretability and Analysis of Language Models

Although large language models are now widespread, little is known about the exact mechanisms that contribute to their success. In this project, we developed different analysis methods (e.g., attention pattern analysis) to understand how information is represented in large language models.

Publications

V. Lialin K. Zhao N. Shivagunde A. Rumshisky Life after BERT: What do Other Muppets Understand about Language? Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

@article{lialin2022life, title={Life after BERT: What do Other Muppets Understand about Language?}, author={Lialin, Vladislav and Zhao, Kevin and Shivagunde, Namrata and Rumshisky, Anna}, journal={arXiv preprint arXiv:2205.10696}, year={2022}}

S. Prasanna A. Rogers A. Rumshisky When BERT Plays the Lottery, All Tickets Are Winning. Proceedings of EMNLP 2020.

@article{prasanna2020bert, title={When BERT Plays the Lottery, All Tickets Are Winning}, author={Prasanna, Sai and Rogers, Anna and Rumshisky, Anna}, journal={arXiv preprint arXiv:2005.00561}, year={2020} }

O. Kovaleva S. Kulshreshtha A. Rogers A. Rumshisky BERT Busters: Outlier Dimensions that Disrupt Transformers. arXiv preprint arXiv:2105.06990

@article{kovaleva2021bert, title={BERT busters: Outlier dimensions that disrupt transformers}, author={Kovaleva, Olga and Kulshreshtha, Saurabh and Rogers, Anna and Rumshisky, Anna}, journal={arXiv preprint arXiv:2105.06990}, year={2021}}

O. Kovaleva A. Romanov A. Rogers A. Rumshisky Revealing the Dark Secrets of BERT. Proceedings of EMNLP 2019. Hong Kong, China.

@inproceedings{kovaleva2019revealing, title={Revealing the Dark Secrets of BERT}, author={Kovaleva, Olga and Romanov, Alexey and Rogers, Anna and Rumshisky, Anna}, booktitle={Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)}, pages={4356--4365}, year={2019} }

A. Rogers O. Kovaleva A. Rumshisky A Primer in BERTology: What We Know about How BERT Works. Accepted to TACL 2020.

@article{rogers2020primer, title={A primer in bertology: What we know about how bert works}, author={Rogers, Anna and Kovaleva, Olga and Rumshisky, Anna}, journal={arXiv preprint arXiv:2002.12327}, year={2020} }