4

B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability

Post-hoc explanation methods for black-box models often struggle with faithfulness and human interpretability due to the lack of explainability in current neural models. Meanwhile, B-cos networks have been introduced to improve model explainability …