2024.10.14 [24’ EMNLP] TroL: Traversal of Layers for Large Language and Vision Models Multimodal Adapter
2024.10.14 [24’ EMNLP] MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model Multimodal Interpretability
2024.10.14 [24’ ACL] Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space Multimodal Alignment
2024.10.13 [24’ ECCV] BLINK: Multimodal Large Language Models Can See but Not Perceive Multimodal Benchmark Perception
2024.10.11 [24’] Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations Multimodal Interpretability