2024.06.11 [23’ NIPS] Direct Preference Optimization: Your Language Model is Secretly a Reward Model Language DPO RLHF
2024.06.10 [24’ CVPR] LLaVA-1.5: Improved Baselines with Visual Instruction Tuning Multimodal Adapter Instruction Tuning
2024.06.07 [24’ CVPR] Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs Multimodal Analysis Visual Encoder
2024.06.04 [24’ CVPR] Honeybee: Locality-enhanced Projector for Multimodal LLM Multimodal Adapter Analysis