2024.06.22 [24’ TMLR] Multimodal Chain-of-Thought Reasoning in Language Models Multimodal Chain-of-Thought
2024.06.21 [24’ CVPR] Training Like a Medical Resident: Context-Prior Learning Toward Universal Medical Image Segmentation Medical Prompt Tuning Segmentation Universal Segmentation
2024.06.21 [24’ CVPR] PixelLM: Pixel Reasoning with Large Multimodal Model Multimodal Reasoning Segmentation
2024.06.21 [24’ CVPR] GSVA: Generalized Segmentation via Multimodal Large Language Models Multimodal Referring Segmentation
2024.06.21 [24’ CVPR] GLaMM: Pixel Grounding Large Multimodal Model Multimodal Segmentation Dataset Visual Grounding