2024.06.21 [24’ CVPR] GSVA: Generalized Segmentation via Multimodal Large Language Models Multimodal Referring Segmentation
2024.06.21 [23’ CVPR] GRES: Generalized Referring Expression Segmentation Multimodal Referring Segmentation
2024.06.21 [24’ CVPR] GLaMM: Pixel Grounding Large Multimodal Model Multimodal Segmentation Dataset Visual Grounding
2024.06.21 [24’ CVPR] Training Like a Medical Resident: Context-Prior Learning Toward Universal Medical Image Segmentation Medical Prompt Tuning Segmentation Universal Segmentation
2024.06.20 [23’ ICLR] Auto-CoT: Automatic Chain of Thought Prompting in Large Language Models Language Chain-of-Thought