Abstract: Multimodal (MM) large language models (MLLMs) have achieved remarkable success in image- and region-level remote sensing (RS) image understanding tasks, such as image captioning (IC), visual ...
Abstract: Medical Image Captioning aims to obtain an accurate and textual description from medical images, supporting healthcare professionals in diagnostic processes and reducing manual workload.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results