Click here to download the CFP file.
Recent advances in large-scale foundation models, including large language models (LLMs), vision-language models (VLMs), and multimodal foundation models, have significantly reshaped the landscape of multimedia processing. Beyond traditional perception and understanding tasks, these models are increasingly being integrated into embodied systems, such as robots, autonomous agents, mixed reality devices, and intelligent environments, where perception, reasoning, and action are tightly coupled.
Embodied multimedia processing introduces new challenges that go beyond static multimedia analysis. These challenges include multimodal sensory fusion (vision, audio, tactile, and proprioception), long-horizon temporal reasoning, real-time interaction, physical world grounding, and safety-critical decision-making. Large models offer unprecedented opportunities to address these challenges by providing unified representations, cross-modal reasoning capabilities, and scalable learning paradigms.
This special session aims to bring together researchers and practitioners from multimedia, robotics, computer vision, natural language processing, and embodied AI communities to explore how large models can be designed, adapted, and deployed for embodied multimedia processing. More importantly, it will foster interdisciplinary discussions on novel algorithms, system architectures, datasets, evaluation protocols, and real-world applications.
The special session invites submissions addressing, but not limited to, the following areas:
All deadlines are at the end of the day specified, anywhere on Earth (UTC-12).
Please use the below link to submit your work.
https://cmt3.research.microsoft.com/MIPR2026
Please note: Submissions to this special session must follow the same formatting guidelines, templates, page limits, and review policies as the “Regular Paper Track” of the main conference. Authors are encouraged to refer to https://mipr2026.org/authors/ for detailed instructions.
Please use the below link to submit your work.