Multimodal Fusion
Intermediate

Definition
Combining signals from multiple modalities.

Keywords
cross-modal

Domains
Computer Vision

Related Terms
Cross-Attention: Attention between different modalities.
Optical Flow: Pixel motion estimation between frames.
SLAM: Simultaneous Localization and Mapping for robotics.
3D Reconstruction: Recovering 3D structure from images.
Instance Segmentation: Pixel-level separation of individual object instances.
Semantic Segmentation: Pixel-wise classification of image regions.
Vision Transformer: Transformer applied to image patches.
CLIP: Joint vision-language model aligning images and text.
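
Example
The definition of multimodal fusion above is brief, so a small sketch may help make "combining signals" concrete. It shows two common ways this is often done: feature-level (early) fusion by concatenating per-modality feature vectors, and decision-level (late) fusion by averaging per-modality class scores. The function names, the weight parameter, and the toy numbers are all hypothetical and chosen only for illustration; practical systems typically use learned encoders and learned fusion layers instead.

```python
from typing import List, Sequence


def early_fusion(image_features: Sequence[float],
                 text_features: Sequence[float]) -> List[float]:
    """Feature-level fusion: concatenate the per-modality feature vectors."""
    return list(image_features) + list(text_features)


def late_fusion(image_scores: Sequence[float],
                text_scores: Sequence[float],
                image_weight: float = 0.5) -> List[float]:
    """Decision-level fusion: weighted average of per-class scores."""
    if len(image_scores) != len(text_scores):
        raise ValueError("both modalities must score the same set of classes")
    if not 0.0 <= image_weight <= 1.0:
        raise ValueError("image_weight must lie in [0, 1]")
    text_weight = 1.0 - image_weight
    return [image_weight * i + text_weight * t
            for i, t in zip(image_scores, text_scores)]


# Hypothetical toy inputs (assumed values, not taken from any real model).
image_features = [0.2, 0.7]
text_features = [0.9, 0.1, 0.4]
fused = early_fusion(image_features, text_features)

image_scores = [0.5, 0.5]   # the image model is undecided between two classes
text_scores = [1.0, 0.0]    # the text model strongly prefers the first class
combined = late_fusion(image_scores, text_scores, image_weight=0.5)

print(fused)     # [0.2, 0.7, 0.9, 0.1, 0.4]
print(combined)  # [0.75, 0.25]
```

In this sketch, early fusion leaves it to a downstream model to learn interactions between modalities, while late fusion keeps each modality's model separate and only merges their outputs. Which one fits better depends on the task and the data available.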