Field
Multimodal
Systems that connect text, vision, audio, and other modalities.
4 papers · latest 2026-04-22
Common topics in this field
Seunghee Han, Jaewoong Lee, Jihan Kim
Multimodal Transformer models sample-level variability in MOFs, not just framework identity—enabling accurate property prediction for real experimental materials, transforming ML in materials science.
Habibeh Naderi, Behrouz Haji Soleimani, Stan Matwin
CALIBER introduces Bayesian low-rank adaptation for uncertainty-aware multimodal learning, enabling robust, efficient fine-tuning in low-resource settings—essential for builders deploying reliable multimodal systems under data scarcity.
Farbod Alinezhad, Jianfei Cao, Gary J. Young et al.
CDM is the first diffusion model for counterfactual longitudinal outcomes, enabling accurate, uncertainty-quantified treatment effect predictions—vital for clinical decision systems and causal AI in healthcare.
Kaiqi Hu, Linda Xiao, Shiyue Xu et al.
Introduces the first rigorous benchmark proving whether VLMs truly understand candlestick patterns—not just correlate them—essential for financial AI builders relying on visual market signal interpretation.