Embracing the potential of multimodal learning