Composition 2.0: Toward a multilingual and multimodal framework