X-VILA: Cross-Modality Alignment for Large Language Model

Abstract

Abstract

Publication
In arxiv