With Five New Multimodal Models Across the 3B, 4B, and 9B Scales, the OpenFlamingo Team Releases OpenFlamingo v2 which Outperforms the Previous Model
A group of researchers from the University of Washington, Stanford, AI2, UCSB, and Google recently developed the OpenFlamingo project, which aims to build models similar to those DeepMind’s Flamingo team. OpenFlamingo models can handle any mixed text and image sequences and produce text as an output. Captioning, visual question answering, and image classification are just…