We found that 50% of LAION-2B samples contain the Parrot Captions (concurent text in captions and print pixels) and the Parrot Captions do huge impact on CLIP-style Vision-Language Alignment...
I'm a researcher at Shanghai AI Lab working on data-centric research. Before that I obtained bachelor and master degree at Sun Yat-Sen University (SYSU).