Johannes Rückert, Louise Bloch, Christoph M. Friedrich
Large Vision Language Models (VLMs) can effectively analyze scientific diagrams for compliance with visualization guidelines, though they struggle with certain aspects like image quality and tick marks.
In scientific publications, diagrams are crucial for conveying data, but they often don't follow established visualization guidelines, potentially leading to misinformation. This study uses advanced AI models, known as Vision Language Models, to examine diagrams and identify where they might violate these guidelines. The models were found to be quite effective at spotting issues like missing labels and unnecessary 3D effects, although they were less reliable at assessing image quality and tick marks. This research suggests that such AI tools could help improve the accuracy and clarity of data visualizations in scientific literature.