The increasing amount of slide presentations in various sectors has amplified the need for effective slide layout and semantic analysis. However, we found that current slide datasets contain inconsistencies, mislabels, and incomplete annotations. Using them as a basis for developing deep learning-based slide analysis models could lead to models that are not robust and suboptimal. Addressing these challenges, we introduce SlideCraft, a tool for creating synthetic slide datasets that imitate real-world presentations. This tool overcomes the drawbacks of existing datasets by allowing users to create balanced, diverse, and accurately annotated slide data. We demonstrate SlideCraft's efficacy in enhancing slide layout analysis algorithms, focusing on its capability to improve dataset quality and object detection performance.
@InProceedings{10.1007/978-3-031-70533-5_6,
author="Seng, Travis
and Carlier, Axel
and Forgione, Thomas
and Charvillat, Vincent
and Ooi, Wei Tsang",
editor="Barney Smith, Elisa H.
and Liwicki, Marcus
and Peng, Liangrui",
title="SlideCraft: Synthetic Slides Generation for Robust Slide Analysis",
booktitle="Document Analysis and Recognition - ICDAR 2024",
year="2024",
publisher="Springer Nature Switzerland",
address="Cham",
pages="79--96",
abstract="The increasing amount of slide presentations in various sectors has amplified the need for effective slide layout and semantic analysis. However, we found that current slide datasets contain inconsistencies, mislabels, and incomplete annotations. Using them as a basis for developing deep learning-based slide analysis models could lead to models that are not robust and suboptimal. Addressing these challenges, we introduce SlideCraft, a tool for creating synthetic slide datasets that imitate real-world presentations. This tool overcomes the drawbacks of existing datasets by allowing users to create balanced, diverse, and accurately annotated slide data. We demonstrate SlideCraft's efficacy in enhancing slide layout analysis algorithms, focusing on its capability to improve dataset quality and object detection performance.",
isbn="978-3-031-70533-5"
}