-
Magi: Automatically transcribe comics into text and automatically generate scripts
The Visual Geometry Group at the Department of Engineering Science at the University of Oxford has developed a model called Magi that can automatically transcribe comic pages into text and generate scripts. The model enables fully automatic script generation by identifying panels, text blocks, and characters on comic pages. Its main functions include panel detection, identifying individual panels on a comic page, and text block detection, identifying text blocks within panels, which typically contain dialogue or narrative text. In addition, the model is able to detect character images on the page and cluster them according to their identity to distinguish between different characters. The Magi model can also associate text with speech…- 11.3k