Abstract: In recent years, the increasing volume of audio-based content in academic and professional environments has necessitated efficient tools for automated note-taking and summarization. The ...
Large multimodal models are now extensively used worldwide, with the most powerful ones trained on massive, general-purpose datasets. Despite their rapid deployment, concerns persist regarding the ...