Home | << 1 >> |
Record | |||||
---|---|---|---|---|---|
Author | Pau Torras; Arnau Baro; Alicia Fornes; Lei Kang | ||||
Title | Improving Handwritten Music Recognition through Language Model Integration | Type | Conference Article | ||
Year | 2022 | Publication | 4th International Workshop on Reading Music Systems (WoRMS2022) | Abbreviated Journal | |
Volume | Issue | Pages | 42-46 | ||
Keywords | optical music recognition; historical sources; diversity; music theory; digital humanities | ||||
Abstract | Handwritten Music Recognition, especially in the historical domain, is an inherently challenging endeavour; paper degradation artefacts and the ambiguous nature of handwriting make recognising such scores an error-prone process, even for the current state-of-the-art Sequence to Sequence models. In this work we propose a way of reducing the production of statistically implausible output sequences by fusing a Language Model into a recognition Sequence to Sequence model. The idea is leveraging visually-conditioned and context-conditioned output distributions in order to automatically find and correct any mistakes that would otherwise break context significantly. We have found this approach to improve recognition results to 25.15 SER (%) from a previous best of 31.79 SER (%) in the literature. | ||||
Address | November 18, 2022 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | WoRMS | ||
Notes | DAG; 600.121; 600.162; 602.230 | Approved | no | ||
Call Number | Admin @ si @ TBF2022 | Serial | 3735 | ||
Permanent link to this record |