This chapter describes how to set up repositories of Glyph samples and how to carefully filter the global repository so that it can be used for training.
Note: To use this feature, make sure the advanced
SAMPLES topic is activated.
If not, select the
SAMPLES topic in the
Tools | Advanced Topics... menu
(and restart the application).
Audiveris 5.1 basic classifier is a Glyph classifier, which means that it needs to be trained with glyph samples (a sample is basically a glyph with the related shape).
There is an Audiveris 5.1 Google drive located at https://drive.google.com/drive/folders/1yS8DDSZjn3kWKkHvgJqyAEGwnOu0Gpc_?usp=sharing
This folder contains several files:
||4 MB||~15000 samples|
||453 MB||~3500 sheet images|
||0.3 MB||Resulting trained model|
samples.zip file is the
It's the one that has been used to train the Glyph classifier provided with
Begin by downloading this archive (without expanding it) into your own
(see its precise location in Folders section).
You will then be able to augment this collection on your own.
You can also download the
images.zip file which is not mandatory for training, but which will
help you see most samples within their sheet context.
Sampling a Sheet or a Book
After perhaps some manual corrections, when you are really satisfied with all the glyphs recognized
in a given sheet, you can save the sheet data as training samples.
This is done via the pulldown
Sheet | Sample sheet Symbols menu.
You can also use the pulldown
Book | Sample book Symbols menu if you are comfortable with all the
Data is saved under two zip archives located in the book folder.
In say the
foo book folder, there will be:
foo-samples.zipfor the collection of samples (this is the book sample repository)
foo-images.zipfor the containing sheet images.
It is a good practice to work on one book at a time, and only merge a book repository into the global repository when the book data has been thoroughly verified and its samples carefully filtered.
The purpose of the Sample Repository dialog is to provide a user interface to visually review some or all of the various SAMPLES which could be used for training the classifier.
The main objective is to easily identify samples which have been assigned a wrong shape. Using them as part of the training base could severely impact Audiveris recognition efficiency. So the strategy, when such a wrong sample has been identified, is to remove it from the repository or to assign it a correct shape.
To work on the GLOBAL repository, use the
Tools | Browse Global Repository... pulldown menu.
It will work on the
images.zip files if any are found in your own
folder (see Folders section).
To work on a book repository, use the
Book | View Book Repository... pulldown menu.
You will be able to pick and merge book repositories into the GLOBAL repository later on,
because the training process uses only the GLOBAL repository.
Either way, the repository user interface is the same:
The repository interface is organized as follows:
- Left column:
- Shapes selector
- Middle column:
- Sheets selector
- Sample board
- Classifier board
- Right column:
- Samples selector
- Sample context
Initially, all panes are empty except the
Sheetsselector which appears populated with sheets names.
These are the sheets available in the underlying repository (either the book sheets for a book repository, or all registered sheets so far for the global repository).
There is a special sheet, named
ALL_FONT_BASED_SYMBOLS, which is not a real sheet, but gathers all the synthetic samples built from Audiveris musical font.
You can select one or several sheets in the Sheets selector.
Shapesselector gets populated as soon as sheets are selected.
You can now select the shapes of interest.
Samplesselector gets populated as soon as shapes get selected.
The samples are gathered by shape. In a shape collection, via a right-click, you can sort samples by width, height, height or grade. The synthetic sample (there is exactly one of it in each shape collection) is displayed with a green background.
Only one sample can be selected at a time.
If an image is available for the sheet which contains the sample at hand, the Sample
contextpanel displays the selected sample in its sheet context.
This can be helpful for visual checking.
The selected sample can be:
- Removed from the repository.
This is done by typing the
DELETEkey or clicking on the
Removebutton in the Sample board or selecting the
Removeitem in the sample right-click menu.
- Assigned to a new shape.
This is done by clicking on the
Assign tobutton in the Sample board or selecting the
Assign toitem in the sample right-click menu, and then selecting the new shape.
When you are satisfied with a book repository you can push its content to the global repository.
You do so from the book repository interface, by selecting the pulldown menu
Repository | Push to Global.
If you leave a repository with unsaved modifications, you will be prompted to save them or to forget them.