[MEI-L] Seeking help with Audiveris engine/Tesseract

Kijas, Anna E Anna.Kijas at tufts.edu
Sat Mar 21 19:27:34 CET 2020


Hi Max and all,

Thank you for all your helpful suggestions! Good news: I was finally able to get Tesseract to run correctly. The problem was that the training data was not in the correct folder, so thankfully something simple, but frustrating!

Best,
Anna


Please note:  Lilly Music Library librarian & staff are working remotely, beginning March 13, 2020, because of COVID-19<https://coronavirus.tufts.edu/>. Information about library services and support available during this time is available here<https://tischlibrary.tufts.edu/>. Meetings and consultations will be conducted over Zoom.

Anna Kijas
Head, Lilly Music Library
Granoff Music Center
Tufts University
20 Talbot Avenue, Medford, MA 02155
Pronouns: she, her, hers
Book an appointment<https://tufts.libcal.com/appointments/kijas/lilly> | (617) 627-2846

From: mei-l <mei-l-bounces+anna.kijas=tufts.edu at lists.uni-paderborn.de> on behalf of Max Poliakovski <maximumspatium at googlemail.com>
Reply-To: Music Encoding Initiative <mei-l at lists.uni-paderborn.de>
Date: Saturday, March 21, 2020 at 12:47 PM
To: Music Encoding Initiative <mei-l at lists.uni-paderborn.de>
Subject: Re: [MEI-L] Seeking help with Audiveris engine/Tesseract

Hi Anna,

Well, on a Mac you'll need to install everything manually. First of all, download a couple of language files from https://github.com/tesseract-ocr/tessdata

Please keep in mind that the English language file is required, while all others are optional. Let's assume you want to use English and German. Then you need to download the following files:

eng.traineddata
deu.traineddata

Copy them into a directory of your choice and rename this directory to "tessdata".

To run Audiveris, open your terminal and invoke the following commands. The path in the first command is a placeholder and needs to be set to the actual path of your tessdata directory:

export TESSDATA_PREFIX=/Users/Anna/path/to/tessdata/directory/
./gradlew run
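
Before launching, it may be worth confirming that the language files are where the engine will look. The following is only a minimal sketch and not part of Audiveris or Tesseract: find_traineddata_dir is a hypothetical helper name, and looking both in the prefix itself and in a "tessdata" subdirectory of it is an assumption, made because the expected meaning of TESSDATA_PREFIX has differed between Tesseract versions.

```shell
# Hypothetical helper (not an Audiveris or Tesseract command):
# print the directory that holds <lang>.traineddata, looking in the
# given prefix and in a "tessdata" subdirectory of it.
find_traineddata_dir() {
    [ -n "$1" ] || return 1        # an empty prefix cannot match
    ftd_prefix="${1%/}"            # strip one trailing slash, if any
    ftd_lang="$2"
    for ftd_dir in "$ftd_prefix" "$ftd_prefix/tessdata"; do
        if [ -f "$ftd_dir/$ftd_lang.traineddata" ]; then
            printf '%s\n' "$ftd_dir"
            return 0
        fi
    done
    return 1
}

# Usage: check for the required English file under the current setting.
if found=$(find_traineddata_dir "${TESSDATA_PREFIX:-}" eng); then
    echo "eng.traineddata found in $found"
else
    echo "eng.traineddata not found under TESSDATA_PREFIX='${TESSDATA_PREFIX:-}'" >&2
fi
```

If the file is reported as not found, the export line above most likely points at the wrong directory.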

When the Audiveris window shows up, go to the "Book/Set Book Parameters..." menu. You should now be able to select one or more languages in the OCR language(s) section.

Please tell me how it goes.

Cheers
Max



On Sat, Mar 21, 2020 at 16:45, Kijas, Anna E <Anna.Kijas at tufts.edu> wrote:
Thank you, Tim. I’ll take a look at the links you shared. My guess is that it is my install of Tesseract language files that is the issue.

Max – I have read through the issues on Audiveris’ GitHub and did see the tickets regarding the 3.04 language files, but I can’t get it to work. Also, I am working on a Mac.

I’ll keep trying and see if I can figure this out. If anyone else has any tips, please send them my way! Thank you 😊

Anna

From: mei-l <mei-l-bounces at lists.uni-paderborn.de> on behalf of Max Poliakovski <maximumspatium at googlemail.com>
Reply-To: Music Encoding Initiative <mei-l at lists.uni-paderborn.de>
Date: Saturday, March 21, 2020 at 11:18 AM
To: Music Encoding Initiative <mei-l at lists.uni-paderborn.de>
Subject: Re: [MEI-L] Seeking help with Audiveris engine/Tesseract

Audiveris has an issue tracker: https://github.com/Audiveris/audiveris/issues

BTW, the Tesseract installation issue you're currently experiencing has been discussed (and solved) there several times in the past.
Just search the issues for "Tesseract".

I hope this helps.

Cheers
Max


On Sat, Mar 21, 2020 at 15:05, Kijas, Anna E <Anna.Kijas at tufts.edu> wrote:
Hello all,

I hope that everyone is doing well during this public health crisis. As I am stuck at home for the foreseeable future, I have a bit more time (no more driving to work!). I wanted to build and test out the Audiveris engine on my own machine to see if I can process sheet music and use the OMR to extract MusicXML. Has anyone worked, or is anyone currently working, with the Audiveris engine to extract music notation? Here is the link to the development guide: https://bacchushlg.gitbooks.io/audiveris-5-1/content/install/sources.html.

I have built the engine on my machine and have installed the dependencies (JDK 8, Git, Tesseract, FreeType Library), but I am running into an issue with Tesseract. The Audiveris engine requires that you use Tesseract 3.04 language data instead of 4.0 (it won’t work with the newer version). I installed Tesseract on my machine, but when I add the 3.04 language data it doesn’t see it, and I keep getting the following error messages. I believe they appear because I don’t have Tesseract set up correctly and/or don’t have the right version of the language files.

2020-03-21 10:00:33,027 WARN  [IMSLP273329]            TesseractOrder 166  | Could not initialize Tesseract with lang deu+eng+fra
2020-03-21 10:00:33,031 WARN  [IMSLP273329]                 SheetStub 845  | Error in performing [SCALE, GRID, HEADERS, STEM_SEEDS, BEAMS, LEDGERS, HEADS, STEMS, REDUCTION, CUE_BEAMS, TEXTS, MEASURES, CHORDS, CURVES, SYMBOLS, LINKS, RHYTHMS, PAGE] java.util.concurrent.ExecutionException: java.lang.NullPointerException
java.util.concurrent.ExecutionException: java.lang.NullPointerException
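
The warning names the language spec deu+eng+fra, which suggests three separate .traineddata files are being requested. A rough way to see which of them are absent from a given directory is sketched below. This is not an Audiveris or Tesseract tool: missing_traineddata is a hypothetical helper, and /path/to/tessdata is a placeholder to be replaced with the real directory.

```shell
# Hypothetical helper: given a data directory and a "+"-separated
# language spec, print every <lang>.traineddata file that is absent.
missing_traineddata() {
    mt_dir="$1"
    mt_spec="$2"
    # Turn the spec into one language code per line, then test each file.
    printf '%s\n' "$mt_spec" | tr '+' '\n' | while IFS= read -r mt_code; do
        [ -n "$mt_code" ] || continue
        [ -f "$mt_dir/$mt_code.traineddata" ] || printf '%s\n' "$mt_code.traineddata"
    done
}

# Usage: /path/to/tessdata is a placeholder, not a real location.
missing_traineddata "/path/to/tessdata" "deu+eng+fra"
```

An empty result means every requested file is present in that directory. It does not show whether the files are the 3.04 versions.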

If anyone is able to provide some assistance, please let me know.

Thanks!
Anna

_______________________________________________
mei-l mailing list
mei-l at lists.uni-paderborn.de
https://lists.uni-paderborn.de/mailman/listinfo/mei-l