Re: New version of TesseractOCR add-on


Rui Fontes
 

Yes, the documentation lists the languages it supports:

Afrikaans
Albanian
Amharic
Arabic
Armenian
Assamese
Azerbaijani (Latin)
Basque
Belarusian
Bengali
Bosnian
Breton
Bulgarian
Burmese
Catalan/Valencian
Cebuano
Cherokee
Chinese simplified
Chinese traditional
Corsican
Croatian
Czech
Danish
Deutsch (German)
Dhivehi
Dutch (Flemish)
Dzongkha
English
Esperanto
Estonian
Faroese
Filipino
Finnish
French
Galician
Georgian
Greek
Gujarati
Haitian
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Inuktitut
Irish
Italian
Javanese
Japanese
Kannada
Kazakh
Khmer (Central)
Kirghiz
Korean
Kurdish Kurmanji
Lao
Latin
Latvian
Lithuanian
Luxembourgish
Macedonian
Malay
Malayalam
Maltese
Maori
Marathi
Math / equation detection module
Mongolian
Nepali
Norwegian
Occitan
Oriya
Panjabi
Pashto
Persian
Polish
Portuguese
Quechua
Romanian/Moldavian
Russian
Sanskrit
Scottish Gaelic
Serbian (Latin)
Slovak
Slovenian
Sindhi
Sinhalese
Spanish
Sundanese
Swahili
Swedish
Syriac
Tajik
Tamil
Tatar
Telugu
Thai
Tibetan
Tigrinya
Tonga
Turkish
Uighur
Ukrainian
Urdu
Uzbek (Latin)
Vietnamese
Welsh
West Frisian
Yiddish
Yoruba

Best regards,

Rui Fontes
NVDA Portuguese team



At 08:19 on 15/07/2022, mukesh jain wrote:

Hello,
Does it support the Hindi language?
Thanks,
Mukesh

On 7/14/22, Rui Fontes <rui.fontes@...> wrote:
Hello!


Yes, you should select Tamil and, if necessary, other languages, and
place Tamil first in the list.


Regarding your problem navigating the results, it is strange, since the
result is a normal text file opened in Notepad...


I suppose the threading problem during updates is already solved...


I look forward to your further observations...


Best regards,

Rui Fontes
NVDA Portuguese team



At 15:24 on 14/07/2022, Ravindran V.S. wrote:
Hello,

Thank you for clarifying.
So, it will automatically pick the languages from the selected language
list, in order.
Meanwhile, I just encountered a new problem when I started my PC.
Once Windows loaded, the NVDA startup sound played, but there was no speech.
I tried restarting NVDA a few times with the shortcut key (Ctrl+Alt+N),
with the same result.
Then I loaded an alternative screen reader, and it announced "New version of
TesseractOCR add-on is available; do you want to install? Yes/No".
When I clicked "Yes", it took a short while, but NVDA did not restart.
However, the voice started to speak.
I restarted NVDA again, and the result was the same as before.
Then I saw this link and updated via this download.
Now there is no issue.
Only this afternoon, before shutting down the PC, I had checked the checkbox
to check for new updates in the TesseractOCR add-on settings in NVDA.
I use Windows 10 64-bit; NVDA 2022.1; Vocalizer voices.
Also, the OCR results in Tamil seem to be a bit unclear.
I mean, after the OCR, the results do not seem to be in an order that makes
the sentences meaningful.
And moving through them with the normal cursor is difficult; I have to use
object navigation.
This is my initial experience. I will give it more attempts and confirm.
Please advise if I am missing anything.
Thanks,
Ravi.
V.S.Ravindran.
"Excuses leads to failure!"

-----Original Message-----
From: nvda@nvda.groups.io <nvda@nvda.groups.io> On Behalf Of Rui Fontes
Sent: Thursday, July 14, 2022 5:39 PM
To: nvda@nvda.groups.io
Subject: Re: [nvda] New version of TesseractOCR add-on

Hello!


A new version, 2022.07.13, is already available.

The change log is:

- Corrected the threading for the update routine;
- Updated Turkish translation;
- Small code corrections...
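
For readers wondering what "threading for the update routine" can mean in practice, here is a minimal Python sketch of running an update check on a background thread so that a slow network request does not block startup. It is not the add-on's actual code: the function names and the three callables are hypothetical, and the version comparison is a plain inequality for simplicity.

```python
import threading


def check_for_update_in_background(fetch_latest_version, installed_version,
                                   on_update_available):
    """Run an update check on a daemon thread (hypothetical helper).

    fetch_latest_version: callable returning the latest version string.
    on_update_available: callable invoked with the new version string.
    """
    def worker():
        try:
            latest = fetch_latest_version()
        except Exception:
            # A failed check (e.g. no network) must never disturb the host.
            return
        if latest != installed_version:
            on_update_available(latest)

    thread = threading.Thread(target=worker, name="updateCheck", daemon=True)
    thread.start()
    return thread


# Usage with stand-in callables; the version strings come from this thread.
found = []
t = check_for_update_in_background(lambda: "2022.07.13", "2022.07", found.append)
t.join()
```

In a real GUI application the callback would normally be handed back to the main thread before showing any dialog (in wxPython, for instance, via wx.CallAfter).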


The direct link is:

https://github.com/ruifontes/tesseractOCR/releases/download/2022.07.13/tesseractOCR-2022.07.13.nvda-addon


In NVDA, under Preferences, Settings, you will find a TesseractOCR section.

There you can select the languages to be used in the recognition process
and their order...
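
As background on why the order matters: the Tesseract command line accepts several languages joined with "+" in its -l option, with the first one treated as the primary language. The Python sketch below builds such a command from an ordered list. It only illustrates the Tesseract CLI convention; how the add-on itself calls Tesseract is an assumption, and the helper names and file names are hypothetical.

```python
from typing import List, Sequence


def build_language_argument(codes: Sequence[str]) -> str:
    """Join Tesseract language codes in priority order, e.g. 'tam+eng'."""
    ordered: List[str] = []
    for code in codes:
        code = code.strip()
        # Keep the first occurrence only, so the user's order is preserved.
        if code and code not in ordered:
            ordered.append(code)
    if not ordered:
        raise ValueError("at least one language code is required")
    return "+".join(ordered)


def build_command(image_path: str, output_base: str, codes: Sequence[str],
                  tesseract_exe: str = "tesseract") -> List[str]:
    """Return an argument list suitable for subprocess.run (hypothetical helper)."""
    return [tesseract_exe, image_path, output_base,
            "-l", build_language_argument(codes)]


# Tamil first, English second ("tam" and "eng" are Tesseract's codes).
command = build_command("page.png", "result", ["tam", "eng"])
```

Each code needs its matching traineddata file installed, which is presumably what the add-on's language download list takes care of.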


Best regards,

Rui Fontes
NVDA Portuguese team


At 06:24 on 14/07/2022, Ravindran V.S. wrote:
Hello,
Just a question about the item below:
"- Introduced the option to select a second language to be used in OCR of
documents with multiple languages and a button to forget it;"

How can we select the second language? Where are these options, please?
I have added the required second language (Tamil) to the list.
I am running Win 10 64-bit; NVDA 2022.1; TesseractOCR add-on v:
2022.07 (downloaded from the direct link you shared).

Thanks,
Ravi
V.S.Ravindran.
"Excuses leads to failure!"

-----Original Message-----
From: nvda@nvda.groups.io <nvda@nvda.groups.io> On Behalf Of Rui Fontes
Sent: Wednesday, July 13, 2022 4:56 PM
To: nvda@nvda.groups.io
Subject: Re: [nvda] New version of TesseractOCR add-on

Hello!


From 2022.06 to 2022.06.27:

- Updated Tesseract from version 5.0 Alpha (64-bit) to 5.1 (32-bit);
- Added several more recognition languages;
- Introduced the option to select a second language to be used in OCR of
documents with multiple languages and a button to forget it;
- Introduced a new document type, "With auto-orientation", that allows
the OCR engine to rotate the image as necessary;
- Introduced beeps to signal the add-on is working;
- Corrected code so that the download languages combo box no longer
stays unpopulated;
- Corrected a problem with controlTypes roles preventing compatibility
with NVDA 2020.4;
- Added Russian translation.
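
A note on the "With auto-orientation" document type: in the Tesseract command line, orientation and script detection (OSD) is selected through the page segmentation mode, where --psm 3 is the default (fully automatic, no OSD) and --psm 1 is automatic segmentation with OSD. The Python sketch below maps a document type to that option. Which mode the add-on really uses is an assumption, and the mapping and the "Default" label are hypothetical.

```python
# Tesseract page segmentation modes (from the Tesseract CLI help):
#   1 = automatic page segmentation with OSD
#   3 = fully automatic page segmentation, no OSD (Tesseract's default)
PSM_BY_DOCUMENT_TYPE = {
    "Default": 3,                 # hypothetical label for the normal type
    "With auto-orientation": 1,   # name taken from the change log above
}


def psm_arguments(document_type):
    """Return the --psm arguments for a document type (hypothetical helper)."""
    psm = PSM_BY_DOCUMENT_TYPE.get(document_type, 3)  # fall back to default
    return ["--psm", str(psm)]


args = psm_arguments("With auto-orientation")
```

Modes that use OSD also need the osd.traineddata file to be present alongside the language data.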


From 2022.06.27 to 2022.07:

- Allow using any number of recognition languages;
- Complete code rewrite, including:
  - Split into various modules to make the code clearer;
  - Stopped using batch files;
- Allow recognizing files on the Desktop;
- Added translations to Spanish, French, Russian and Ukrainian.


Best regards,

Rui Fontes
NVDA Portuguese team


At 07:12 on 13/07/2022, Brian's Mail list account via groups.io
wrote:
What is the difference between the old and new ones?
Brian
















