diff mupdf-source/thirdparty/tesseract.txt @ 2:b50eed0cc0ef upstream
ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4.
The directory name has changed: no version number in the expanded directory now.
| author | Franz Glasner <fzglas.hg@dom66.de> |
|---|---|
| date | Mon, 15 Sep 2025 11:43:07 +0200 |
| parents | |
| children | |
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/mupdf-source/thirdparty/tesseract.txt Mon Sep 15 11:43:07 2025 +0200
@@ -0,0 +1,19 @@
+If you want to build with Tesseract functionality, you need to run make
+with a "tesseract=yes" argument.
+
+You will also need a suitable set of traineddata for the languages you
+wish to run. Only the LSTM engine (the latest and most accurate engine)
+is built into Tesseract, so the traineddata contained within the
+repository itself is no good.
+
+Suitable data can be retrieved from either:
+
+  https://github.com/tesseract-ocr/tessdata_best
+
+or
+
+  https://github.com/tesseract-ocr/tessdata_fast
+
+e.g.
+
+  wget https://github.com/tesseract-ocr/tessdata_fast/raw/master/eng.traineddata
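
The download step described in the added file can also be scripted. The following is only a minimal sketch, not part of the changeset: the helper names `traineddata_url` and `fetch_traineddata` and the `tessdata` destination directory are hypothetical, and the `raw/master` path simply mirrors the wget example in the note.

```python
import urllib.request
from pathlib import Path

# Repository URLs are taken from the note; everything else here is illustrative.
REPOS = {
    "best": "https://github.com/tesseract-ocr/tessdata_best",
    "fast": "https://github.com/tesseract-ocr/tessdata_fast",
}


def traineddata_url(lang: str, variant: str = "fast") -> str:
    """Build the download URL for one language (hypothetical helper)."""
    if variant not in REPOS:
        raise ValueError(f"unknown variant: {variant!r}")
    # Same URL layout as the wget example in the note.
    return f"{REPOS[variant]}/raw/master/{lang}.traineddata"


def fetch_traineddata(lang: str, dest_dir: str = "tessdata",
                      variant: str = "fast") -> Path:
    """Download <lang>.traineddata into dest_dir (an assumed location)."""
    target = Path(dest_dir) / f"{lang}.traineddata"
    target.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(traineddata_url(lang, variant), str(target))
    return target
```

Calling `fetch_traineddata("eng")` would correspond to the wget line in the note. The note does not say where the file has to be placed for the build to find it, so the destination directory is left as a parameter.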
