view mupdf-source/thirdparty/tesseract.txt @ 2:b50eed0cc0ef upstream
ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4.
The directory name has changed: no version number in the expanded directory now.
| author | Franz Glasner <fzglas.hg@dom66.de> |
|---|---|
| date | Mon, 15 Sep 2025 11:43:07 +0200 |
If you want to build with Tesseract functionality, you need to run make
with a "tesseract=yes" argument.

You will also need a suitable set of traineddata for the languages you
wish to run. Only the LSTM engine (the latest and most accurate engine)
is built into Tesseract, so the traineddata contained within the
repository itself is no good.

Suitable data can be retrieved from either:

https://github.com/tesseract-ocr/tessdata_best

or

https://github.com/tesseract-ocr/tessdata_fast

e.g. wget https://github.com/tesseract-ocr/tessdata_fast/raw/master/eng.traineddata
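
The steps above can be sketched as a small shell script. This is only an illustration under stated assumptions: the helper names traineddata_url and fetch_traineddata are hypothetical, the ./tessdata directory is an arbitrary choice, and the URL pattern is simply generalised from the eng.traineddata example to other language codes. Pointing the TESSDATA_PREFIX environment variable at the download directory is the usual Tesseract convention and is assumed, not confirmed by this note, to be how the data is located at run time.

```shell
# Hypothetical helper: build the download URL for one language code,
# following the pattern of the eng.traineddata example (tessdata_fast).
traineddata_url() {
    printf 'https://github.com/tesseract-ocr/tessdata_fast/raw/master/%s.traineddata\n' "$1"
}

# Hypothetical helper: download traineddata for each listed language
# into the directory given as the first argument.
fetch_traineddata() {
    dest="$1"
    shift
    mkdir -p "$dest" || return 1
    for lang in "$@"; do
        wget -O "$dest/$lang.traineddata" "$(traineddata_url "$lang")" || return 1
    done
}

# Example usage (not executed when this file is sourced):
#   make tesseract=yes
#   fetch_traineddata ./tessdata eng
#   export TESSDATA_PREFIX="$PWD/tessdata"   # assumption: usual Tesseract convention
```

To use the tessdata_best data instead, the repository name in the URL would have to be changed accordingly; that variant is not shown here.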
