comparison mupdf-source/thirdparty/tesseract.txt @ 2:b50eed0cc0ef upstream

ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4. The directory name has changed: no version number in the expanded directory now.
author Franz Glasner <fzglas.hg@dom66.de>
date Mon, 15 Sep 2025 11:43:07 +0200
parents
children
comparison
equal deleted inserted replaced
1:1d09e1dec1d9 2:b50eed0cc0ef
1 If you want to build with Tesseract functionality, you need to run make
2 with a "tesseract=yes" argument.
3
4 You will also need a suitable set of traineddata for the languages you
5 wish to run. Only the LSTM engine (the latest and most accurate engine)
6 is built into Tesseract, so the traineddata contained within the
7 repository itself is no good.
8
9 Suitable data can be retrieved from either:
10
11 https://github.com/tesseract-ocr/tessdata_best
12
13 or
14
15 https://github.com/tesseract-ocr/tessdata_fast
16
17 e.g.
18
19 wget https://github.com/tesseract-ocr/tessdata_fast/raw/master/eng.traineddata