comparison mupdf-source/thirdparty/tesseract/doc/ambiguous_words.1.asc @ 2:b50eed0cc0ef upstream

ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4. The directory name has changed: no version number in the expanded directory now.
author Franz Glasner <fzglas.hg@dom66.de>
date Mon, 15 Sep 2025 11:43:07 +0200
parents
children
comparison
equal deleted inserted replaced
1:1d09e1dec1d9 2:b50eed0cc0ef
1 AMBIGUOUS_WORDS(1)
2 ==================
3 :doctype: manpage
4
5 NAME
6 ----
7 ambiguous_words - generate sets of words Tesseract is likely to find ambiguous
8
9 SYNOPSIS
10 --------
11 *ambiguous_words* [-l lang] 'TESSDATADIR' 'WORDLIST' 'AMBIGUOUSFILE'
12
13 DESCRIPTION
14 -----------
15 ambiguous_words(1) runs Tesseract in a special mode, and for each word
16 in word list, produces a set of words which Tesseract thinks might be
17 ambiguous with it. 'TESSDATADIR' must be set to the absolute path of
18 a directory containing 'tessdata/lang.traineddata'.
19
20 SEE ALSO
21 --------
22 tesseract(1)
23
24 COPYING
25 -------
26 Copyright \(C) 2012 Google, Inc.
27 Licensed under the Apache License, Version 2.0
28
29 AUTHOR
30 ------
31 The Tesseract OCR engine was written by Ray Smith and his research groups
32 at Hewlett Packard (1985-1995) and Google (2006-2018).