view mupdf-source/thirdparty/tesseract/doc/ambiguous_words.1.asc @ 36:14b91574d44a v1.26.4+2

+++++ v1.26.4+2
author Franz Glasner <fzglas.hg@dom66.de>
date Mon, 22 Sep 2025 11:43:48 +0200
parents b50eed0cc0ef
children
line wrap: on
line source

AMBIGUOUS_WORDS(1)
==================
:doctype: manpage

NAME
----
ambiguous_words - generate sets of words Tesseract is likely to find ambiguous

SYNOPSIS
--------
*ambiguous_words* [-l lang] 'TESSDATADIR' 'WORDLIST' 'AMBIGUOUSFILE'

DESCRIPTION
-----------
ambiguous_words(1) runs Tesseract in a special mode, and for each word
in word list, produces a set of words which Tesseract thinks might be
ambiguous with it.   'TESSDATADIR' must be set to the absolute path of
a directory containing 'tessdata/lang.traineddata'.

SEE ALSO
--------
tesseract(1)

COPYING
-------
Copyright \(C) 2012 Google, Inc.
Licensed under the Apache License, Version 2.0

AUTHOR
------
The Tesseract OCR engine was written by Ray Smith and his research groups
at Hewlett Packard (1985-1995) and Google (2006-2018).