view mupdf-source/thirdparty/tesseract/doc/set_unicharset_properties.1.asc @ 40:aa33339d6b8a upstream

ADD: MuPDF v1.26.10: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.5.
author Franz Glasner <fzglas.hg@dom66.de>
date Sat, 11 Oct 2025 11:31:38 +0200
parents b50eed0cc0ef
children
line wrap: on
line source

SET_UNICHARSET_PROPERTIES(1)
============================
:doctype: manpage

NAME
----
set_unicharset_properties - set  properties about the unichars

SYNOPSIS
--------
*set_unicharset_properties*  --U 'input_unicharsetfile'  --script_dir '/path/to/langdata'   --O 'output_unicharsetfile'

DESCRIPTION
-----------
set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.

OPTIONS
-------

'--script_dir /path/to/langdata'::
	(Input) Specify the location of directory for universal script unicharsets and font xheights  (type:string default:)

'--U unicharsetfile'::
	(Input) Specify the location of the unicharset to load as input.

'--O unicharsetfile'::
	(Output) Specify the location of the unicharset to be written with updated properties.

HISTORY
-------
set_unicharset_properties(1) was first made available for tesseract version 3.03.

RESOURCES
---------
Main web site: <https://github.com/tesseract-ocr> +
Information on training: <https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html>

SEE ALSO
--------
tesseract(1)

COPYING
-------
Copyright \(C) 2012 Google, Inc.
Licensed under the Apache License, Version 2.0

AUTHOR
------
The Tesseract OCR engine was written by Ray Smith and his research groups
at Hewlett Packard (1985-1995) and Google (2006-2018).