view mupdf-source/docs/reference/common/stext-options.md @ 46:7ee69f120f19 default tip

>>>>> tag v1.26.5+1 for changeset b74429b0f5c4
author Franz Glasner <fzglas.hg@dom66.de>
date Sat, 11 Oct 2025 17:17:30 +0200
parents b50eed0cc0ef
children
line wrap: on
line source

# Structured Text Options

preserve-images
: Keep images in output

preserve-ligatures
: Do not expand ligatures into constituent characters

preserve-spans
: Do not merge spans on the same line

preserve-whitespace
: Do not convert all whitespace into space characters

inhibit-spaces
: Don't add spaces between gaps in the text

paragraph-break
: Break blocks at paragraph boundaries

dehyphenate
: Attempt to join up hyphenated words

ignore-actualtext
: Do not apply ActualText replacements

use-cid-for-unknown-unicode
: Use character code if unicode mapping fails

use-gid-for-unknown-unicode
: Use glyph index if unicode mapping fails

accurate-bboxes
: Calculate char bboxes from the outlines

accurate-ascenders
: Calculate ascender/descender from font glyphs

accurate-side-bearings
: Expand character bboxes to completely include width of glyphs

collect-styles
: Attempt to detect text features (fake bold, strikeout, underlined etc)

clip
: Do not include text that is completely clipped

clip-rect=x0:y0:x1:y1
: Specify clipping rectangle within which to collect content

structured
: Collect structure markup

vectors
: Include vector bboxes in output

segment
: Attempt to segment the page

table-hunt
: Hunt for tables within a (segmented) page