Mercurial > hgrepos > Python2 > PyMuPDF
diff mupdf-source/docs/reference/common/stext-options.md @ 2:b50eed0cc0ef upstream
ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4.
The directory name has changed: no version number in the expanded directory now.
| author | Franz Glasner <fzglas.hg@dom66.de> |
|---|---|
| date | Mon, 15 Sep 2025 11:43:07 +0200 |
| parents | |
| children |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/mupdf-source/docs/reference/common/stext-options.md Mon Sep 15 11:43:07 2025 +0200 @@ -0,0 +1,61 @@ +# Structured Text Options + +preserve-images +: Keep images in output + +preserve-ligatures +: Do not expand ligatures into constituent characters + +preserve-spans +: Do not merge spans on the same line + +preserve-whitespace +: Do not convert all whitespace into space characters + +inhibit-spaces +: Don't add spaces between gaps in the text + +paragraph-break +: Break blocks at paragraph boundaries + +dehyphenate +: Attempt to join up hyphenated words + +ignore-actualtext +: Do not apply ActualText replacements + +use-cid-for-unknown-unicode +: Use character code if unicode mapping fails + +use-gid-for-unknown-unicode +: Use glyph index if unicode mapping fails + +accurate-bboxes +: Calculate char bboxes from the outlines + +accurate-ascenders +: Calculate ascender/descender from font glyphs + +accurate-side-bearings +: Expand character bboxes to completely include width of glyphs + +collect-styles +: Attempt to detect text features (fake bold, strikeout, underlined etc) + +clip +: Do not include text that is completely clipped + +clip-rect=x0:y0:x1:y1 +: Specify clipping rectangle within which to collect content + +structured +: Collect structure markup + +vectors +: Include vector bboxes in output + +segment +: Attempt to segment the page + +table-hunt +: Hunt for tables within a (segmented) page
