diff mupdf-source/docs/reference/common/stext-options.md @ 2:b50eed0cc0ef upstream

ADD: MuPDF v1.26.7: the MuPDF source as downloaded by a default build of PyMuPDF 1.26.4. The directory name has changed: no version number in the expanded directory now.
author Franz Glasner <fzglas.hg@dom66.de>
date Mon, 15 Sep 2025 11:43:07 +0200
parents
children
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/mupdf-source/docs/reference/common/stext-options.md	Mon Sep 15 11:43:07 2025 +0200
@@ -0,0 +1,61 @@
+# Structured Text Options
+
+preserve-images
+: Keep images in output
+
+preserve-ligatures
+: Do not expand ligatures into constituent characters
+
+preserve-spans
+: Do not merge spans on the same line
+
+preserve-whitespace
+: Do not convert all whitespace into space characters
+
+inhibit-spaces
+: Don't add spaces between gaps in the text
+
+paragraph-break
+: Break blocks at paragraph boundaries
+
+dehyphenate
+: Attempt to join up hyphenated words
+
+ignore-actualtext
+: Do not apply ActualText replacements
+
+use-cid-for-unknown-unicode
+: Use character code if unicode mapping fails
+
+use-gid-for-unknown-unicode
+: Use glyph index if unicode mapping fails
+
+accurate-bboxes
+: Calculate char bboxes from the outlines
+
+accurate-ascenders
+: Calculate ascender/descender from font glyphs
+
+accurate-side-bearings
+: Expand character bboxes to completely include width of glyphs
+
+collect-styles
+: Attempt to detect text features (fake bold, strikeout, underlined etc)
+
+clip
+: Do not include text that is completely clipped
+
+clip-rect=x0:y0:x1:y1
+: Specify clipping rectangle within which to collect content
+
+structured
+: Collect structure markup
+
+vectors
+: Include vector bboxes in output
+
+segment
+: Attempt to segment the page
+
+table-hunt
+: Hunt for tables within a (segmented) page