Document conversion with the GdPicture CLI

The Document Automation Server (DAS) command-line interface (CLI) also supports the GdPicture module.

Example:

1. OCR an image PDF to create a searchable PDF
"c:\Aquaforest\Autobahn DX\bin\autobahndx.exe" /source="C:\ADX Demo\In\PDF\File\US2007246939A1.pdf" /sourcetype=file /target="C:\ADX Demo\Output" /outputtype=PDF /operation=ocrpdfgd /ocr=true /Tempfolder="C:\ADX Demo\Temp" /Dictionary="eng" /DictionaryPath="D:\Aquaforest\Autobahn DX\distribution\gdpicture\ocr" /PageRange="\*" /DPI="300" /Output=pdf /Overwrite /logfile="C:\ADX Demo\Log\LogFile.log" /debug

Validate PDF/A

This is used with validatepdfa.

ParameterNotes
/operationUse the validatepdfa operation.
/debugLog additional information.
/pdfaoptionPDF/A option. Select the values from the PDF/A options table below.
/ErrorFolderPath for error files.
/SourceSource file to be validated.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

PDF/A options

ValueDescription
PDFA-1aThe PDF conformance level is PDF/A-1a.
PDFA-1bThe PDF conformance level is PDF/A-1b.
PDFA-2aThe PDF conformance level is PDF/A-2a.
PDFA-2bThe PDF conformance level is PDF/A-2b.
PDFA-2uThe PDF conformance level is PDF/A-2u.
PDFA-3aThe PDF conformance level is PDF/A-3a.
PDFA-3bThe PDF conformance level is PDF/A-3b.
PDFA-3uThe PDF conformance level is PDF/A-3u.
PDFA-4The PDF conformance level is PDF/A-4.
PDFA-4eThe PDF conformance level is PDF/A-4e.
PDFA-4fThe PDF conformance level is PDF/A-4f.

Linearize PDF

This is used with linearizepdf.

ParameterNotes
/operationUse the linearizepdf operation.
/debugLog additional information.
/EnableCompressionAdd this parameter if you want to compress the PDF document during each save process.
/PackDocumentAdd this parameter to pack the output PDF when saved to reduce its size.
/SourceSource file to be linearized.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Convert any file to PDF

This is used with convertany.

ParameterNotes
/operationUse the convertany operation.
/debugLog additional information.
/ConvertAttachmentsConvert any attachments.
/AttachAttachmentsAttach converted attachments rather than merge attachments.
/EmailPageHeight="num"Page height in points.
/EmailPageWidth="num"Page width in points.
/EmailPageMarginBottom="num"Page margin bottom in points.
/EmailPageMarginLeft="num"Page margin left in points.
/EmailPageMarginRight="num"Page margin right in points.
/EmailPageMarginTop="num"Page margin top in points.
/EmailPreferOnePageAdd to output document as one page.
/EnableICCWhen present, specifies the International Color Consortium (ICC) profile should be preserved during the conversion if it’s present in the loaded document.
/HtmlEmulationType="num"Specifies a type of media to emulate.
- No emulation
- Print
- Screen
/HtmlPageHeight="num"Page height in points.
/HtmlPageWidth="num"Page width in points.
/HtmlPageMarginBottom="num"Page margin bottom in points.
/HtmlPageMarginLeft="num"Page margin left in points.
/HtmlPageMarginRight="num"Page margin right in points.
/HtmlPageMarginTop="num"Page margin top in points.
/HtmlPreferOnePageInclude to specify that the output document should contain a single page.
/HtmlPreferCSSPageSizeInclude to give any CSS @page size declared in the page priority over what is declared in HtmlPageWidth and HtmlPageHeight. If not included, the renderer will scale the content to fit the paper size.
/LoadOnlyOnePageSpecifies that all executed actions with the loaded document will be processed using only the first page of the document.
/PageRange="range"Range of pages to OCR; use * for all pages.
/PdfBitonalImageCompression=”ID”Sets the scheme to be used to compress bitonal image data when converting/saving the currently loaded document to PDF format.
- 0: None
- 1: Flate
- 2: CCITT4
- 3: JPEG
- 4: JBIG2
- 5: JPEG2000
/JBIG2PMSThresholdSpecifies the threshold value for the JBIG2 encoder pattern matching and substitution between 0 and 1. Any number lower than 1 may lead to lossy compression. Default value is 0.85.
/PdfColorImageCompressionSets the scheme to be used to compress color image data when converting/saving the currently loaded document to PDF format.
/PdfImageQualitySets the level of quality used to compress images with a lossy compression scheme, which are embedded in the newly produced PDF document when converting/saving the currently loaded document to PDF format. It must be a value from 0 to 100. 0 means the worst quality and the best compression, 100 means the best quality and the worst compression.
/PDFUseDeflateOnJPEGInclude to specify if the converter will use additional deflate compression for JPEG images in PDF output.
/RasterizationDPI="DPI"Sets the rendering resolution to be used when converting vector content to raster content, if any is included in the currently loaded document.
/TiffEnableExifRotateInclude to specify that the TIFF encoder will use Exif rotate flag to handle page rotations.
/TimeoutMilliseconds="timeout in milliseconds"Specifies the timeout of the subsequent conversion process, in milliseconds. Default value is -1, which means no timeout.
/TxtFontBoldSpecifies whether the font used for the resulting document when converting from the source TXT file must have a bold style.
/TxtFontItalicSpecifies whether the font used for the resulting document when converting from the source TXT file must have an italic style.
/TxtFontFamily="font family"Specifies the name of the font to be used for the resulting document when converting from the source TXT file.
/TxtFontSize="font size"Specifies the text size, in points, to be used for the resulting document when converting from the source TXT file.
/TxtPageHeight="num"Page height in points.
/TxtPageWidth="num"Page width in points.
/TxtPageMarginBottom="num"Page margin bottom in points.
/TxtPageMarginLeft="num"Page margin left in points.
/TxtPageMarginRight="num"Page margin right in points.
/TxtPageMarginTop="num"Page margin top in points.
/SourceSource file to be converted to PDF.
/TargetTarget path for the output file. This can include the %FILENAME variable.
/TempFolderPath for temporary folder (must be unique).

Combine any file

This is used with combineany.

ParameterNotes
/operationUse the combineany operation.
/debugLog additional information.
/ConvertAttachmentsConvert any attachments.
/AttachAttachmentsAttach converted attachments rather than merge attachments.
/EmailPageHeight="num"Page height in points.
/EmailPageWidth="num"Page width in points.
/EmailPageMarginBottom="num"Page margin bottom in points.
/EmailPageMarginLeft="num"Page margin left in points.
/EmailPageMarginRight="num"Page margin right in points.
/EmailPageMarginTop="num"Page margin top in points.
/EmailPreferOnePageAdd to output document as one page.
/EnableICCWhen present, favor to preserve ICC profile, if present in the loaded document, during the conversion.
/HtmlEmulationType="num"Specifies a type of a media to emulate.
/HtmlPageHeight="num"Page height in points.
/HtmlPageWidth="num"Page width in points.
/HtmlPageMarginBottom="num"Page margin bottom in points.
/HtmlPageMarginLeft="num"Page margin left in points.
/HtmlPageMarginRight="num"Page margin right in points.
/HtmlPageMarginTop="num"Page margin top in points.
/HtmlPreferOnePageInclude to specify that the output document should contain a single page.
/HtmlPreferCSSPageSizeInclude to give any CSS @page size declared in the page priority over what is declared in HtmlPageWidth and HtmlPageHeight. If not included, the renderer will scale the content to fit the paper size.
/LoadOnlyOnePageSpecifies that all executed actions with the loaded document will be processed using only the first page of the document.
/PageRange="range"Range of pages to OCR; use * for all pages.
/PdfBitonalImageCompression=”ID”Sets the scheme to be used to compress bitonal image data when converting/saving the currently loaded document to PDF format.
- 0: None
- 1: Flate
- 2: CCITT4
- 3: JPEG
- 4: JBIG2
- 5: JPEG2000
/JBIG2PMSThresholdSpecifies the threshold value for the JBIG2 encoder pattern matching and substitution between 0 and 1. Any number lower than 1 may lead to lossy compression. Default value is 0.85.
/PdfColorImageCompressionSets the scheme to be used to compress color image data when converting/saving the currently loaded document to PDF format.
/PdfImageQualitySets the level of quality used to compress images with a lossy compression scheme, which are embedded in the newly produced PDF document when converting/saving the currently loaded document to PDF format. It must be a value from 0 to 100. 0 means the worst quality and the best compression, 100 means the best quality and the worst compression.
/PDFUseDeflateOnJPEGInclude to specify if the converter will use additional deflate compression for JPEG images in PDF output.
/RasterizationDPI="DPI"Sets the rendering resolution to be used when converting vector content to raster content, if any is included in the currently loaded document.
/TiffEnableExifRotateInclude to specify that the TIFF encoder will use Exif rotate flag to handle page rotations.
/TimeoutMilliseconds="timeout in milliseconds"Specifies the timeout of the subsequent conversion process, in milliseconds. Default value is -1, which means no timeout
/TxtFontBoldSpecifies whether the font used for the resulting document when converting from the source TXT file must have a bold style.
/TxtFontItalicSpecifies whether the font used for the resulting document when converting from the source TXT file must have an italic style.
/TxtFontFamily="font family"Specifies the name of the font to be used for the resulting document when converting from the source TXT file.
/TxtFontSize="font size"Specifies the text size, in points, to be used for the resulting document when converting from the source TXT file.
/TxtPageHeight="num"Page height in points.
/TxtPageWidth="num"Page width in points.
/TxtPageMarginBottom="num"Page margin bottom in points.
/TxtPageMarginLeft="num"Page margin left in points.
/TxtPageMarginRight="num"Page margin right in points.
/TxtPageMarginTop="num"Page margin top in points.
/SourceSource folder for the files to be combined.
/TargetTarget path for the output file. This can include the %DIRNAME variable.
/TempFolderPath for temporary folder (must be unique).

Combine PDFs

This is used with combinepdfs.

ParameterNotes
/operationUse the combinepdfs operation.
/debugLog additional information.
/NumericalOrdering=true
/NumericalOrdering=false
Choose whether the merging of file should happen in numerical order. For example, file1, file3, file11, file20, file101. Otherwise it will be ordered lexicographically. For example, file1, file101, file11, file20, file3.
/SourceSource folder for files to be combined.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Convert PDF to JPEG

This is used with pdftojpeg.

ParameterNotes
/operationUse the pdftojpeg operation.
/debugLog additional information.
/DPI=”num”The DPI resolution for rendering.
/Brightness="num"Brightness adjustment between -100 and 100.
/Contrast="num"Contrast adjustment between -100 and 100.
/Saturation="num"Saturation adjustment between -100 and 100.
/Gamma="num"Gamma adjustment between -100 and 100.
/Threshold1Bpp="num"If set, converts the output image to a 1-bit BW indexed color image specifying a threshold value. Pixel values less than the threshold will be turned black, while the values equal to or larger will be turned white. Value must be between 0 and 255.
/AutoDeskewWhen this flag is set, the image is skewed between ±15 degrees to obtain the best information from the image.
/CropBlackBordersDetects and removes margins consisting of black color around a GdPicture image.
/CropBlackBordersExDetects and sets to white; margins consisting of black color around a GdPicture image. This method doesn’t have the same behavior as CropBlackBorders. The black borders aren’t removed but are set to blank. Therefore, the image dimensions are kept the same.
/CropHeight="num"Specifies the page height, in pixels, of the resulting output document after cropping.
/CropWidth="num"Specifies the page width, in pixels, of the resulting output document after cropping.
/CropLeft ="num"Specifies the distance, in pixels, to crop from the top of the resulting document.
/CropTop="num"Specifies the distance, in pixels, to crop from the top of the resulting document.
/DespecklePerforms a 3×3 despeckle filter (also known as median filter).
/DespeckleMorePerforms a 5×5 despeckle filter (also known as median filter).
/EnableICMSpecifies if color correction is used for images embedding an ICC profile.
/RemoveHolePunchRemoves all punch holes situated on the margins. Be aware this can affect some barcode types.
/RemoveLines="num"Perform line removal.
- Horizontal is 1
- Vertical is 2
/ResizeNewHeight="num"New image height in pixel.
/ResizeNewWidth="num"New image width in pixel.
/ResizeInterpolateMode=”num"The interpolation mode specifies the algorithm used when the image is stretched. Following are the different modes and their corresponding values:
- Default: 0
- Low: 1
- High: 2
- Bilinear: 3
- Bicubic: 4
- Nearest Neighbor: 5
- High Quality Bilinear: 6
- High Quality Bicubic: 7
/RotateByAngleAdd parameter to rotate the image by an angle (see /RotationAngle).
/RotationAngle="num"Rotate clockwise by this many degrees when /RotateByAngle parameter is present.
/RotationType="num"Value determines rotate and/or flip action. Some actions result in the same final outcome, so the value is shared. Following are the different actions and their associated values:
- Rotate180 Flip None: 2
- Rotate180 Flip X: 6
- Rotate180 Flip XY: 0
- Rotate180 Flip Y: 4
- Rotate270 Flip None: 3
- Rotate270 Flip X: 7
- Rotate270 Flip XY: 1
- Rotate 270 Flip Y: 5
- Rotate 90 Flip None: 1
- Rotate 90 Flip X: 5
- Rotate 90 Flip XY: 3
- Rotate 90 Flip Y: 7
- Rotate None Flip None: 0
- Rotate None Flip X: 4
- Rotate None Flip XY: 2
- Rotate None Flip Y: 6
/PageRange="page range"Range of pages to OCR; use * for all pages.
/SourceSource file to be converted to JPEG.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Convert PDF to PNG

This is used with pdftopng.

ParameterNotes
/operationUse the pdftopng operation.

See the convert PDF to JPEG table above for the rest of the parameters.

Convert PDF to TIFF

This is used with pdftotiffgd.

ParameterNotes
/operationUse the pdftotiffgd operation.
/TiffCompressionFollowing are different values of this parameter and their description:
- 8: Deflate compression, as recognized by Adobe.
- 65536: Uses CCITT4 compression for bitonal image and LZW for others. This allows a mix of compression in a multipage TIFF document.
- 3: CCITT Group 3 fax encoding
-4: CCITT Group 4 fax encoding
- 32771: #1 w/ word alignment
- 32946: Deflate compression
- 7: %JPEG DCT compression
- 5: Lempel-Ziv and Welch
- 32766: NeXT 2-bit RLE
- 1: No compression
- 6: !6.0 JPEG
- 2: CCITT modified Huffman RLE
- 32809: ThunderScan RLE

See the convert PDF to JPEG table above for the rest of the parameters.

Convert PDF to text

This is used with pdftotext.

ParameterNotes
/operationUse the pdftotext operation.
/debugLog additional information.
/PageRangeRange of pages to OCR, use * for all pages.
/PageSeparator=”text”Text that will separate the pages of text.
/PageSeparatorPlacement=top
/PageSeparatorPlacement=bottom
The placement of the text separator. The separator text can either go at the top or bottom of the page text. Default is bottom.
/PreserveParagraphSpecifies that the text extraction engine must preserve text paragraphs. Default false.
/ParagraphSeparator=This property specifies the separator to be utilized for splitting paragraphs. It only takes effect when the PreserveParagraphs property is set to true.
/EnableOCREnables the use of the GdPicture OCR engine if the page in non-searchable.
/DictionaryPath=Path to GdPicture dictionary files.
/Dictionary=The codes of the chosen languages for OCR, seperated by +. For example, eng+fra
Codes can be found in the [OCR language codes][].
/CopyPdfIf the flag is present, it copies the input PDF to the output location.
/SourceSource file to be converted to text.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

OCR PDF to searchable PDF

This is used with ocrpdfgd.

ParameterNotes
/operationUse the ocrpdfgd operation.
/debugLog additional information.
/DictionaryList of language codes, linked by +. For example: /Dictionary="ENG+FRA".
/DictionaryPathPath to GdPicture dictionary files.
/DPIDPI used for rendering.
/PageRangeRange of pages to OCR; use * for all pages.
/ThreadLimitLimits the amount of pages that can be processed concurrently.
/ImageLimitIf the page has images that can be ignored (e.g. company logos) you can set an image limit. Any page that contains a number of images equal to or less than the limit will be skipped.
/IgnoreNonImagePagesSetting this flag will skip OCRing any pages that contain fewer images than the image limit threshold.
/IgnoreSearchablePagesSetting this flag will skip OCRing and pages that already contain visible text.
/IgnoreHiddenTextSetting this flag will skip OCRing and pages that already contain hidden text.
Note: IgnoreSearchablePages must also be set.
/CharacterThresholdIf a page has little text but you still want to OCR (e.g. bates numbers), the pages will only be skipped if the amount of characters is equal or above the threshold. If 0, any text found will cause the page to be skipped.
/AutorotateAutomatically rotate pages if the text doesn’t have the correct orientation.
/SourceSource file to be OCRed.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Create PDF portfolio

This is used with pdfportfolio.

ParameterNotes
/operationUse the pdfportfolio operation.
/debugLog additional information.
/PdfPortfolioTypeSee the PDF portfolio type table below for information on portfolio type, value, and description.
/SourceSource folder.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

PDF portfolio type

TypeValueDescription
Detail0The collection view is presented in details mode, with all information in the schema dictionary presented in a multicolumn format. This mode provides the most information to the user. The default value.
Hidden2The collection view is initially hidden, without preventing the user from obtaining a file list via explicit action.
Tile1The collection view is presented in tile mode, with each file in the collection denoted by a small icon and a subset of information from the schema dictionary. This mode provides top-level information about the file attachments to the user.

Convert PDF to PDF/A

This is used with gdpicturepdfa.

ParameterNotes
/operationUse the gdpicturepdfa operation.
/debugLog additional information.
/AllowRasterization=falseSet this parameter to false if you don’t want the conversion engine to use page rasterization when the direct conversion and vectorization aren’t possible or allowed.
/AllowVectorization=falseSet this parameter to false, if you don’t want the conversion engine to use page vectorization when the direct conversion isn’t possible. Vectorization produces vector-based graphic elements where applicable — for example, fonts and paths — and combines them with image resources.
/Version="PDFA Option"See the PDF/A versions table below for different PDF/A versions and their descriptions.
/SourceSource file.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

PDF/A versions

ValueDescription
PDF_A_1aThe PDF conformance level is PDF/A-1a.
PDF_A_1bThe PDF conformance level is PDF/A-1b.
PDF_A_2aThe PDF conformance level is PDF/A-2a.
PDF_A_2bThe PDF conformance level is PDF/A-2b.
PDF_A_2uThe PDF conformance level is PDF/A-2u.
PDF_A_3aThe PDF conformance level is PDF/A-3a.
PDF_A_3bThe PDF conformance level is PDF/A-3b.
PDF_A_3uThe PDF conformance level is PDF/A-3u.
PDF_A_4The PDF conformance level is PDF/A-4.
PDF_A_4eThe PDF conformance level is PDF/A-4e.
PDF_A_4fThe PDF conformance level is PDF/A-4f.

Compress PDF

This is used with gdpicturecompression.

ParameterNotes
/operationUse the gdpicturecompression operation.
/debugLog additional information.
/Author="text"Set PDF metadata author to supplied text.
/Producer="text"Set PDF metadata producer to supplied text.
/Producername="text"Set PDF metadata producer name to supplied text.
/Title="text"Set PDF metadata title to supplied text.
/RemoveAnnotationsWhen supplied, remove annotations.
/RemoveBlankPagesWhen supplied, remove blank pages.
/RemoveBookmarksWhen supplied, remove bookmarks.
/RemoveEmbeddedFilesWhen supplied, remove embedded files.
/RemoveFormFieldsWhen supplied, remove form fields.
/RemoveJavaScriptWhen supplied, remove JavaScript.
/RemoveMetadataWhen supplied, remove metadata. RemoveMetadata only removes XMP metadata in the document. PDF information (title, author, custom PDF information, etc.) If present, it isn’t touched.
/removePageThumbnailsWhen supplied, remove page thumbnails.
/PackFonts=falseSpecifies the PDF fonts won’t be packed to reduce their size.
/PackDocument=falseSpecifies the PDF won’t be packed to reduce its size.
/RecompressImages=falseSpecifies the images from the PDF won’t be recompressed.
/EnableMRCSpecifies MRC will be used for compressing the PDF contents.
/DownscaleResolutionMRC="resolution mrc"Specifies the resolution (DPI) for downscaling the background layer by the MRC engine, if any. Default value is 100.
/PreserveSmoothing=falseSpecifies that the MRC engine won’t preserve smoothing between different layers.
/ImageQuality="image quality"Specifies the quality to be used for the compression of the images from the PDF. See the image quality settings table below for further information.
/DownscaleImages=falseSpecifies the images from the PDF won’t be downscaled.
/DownscaleResolution=downscaleresolutionSpecifies the resolution to be used to downscale images. Default value is 150.
/EnableColorDetection=falseSpecifies whether color detection won’t be performed on the images from the PDF.
/EnableCharRepairSpecifies whether character repairing will be performed during bitonal conversion.
/EnableJPEG2000=falseSpecifies that JPEG2000 compression scheme won’t be used to compress the images of the PDF.
/EnableJBIG2=falseSpecifies that the JBIG2 compression scheme won’t be used to compress the bitonal images of the PDF.
/JBIG2PMSThreshold=thresholdSpecifies the threshold value for the JBIG2 encoder pattern matching and substitution between 0 and 1. Any number lower than 1 may lead to lossy compression. Default value is 0.85.
/SourceSource folder.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Image quality settings

DescriptionValue
Image Quality Low0
Image Quality Medium1
Image Quality High2
Image Quality Very High3
Image Quality Very Very High4
Image Quality Very Low-1

Smart redaction

This is used with smartredaction.

ParameterNotes
/operationUse the smartredaction operation.
/debugLog additional information.
/ResourcePathPath to GdPicture dictionary files.
/DictionaryList of language codes, linked by +. For example: /Dictionary="ENG+FRA".
/RedactCreditCardNumbers=falseSpecifies whether credit card numbers won’t be redacted.
/RedactEmailAddresses=falseSpecifies whether email addresses won’t be redacted.
/RedactPhoneNumbers=falseSpecifies whether phone numbers won’t be redacted.
/RedactVatIDs=falseSpecifies whether VAT IDs won’t be redacted.
/RedactURIs=falseSpecifies whether URIs won’t be redacted.
/RedactIBANs=falseSpecifies whether IBANs won’t be redacted.
/RedactVehicleIdentificationNumbers=falseSpecifies whether vehicle identification numbers won’t be redacted.
/RedactSocialSecurityNumbers=falseSpecifies whether Social Security Number won’t be redacted.
/RedactPostalAddresses=falseSpecifies whether postal addresses won’t be redacted.
/MarkColorThe color used to cover redacted information. The options are: Black, Transparent, Aqua, Teal, Navy, Yellow, Olive, Lime, Blue, Fuchsia, Purple, Red, Maroon, White, Gray, Silver, Green.
/DetectOrientation=falseSpecifies whether orientation won’t be detected automatically.
/PageRangeRange of pages to redact, use * for all pages.
/SourceSource file to be redacted.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Detect signatures

This is used with detectsignatures.

ParameterNotes
/operationUse the detectsignatures operation.
/debugLog additional information.
/SignedFilename=The signed filename mask — for example, %FILENAME-signed. Used for the file moved/copied to the SignedFilePath.
/SignedFilePath=pathThe full path (including file name) that the input file will be copy/moved to if a signature is detected.
/CreateSignedPath
/CreateSignedPath=false
/CreateSignedPath=true
If the parameter CreateSignedPath is present or set to true, the signed file location will be created if it cannot be found. If set to false, the file will fail if the signed location doesn’t exist.
/OverwriteSigned
/OverwriteSigned=false
/OverwriteSigned=true
If the parameter OverwriteSigned is present or set to true, when a signed file is moved or copied into the signed location, it will overwrite files of the same name. If set to false, the file will fail in this situation.
/SignedAction=actionSpecifies if signed files will be copied or moved to the signed location. Use Copy or Move as values.
/AttachSignedDocToOutput
/AttachSignedDocToOutput=false
/AttachSignedDocToOutput=true
If parameter AttachSignedDocToOutput is present or set to true, then the signed source file will be attached to the no longer signed output file.
Only applied if /SignedAction="copy" is set.
/SourceSource file to be checked for signatures.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Extract key value pairs

This is used with keyvaluepairs.

ParameterNotes
/operationUse the keyvaluepairs operation.
/debugLog additional information.
/DictionaryPathPath to GdPicture dictionary files.
/DictionaryThe codes of the chosen languages for OCR, separated by +. For example, eng+fra. Codes can be found in the [OCR language codes][].
/DPIDPI used for rendering.
/KVPFormatThe output formats are separated by JSON, CSV or XML.
/Autorotate
/Autorotate=false
Setting this to false will disable the ability to rotate pages if the text orientation isn’t correct. The default is true.
/TrimSymbolsSetting this to true will remove any symbols from the start/end of values, except for the hash # or period . symbols.
/IncludeKeyBoundingBoxSetting this to Yes will include the bounding box values for the key in the output.
/IncludeValueBoundingBoxSetting this to Yes will include the bounding box values for the value in the output.
/IncludePageNumberSetting this to Yes will include the page number of the key value pair in the output.
/IncludeConfidenceSetting this to Yes will include the confidence score of the key value pair in the output. Confidence is measured between 0 (no confidence) and 100 (full confidence).
/ConfidenceThresholdThe value of confidence (0 to 100) that a KVP must reach to be included in the output. Results under this confidence threshold will be discarded.
/IncludeTypeSetting this to Yes will include the data type of the key value pair in the output.
/ExpectedKeysThe path to a JSON file for the expected keys and synonyms.
/ExpectedKeysIsPath=trueFlag to show that the /ExpectedKeys flag is pointing to a JSON file location.
/SourceSource file to be checked for key-value pairs.
/TargetTarget path for the output file. No extension is needed, as extensions will be added based on the KVP output format.
/TempFolderPath for temporary folder (must be unique).

Pattern redaction

This is used with patternredaction.

ParameterNotes
/operationUse the patternredaction operation.
/debugLog additional information.
/Pattern="regex"A regex pattern. The input PDF will be searched for matches to this regex pattern, and any matches will be redacted.
/RedactedTermsPath=pathThe path to a text file containing a list of terms to redact. Each line will be treated as a pattern, and any matches will be redacted.
/CaseSensitiveDetermined whether or not the regex pattern matching should be case sensitive.
/Red=0The amount of red color to be used for the redaction region color. Use a value between 0 and 255.
/Green=0The amount of green color to be used for the redaction region color. Use a value between 0 and 255.
/Blue=0The amount of blue color to be used for the redaction region color. Use a value between 0 and 255.
/Alpha=255The transparency value of the resulting region color. Use the value between 0 (full transparency) and 255 (full opacity). The redacted text will be removed, and so won’t be behind the color layer if made transparent.
/SourceSource file to be redacted.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Pattern highlight

This is used with patternhighlight.

ParameterNotes
/operationUse the patternhighlight operation.
/Alpha=100The transparency value of the resulting region color. Use the value between 0 (full transparency) and 255 (full opacity).
The highlighted text will appear behind the highlight layer.

See the pattern redaction table above for the rest of the parameters.

Split PDF (GdPicture)

This is used with splitpdfgdpicture.

ParameterNotes
/operationUse the splitpdfgdpicture operation.
/debugLog additional information.
/RetainMetadata
/RetainMetadata=false
When set to true, generated files will include metadata (such as author and title) from the original file.
/SplitType=Single
/SplitType=Ranges
/SplitType=Repeatingranges
/SplitType=bookmarks
Sets the way that the input file will be split. One of:
- Split into single pages
- Split by ranges (See below)
- Split by repeating ranges (See below)
- Split by bookmarks
/SplitRange=rangeSet of page ranges separated by commas that defines which pages from the original should be extracted.
/RepeatEvery=numApply the page range to each set of page ranges pages within the document. For example if 2 to 4 is specified for page ranges, and 4 is specified as the repeating range, then the range is reapplied every 4 pages.
/RemoveUnusedResources
/RemoveUnusedResources=false
Removes unused resources from a PDF file to minimize file size.
/SourceSource file to be redacted.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Split PDF by barcode

This is used with splitbybarcode.

ParameterNotes
/operationUse the splitbybarcode operation.
/debugLog additional information.
/ReadQRCodeSet this to true to recognize QR code barcodes.
/ReadMicroQRSet this to true to recognize Micro QR barcodes.
/ReadDataMatrixSet this to true to recognize Data Matrix barcodes.
/ReadPDF417Set this to true to recognize PDF417 barcodes.
/ReadAztecSet this to true to recognize Aztec barcodes.
/ReadMaxiCodeSet this to true to recognize MaxiCode barcodes.
/ReadIndustrial2of5Set this to true to recognize Industrial 2 of 5 barcodes.
/ReadInverted2of5Set this to true to recognize Inverted 2 of 5 barcodes.
/ReadInterleaved2of5Set this to true to recognize Interleaved 2 of 5 barcodes.
/ReadIata2of5Set this to true to recognize Iata 2 of 5 barcodes.
/ReadMatrix2of5Set this to true to recognize Matrix 2 of 5 barcodes.
/ReadCodabarSet this to true to recognize Codabar barcodes.
/ReadBcdMatrixSet this to true to recognize BCD Matrix barcodes.
/ReadDataLogic2of5Set this to true to recognize Data Logic 2 of 5 barcodes.
/ReadCode128Set this to true to recognize Code128 barcodes.
/ReadCode93Set this to true to recognize Code93 barcodes.
/ReadCode39Set this to true to recognize Code39 barcodes.
/ReadEAN13Set this to true to recognize EAN-13 barcodes.
/ReadEAN8Set this to true to recognize EAN-8 barcodes.
/ReadUPCASet this to true to recognize UPCA barcodes.
/ReadUPCESet this to true to recognize UPCE barcodes.
/ReadADD5Set this to true to recognize ADD 5 barcodes.
/ReadADD2Set this to true to recognize ADD 2 barcodes.
/SplitRange=rangeSpecifies the page range to be scanned for barcodes. A value of * will scan every page for barcodes.
/Pattern="regex"A regular expression that each barcode will be compared against. Only barcodes that match the pattern will be returned.
/DPI=intDPI of TIFF images generated or converted from the source PDF file. These images are then scanned for barcodes.
/RetainMetadata
/RetainMetadata=false
When set to true, generated files will include metadata (such as author and title) from the original file.
/RemoveUnusedResources
/RemoveUnusedResources=false
Removes unused resources from a PDF file to minimize file size.
/Left=intX coordinate of the top-left point of the rectangle you want to recognize the barcode.
/Top=intY coordinate of the top-left point of the rectangle you want to recognize the barcode.
/Width=intWidth of the rectangle you want to recognize the barcode.
/Height=intHeight of the rectangle you want to recognize the barcode.
/SourceSource file to be redacted.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Pattern enumeration

This is used with patternenumeration.

ParameterNotes
/operationUse the patternenumeration operation.
/debugLog additional information.
/Pattern="regex"A regex pattern. The input PDF will be searched for matches to this regex pattern, and any matches will be enumerated.
/RedactedTermsPath=pathThe path to a text file containing a list of terms to enumerate. Each line will be treated as a pattern, and any matches will be enumerated.
/CaseSensitiveDetermines whether or not the regex pattern matching should be case sensitive.
/PassThroughDetermines if the source file should be copied to the output folder alongside the report.
/SourceSource file to be enumerated.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Get document information

This is used with getdocinformation.

ParameterNotes
/operationUse the getdocinformation operation.
/debugLog additional information.
/OutputFormatThe output format for the report:
- txt
- csv
- json
- xml
/PassThroughDetermines if the source file should be copied to the output folder alongside the report.
/SourceSource PDF file.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Convert PDF to Office

This is used with convertpdftooffice.

ParameterNotes
/operationUse the convertpdftooffice operation.
/debugLog additional information.
/OfficeFormatThe output format for the report:
- docx
- pptx
- xlsx
- svg
/EnableICCWhen present, specifies the ICC profile should be preserved during the conversion if it’s present in the loaded document.
/PageRangeRange of pages to convert. Use * for all pages.
/TimeoutMillisecondsSpecifies the timeout of the subsequent conversion process, in milliseconds. Default value is -1, which means no timeout.
/SourceSource file to be converted.
/TargetTarget path for the output file.
/TempFolderPath for temporary folder (must be unique).

Convert any file to Office

This is used with convertanytooffice.

ParameterNotes
/operationUse the convertanytooffice operation.
/debugLog additional information.
/EmailInjectHeaderSpecifies whether the email header should be injected in the output document.
/ConvertEmailAttachmentsToOfficeConvert any attachments to Office format.
/EmailAttachmentsFilterA regular expression that specifies the email attachments that will be picked up and converted. Files that don’t match will be ignored.
/EmailPageHeight="num"Page height in points.
/EmailPageWidth="num"Page width in points.
/EmailPageMarginBottom="num"Page margin bottom in points.
/EmailPageMarginLeft="num"Page margin left in points.
/EmailPageMarginRight="num"Page margin right in points.
/EmailPageMarginTop="num"Page margin top in points.
/EmailPreferOnePageAdd to output document as one page.
/EnableICCWhen present, specifies the ICC profile should be preserved during the conversion if it’s present in the loaded document.
/HtmlEmulationType="num"Specifies a type of media to emulate:
- 1 — No emulation
- 2 — Print
- 3 — Screen
/HtmlPageHeight="num"Page height in points.
/HtmlPageWidth="num"Page width in points.
/HtmlPageMarginBottom="num"Page margin bottom in points.
/HtmlPageMarginLeft="num"Page margin left in points.
/HtmlPageMarginRight="num"Page margin right in points.
/HtmlPageMarginTop="num"Page margin top in points.
/HtmlPreferOnePageInclude to specify that the output document should contain a single page.
/HtmlPreferCSSPageSizeInclude to give any CSS @page size declared in the page priority over what is declared in HtmlPageWidth and HtmlPageHeight. If not included, the renderer will scale the content to fit the paper size.
/LoadOnlyOnePageSpecifies that all executed actions with the loaded document will be processed using only the first page of the document.
/PageRange="range"Range of pages to OCR. Use * for all pages.
/PdfBitonalImageCompression="ID"Sets the scheme to be used to compress bitonal image data when converting/saving the currently loaded document to PDF format:
- 0 — None
- 1 — Flate
- 2 — CCITT4
- 3 — JPEG
- 4 — JBIG2
- 5 — JPEG2000
/JBIG2PMSThresholdSpecifies the threshold value for the JBIG2 encoder pattern matching and substitution between 0 and 1. Any number lower than 1 may lead to lossy compression. Default value is 0.85.
/PdfColorImageCompressionSets the scheme to be used to compress color image data when converting/saving the currently loaded document to PDF format.
/PdfImageQualitySets the level of quality used to compress images with a lossy compression scheme, which is embedded in the newly produced PDF document when converting/saving the currently loaded document to PDF format. It must be a value from 0 to 100. 0 means the worst quality and the best compression, while 100 means the best quality and the worst compression.
/PDFUseDeflateOnJPEGInclude to specify if the converter will use additional Deflate compression for JPEG images in PDF output.
/RasterizationDPI="DPI"Sets the rendering resolution to be used when converting vector content to raster content, if any is included in the currently loaded document.
/RenderSheetHeadersAndFootersSpecifies that the xls and xlsx headers and footers should be rendered. Affects XLSX/XLS input only.
/SplitExcelSheetsIntoPagesSpecifies that xls and xlsx sheets should be split into pages according to the PageSetup element of each sheet. Affects XLSX/XLS input only.
/SpreadsheetBottomMarginOverrideSpecifies the spreadsheet bottom margin height in millimeters. If height isn’t given or is negative, the margin specified in the document will be used instead. Affects XLSX/XLS input only.
/SpreadsheetLeftMarginOverrideSpecifies the spreadsheet left margin width in millimeters. If height isn’t given or is negative, the margin specified in the document will be used instead. Affects XLSX/XLS input only.
/SpreadsheetMaximumContentHeightPerSheetDecimal value indicating the maximum height of the sheet content, in millimeters. Maximum content height ignores header and footer height. Affects XLSX/XLS input only.
/SpreadsheetMaximumContentWidthPerSheetDecimal value indicating the maximum width of the sheet content, in millimeters. Maximum content width ignores margins. Affects XLSX/XLS input only.
/SpreadsheetPageHeightOverrideSpecifies the spreadsheet page height in millimeters. If height isn’t given or isn’t positive, the page height specified in the document will be used instead. Affects XLSX/XLS input only.
/SpreadsheetPageWidthOverrideSpecifies the spreadsheet page width in millimeters. If width isn’t given or isn’t positive, the page width specified in the document will be used instead. Affects XLSX/XLS input only.
/SpreadsheetRenderOnlyPrintAreaFor spreadsheet, specifies that for each sheet, only the print areas must be rendered. If no print area exists, the whole sheet will be rendered. Affects XLSX/XLS input only.
/SpreadsheetRightMarginOverrideSpecifies the spreadsheet right margin width in millimeters. If width isn’t given or isn’t positive, the page height specified in the document will be used instead. Affects XLSX/XLS input only.
/SpreadsheetTopMarginOverrideSpecifies the spreadsheet top margin height in millimeters. If width isn’t given or isn’t positive, the page width specified in the document will be used instead. Affects XLSX/XLS input only.
/TiffEnableExifRotateInclude to specify that the TIFF encoder will use the Exif rotate flag to handle page rotations.
/TimeoutMilliseconds="timeout in milliseconds"Specifies the timeout of the subsequent conversion process, in milliseconds. Default value is -1, which means no timeout.
/TxtFontBoldSpecifies whether the font used for the resulting document when converting from the source TXT file must have a bold style.
/TxtFontItalicSpecifies whether the font used for the resulting document when converting from the source TXT file must have an italic style.
/TxtFontFamily="font family"Specifies the name of the font to be used for the resulting document when converting from the source TXT file.
/TxtFontSize="font size"Specifies the text size, in points, to be used for the resulting document when converting from the source TXT file.
/TxtPageHeight="num"Page height in points.
/TxtPageWidth="num"Page width in points.
/TxtPageMarginBottom="num"Page margin bottom in points.
/TxtPageMarginLeft="num"Page margin left in points.
/TxtPageMarginRight="num"Page margin right in points.
/TxtPageMarginTop="num"Page margin top in points.
/SourceSource file to be converted to PDF.
/TargetTarget path for the output file. This can include the %FILENAME variable.
/TempFolderPath for temporary folder (must be unique).

Zip Folders

This is used with zipfolders.

ParameterNotes
/operationUse the zipfolders operation.
/debugLog additional information.
/ZipOptionChoose the output zip option:
- ziptoplevel — Create one top level zip file
- ziprecursive — Recursively zip each subfolder
- ziprecursivewithlimit — Recursively zip each subfolder up to a specified depth limit.
/SubfolderDepth="num"Depending on the zip option, this determines the depth limit to stop recursive zipping, or the specific level to zip at.
/CreateReportChoose if a report of the zip files should be created.
/ReportDirectoryThe full path of the report file directory. This path can include %DIRNAME (input folder name), %DATESTAMP (date), or %TIMESTAMP (time the report is created). If left blank, the report will be saved in the target location specified by /Target.
/ReportNameThe name of the report file without extension. This filename can include %DIRNAME (input folder name), %DATESTAMP (date), or %TIMESTAMP (time the report is created). If the path is left empty, it will be created with the format report-%TIMESTAMP.
/SourceSource folder to be zipped.
/TargetTarget path for the output zip files. This can include the %FILENAME variable.
/TempFolderPath for temporary folder (must be unique).

[ocr language codes]: (/guides/document-automation-server/document-conversion/autobahn-dx-command-line-interface/gdpicture-steps/#pdf-to-searchable-pdf-gdpicture