Skip to content

CLI Reference

This page provides documentation for our command line tools.

docling

Usage:

docling [OPTIONS] source

Options:

Name Type Description Default
--from choice (docx | pptx | html | image | pdf | asciidoc | md | xlsx) Specify input formats to convert from. Defaults to all formats. None
--to choice (md | json | text | doctags) Specify output formats. Defaults to Markdown. None
--ocr / --no-ocr boolean If enabled, the bitmap content will be processed using OCR. True
--force-ocr / --no-force-ocr boolean Replace any existing text with OCR generated text over the full content. False
--ocr-engine choice (easyocr | tesseract_cli | tesseract) The OCR engine to use. OcrEngine.EASYOCR
--ocr-lang text Provide a comma-separated list of languages used by the OCR engine. Note that each OCR engine has different values for the language names. None
--pdf-backend choice (pypdfium2 | dlparse_v1 | dlparse_v2) The PDF backend to use. PdfBackend.DLPARSE_V1
--table-mode choice (fast | accurate) The mode to use in the table structure model. TableFormerMode.FAST
--artifacts-path path If provided, the location of the model artifacts. None
--abort-on-error / --no-abort-on-error boolean If enabled, the bitmap content will be processed using OCR. False
--output path Output directory where results are saved. .
--verbose, -v integer Set the verbosity level. -v for info logging, -vv for debug logging. 0
--version boolean Show version information. None
--help boolean Show this message and exit. False