CLI Reference

This page provides documentation for our command line tools.

docling

Usage:

docling [OPTIONS] source

Options:

Name	Type	Description	Default
`--from`	choice (`docx` \| `pptx` \| `html` \| `image` \| `pdf` \| `asciidoc` \| `md` \| `xlsx`)	Specify input formats to convert from. Defaults to all formats.	None
`--to`	choice (`md` \| `json` \| `text` \| `doctags`)	Specify output formats. Defaults to Markdown.	None
`--ocr` / `--no-ocr`	boolean	If enabled, the bitmap content will be processed using OCR.	`True`
`--force-ocr` / `--no-force-ocr`	boolean	Replace any existing text with OCR generated text over the full content.	`False`
`--ocr-engine`	choice (`easyocr` \| `tesseract_cli` \| `tesseract`)	The OCR engine to use.	`OcrEngine.EASYOCR`
`--ocr-lang`	text	Provide a comma-separated list of languages used by the OCR engine. Note that each OCR engine has different values for the language names.	None
`--pdf-backend`	choice (`pypdfium2` \| `dlparse_v1` \| `dlparse_v2`)	The PDF backend to use.	`PdfBackend.DLPARSE_V1`
`--table-mode`	choice (`fast` \| `accurate`)	The mode to use in the table structure model.	`TableFormerMode.FAST`
`--artifacts-path`	path	If provided, the location of the model artifacts.	None
`--abort-on-error` / `--no-abort-on-error`	boolean	If enabled, the bitmap content will be processed using OCR.	`False`
`--output`	path	Output directory where results are saved.	`.`
`--verbose`, `-v`	integer	Set the verbosity level. -v for info logging, -vv for debug logging.	`0`
`--version`	boolean	Show version information.	None
`--help`	boolean	Show this message and exit.	`False`