Pipeline options
Pipeline options allow to customize the execution of the models during the conversion pipeline.
This includes options for the OCR engines, the table model as well as enrichment options which
can be enabled with do_xyz = True
.
This is an automatic generated API reference of the all the pipeline options available in Docling.
pipeline_options
Classes:
-
AcceleratorDevice
–Devices to run model inference
-
AcceleratorOptions
– -
EasyOcrOptions
–Options for the EasyOCR engine.
-
OcrEngine
–Enum of valid OCR engines.
-
OcrMacOptions
–Options for the Mac OCR engine.
-
OcrOptions
–OCR options.
-
PdfBackend
–Enum of valid PDF backends.
-
PdfPipelineOptions
–Options for the PDF pipeline.
-
PipelineOptions
–Base pipeline options.
-
RapidOcrOptions
–Options for the RapidOCR engine.
-
TableFormerMode
–Modes for the TableFormer model.
-
TableStructureOptions
–Options for the table structure.
-
TesseractCliOcrOptions
–Options for the TesseractCli engine.
-
TesseractOcrOptions
–Options for the Tesseract engine.
AcceleratorDevice
AcceleratorOptions
Bases: BaseSettings
Methods:
-
check_alternative_envvars
–Set num_threads from the "alternative" envvar OMP_NUM_THREADS.
Attributes:
-
device
(AcceleratorDevice
) – -
model_config
– -
num_threads
(int
) –
model_config
class-attribute
instance-attribute
model_config = SettingsConfigDict(
env_prefix="DOCLING_",
env_nested_delimiter="_",
populate_by_name=True,
)
num_threads
class-attribute
instance-attribute
num_threads: int = 4
check_alternative_envvars
classmethod
check_alternative_envvars(data: Any) -> Any
Set num_threads from the "alternative" envvar OMP_NUM_THREADS. The alternative envvar is used only if it is valid and the regular envvar is not set.
Notice: The standard pydantic settings mechanism with parameter "aliases" does not provide the same functionality. In case the alias envvar is set and the user tries to override the parameter in settings initialization, Pydantic treats the parameter provided in init() as an extra input instead of simply overwriting the evvar value for that parameter.
EasyOcrOptions
Bases: OcrOptions
Options for the EasyOCR engine.
Attributes:
-
bitmap_area_threshold
(float
) – -
confidence_threshold
(float
) – -
download_enabled
(bool
) – -
force_full_page_ocr
(bool
) – -
kind
(Literal['easyocr']
) – -
lang
(List[str]
) – -
model_config
– -
model_storage_directory
(Optional[str]
) – -
recog_network
(Optional[str]
) – -
use_gpu
(Optional[bool]
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
confidence_threshold
class-attribute
instance-attribute
confidence_threshold: float = 0.5
download_enabled
class-attribute
instance-attribute
download_enabled: bool = True
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
kind
class-attribute
instance-attribute
kind: Literal['easyocr'] = 'easyocr'
lang
class-attribute
instance-attribute
lang: List[str] = ['fr', 'de', 'es', 'en']
model_config
class-attribute
instance-attribute
model_config = ConfigDict(
extra="forbid", protected_namespaces=()
)
model_storage_directory
class-attribute
instance-attribute
model_storage_directory: Optional[str] = None
recog_network
class-attribute
instance-attribute
recog_network: Optional[str] = 'standard'
use_gpu
class-attribute
instance-attribute
use_gpu: Optional[bool] = None
OcrEngine
Bases: str
, Enum
Enum of valid OCR engines.
Attributes:
-
EASYOCR
– -
OCRMAC
– -
RAPIDOCR
– -
TESSERACT
– -
TESSERACT_CLI
–
EASYOCR
class-attribute
instance-attribute
EASYOCR = 'easyocr'
OCRMAC
class-attribute
instance-attribute
OCRMAC = 'ocrmac'
RAPIDOCR
class-attribute
instance-attribute
RAPIDOCR = 'rapidocr'
TESSERACT
class-attribute
instance-attribute
TESSERACT = 'tesseract'
TESSERACT_CLI
class-attribute
instance-attribute
TESSERACT_CLI = 'tesseract_cli'
OcrMacOptions
Bases: OcrOptions
Options for the Mac OCR engine.
Attributes:
-
bitmap_area_threshold
(float
) – -
force_full_page_ocr
(bool
) – -
framework
(str
) – -
kind
(Literal['ocrmac']
) – -
lang
(List[str]
) – -
model_config
– -
recognition
(str
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
framework
class-attribute
instance-attribute
framework: str = 'vision'
kind
class-attribute
instance-attribute
kind: Literal['ocrmac'] = 'ocrmac'
lang
class-attribute
instance-attribute
lang: List[str] = ['fr-FR', 'de-DE', 'es-ES', 'en-US']
model_config
class-attribute
instance-attribute
model_config = ConfigDict(extra='forbid')
recognition
class-attribute
instance-attribute
recognition: str = 'accurate'
OcrOptions
Bases: BaseModel
OCR options.
Attributes:
-
bitmap_area_threshold
(float
) – -
force_full_page_ocr
(bool
) – -
kind
(str
) – -
lang
(List[str]
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
kind
instance-attribute
kind: str
lang
instance-attribute
lang: List[str]
PdfBackend
Bases: str
, Enum
Enum of valid PDF backends.
Attributes:
-
DLPARSE_V1
– -
DLPARSE_V2
– -
PYPDFIUM2
–
DLPARSE_V1
class-attribute
instance-attribute
DLPARSE_V1 = 'dlparse_v1'
DLPARSE_V2
class-attribute
instance-attribute
DLPARSE_V2 = 'dlparse_v2'
PYPDFIUM2
class-attribute
instance-attribute
PYPDFIUM2 = 'pypdfium2'
PdfPipelineOptions
Bases: PipelineOptions
Options for the PDF pipeline.
Attributes:
-
accelerator_options
(AcceleratorOptions
) – -
artifacts_path
(Optional[Union[Path, str]]
) – -
create_legacy_output
(bool
) – -
do_ocr
(bool
) – -
do_table_structure
(bool
) – -
document_timeout
(Optional[float]
) – -
generate_page_images
(bool
) – -
generate_picture_images
(bool
) – -
generate_table_images
(bool
) – -
images_scale
(float
) – -
ocr_options
(Union[EasyOcrOptions, TesseractCliOcrOptions, TesseractOcrOptions, OcrMacOptions, RapidOcrOptions]
) – -
table_structure_options
(TableStructureOptions
) –
accelerator_options
class-attribute
instance-attribute
accelerator_options: AcceleratorOptions = (
AcceleratorOptions()
)
artifacts_path
class-attribute
instance-attribute
artifacts_path: Optional[Union[Path, str]] = None
create_legacy_output
class-attribute
instance-attribute
create_legacy_output: bool = True
do_ocr
class-attribute
instance-attribute
do_ocr: bool = True
do_table_structure
class-attribute
instance-attribute
do_table_structure: bool = True
document_timeout
class-attribute
instance-attribute
document_timeout: Optional[float] = None
generate_page_images
class-attribute
instance-attribute
generate_page_images: bool = False
generate_picture_images
class-attribute
instance-attribute
generate_picture_images: bool = False
generate_table_images
class-attribute
instance-attribute
generate_table_images: bool = Field(
default=False,
deprecated="Field `generate_table_images` is deprecated. To obtain table images, set `PdfPipelineOptions.generate_page_images = True` before conversion and then use the `TableItem.get_image` function.",
)
images_scale
class-attribute
instance-attribute
images_scale: float = 1.0
ocr_options
class-attribute
instance-attribute
ocr_options: Union[
EasyOcrOptions,
TesseractCliOcrOptions,
TesseractOcrOptions,
OcrMacOptions,
RapidOcrOptions,
] = Field(EasyOcrOptions(), discriminator="kind")
table_structure_options
class-attribute
instance-attribute
table_structure_options: TableStructureOptions = (
TableStructureOptions()
)
PipelineOptions
Bases: BaseModel
Base pipeline options.
Attributes:
-
accelerator_options
(AcceleratorOptions
) – -
create_legacy_output
(bool
) – -
document_timeout
(Optional[float]
) –
accelerator_options
class-attribute
instance-attribute
accelerator_options: AcceleratorOptions = (
AcceleratorOptions()
)
create_legacy_output
class-attribute
instance-attribute
create_legacy_output: bool = True
document_timeout
class-attribute
instance-attribute
document_timeout: Optional[float] = None
RapidOcrOptions
Bases: OcrOptions
Options for the RapidOCR engine.
Attributes:
-
bitmap_area_threshold
(float
) – -
cls_model_path
(Optional[str]
) – -
det_model_path
(Optional[str]
) – -
force_full_page_ocr
(bool
) – -
kind
(Literal['rapidocr']
) – -
lang
(List[str]
) – -
model_config
– -
print_verbose
(bool
) – -
rec_model_path
(Optional[str]
) – -
text_score
(float
) – -
use_cls
(Optional[bool]
) – -
use_det
(Optional[bool]
) – -
use_rec
(Optional[bool]
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
cls_model_path
class-attribute
instance-attribute
cls_model_path: Optional[str] = None
det_model_path
class-attribute
instance-attribute
det_model_path: Optional[str] = None
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
kind
class-attribute
instance-attribute
kind: Literal['rapidocr'] = 'rapidocr'
lang
class-attribute
instance-attribute
lang: List[str] = ['english', 'chinese']
model_config
class-attribute
instance-attribute
model_config = ConfigDict(extra='forbid')
print_verbose
class-attribute
instance-attribute
print_verbose: bool = False
rec_model_path
class-attribute
instance-attribute
rec_model_path: Optional[str] = None
text_score
class-attribute
instance-attribute
text_score: float = 0.5
use_cls
class-attribute
instance-attribute
use_cls: Optional[bool] = None
use_det
class-attribute
instance-attribute
use_det: Optional[bool] = None
use_rec
class-attribute
instance-attribute
use_rec: Optional[bool] = None
TableFormerMode
TableStructureOptions
Bases: BaseModel
Options for the table structure.
Attributes:
-
do_cell_matching
(bool
) – -
mode
(TableFormerMode
) –
do_cell_matching
class-attribute
instance-attribute
do_cell_matching: bool = True
TesseractCliOcrOptions
Bases: OcrOptions
Options for the TesseractCli engine.
Attributes:
-
bitmap_area_threshold
(float
) – -
force_full_page_ocr
(bool
) – -
kind
(Literal['tesseract']
) – -
lang
(List[str]
) – -
model_config
– -
path
(Optional[str]
) – -
tesseract_cmd
(str
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
kind
class-attribute
instance-attribute
kind: Literal['tesseract'] = 'tesseract'
lang
class-attribute
instance-attribute
lang: List[str] = ['fra', 'deu', 'spa', 'eng']
model_config
class-attribute
instance-attribute
model_config = ConfigDict(extra='forbid')
path
class-attribute
instance-attribute
path: Optional[str] = None
tesseract_cmd
class-attribute
instance-attribute
tesseract_cmd: str = 'tesseract'
TesseractOcrOptions
Bases: OcrOptions
Options for the Tesseract engine.
Attributes:
-
bitmap_area_threshold
(float
) – -
force_full_page_ocr
(bool
) – -
kind
(Literal['tesserocr']
) – -
lang
(List[str]
) – -
model_config
– -
path
(Optional[str]
) –
bitmap_area_threshold
class-attribute
instance-attribute
bitmap_area_threshold: float = 0.05
force_full_page_ocr
class-attribute
instance-attribute
force_full_page_ocr: bool = False
kind
class-attribute
instance-attribute
kind: Literal['tesserocr'] = 'tesserocr'
lang
class-attribute
instance-attribute
lang: List[str] = ['fra', 'deu', 'spa', 'eng']
model_config
class-attribute
instance-attribute
model_config = ConfigDict(extra='forbid')
path
class-attribute
instance-attribute
path: Optional[str] = None