Automatic OCR language detection with tesseract