What is RapidOCR?

RapidOCR is a lightweight OCR engine that runs on ONNX Runtime. It is the best fit when inference speed and a small deployment footprint matter most.

Usage

from upsonic.ocr import OCR
from upsonic.ocr.layer_1.engines import RapidOCREngine
# Also available: from upsonic.ocr import RapidOCREngine

# Create engine instance
engine = RapidOCREngine(languages=['en'], confidence_threshold=0.5)

# Create OCR orchestrator
ocr = OCR(layer_1_ocr_engine=engine)

# Extract text from image
text = ocr.get_text('invoice.png')
print(text)

# Process PDF
result = ocr.process_file('document.pdf')
print(f"Extracted {len(result.text)} characters from {result.page_count} pages")
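
The same orchestrator can be reused across several files. The following is a minimal sketch, assuming only that `ocr.get_text` accepts a file path and returns a string, as in the usage above. The helper name `extract_texts` and the `TextExtractor` protocol are hypothetical and not part of Upsonic.

```python
from typing import Dict, Iterable, Protocol


class TextExtractor(Protocol):
    """Any object exposing get_text(path) -> str, such as the OCR orchestrator."""

    def get_text(self, path: str) -> str: ...


def extract_texts(ocr: TextExtractor, paths: Iterable[str]) -> Dict[str, str]:
    """Run OCR on each path and return a {path: extracted text} mapping.

    Hypothetical helper for illustration; not part of Upsonic.
    """
    results: Dict[str, str] = {}
    for path in paths:
        # Keys are the paths exactly as given, so repeated paths overwrite.
        results[str(path)] = ocr.get_text(str(path))
    return results
```

With the `ocr` object created above, a call might look like `texts = extract_texts(ocr, ['invoice.png'])`, after which `texts['invoice.png']` holds the extracted text.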

Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| languages | List[str] | ['en'] | List of language codes |
| confidence_threshold | float | 0.0 | Minimum confidence for text blocks |
| rotation_fix | bool | False | Auto-detect and fix image rotation |
| enhance_contrast | bool | False | Enhance image contrast |
| remove_noise | bool | False | Apply noise reduction |
| pdf_dpi | int | 300 | DPI for PDF rendering |
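
The parameters above can be collected and checked in one place before an engine is created. This is a sketch under stated assumptions: `RapidOCRSettings` and `build_engine` are hypothetical names, not Upsonic classes or functions. The sketch assumes that every parameter in the table is accepted as a keyword argument by `RapidOCREngine` (only `languages` and `confidence_threshold` are shown that way in the usage example) and that confidence values lie between 0.0 and 1.0.

```python
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List


@dataclass
class RapidOCRSettings:
    """Hypothetical container mirroring the parameter table and its defaults."""

    languages: List[str] = field(default_factory=lambda: ["en"])
    confidence_threshold: float = 0.0
    rotation_fix: bool = False
    enhance_contrast: bool = False
    remove_noise: bool = False
    pdf_dpi: int = 300

    def __post_init__(self) -> None:
        # Assumption: confidence is a score in the range [0.0, 1.0].
        if not 0.0 <= self.confidence_threshold <= 1.0:
            raise ValueError("confidence_threshold must be between 0.0 and 1.0")
        if self.pdf_dpi <= 0:
            raise ValueError("pdf_dpi must be a positive integer")
        if not self.languages:
            raise ValueError("languages must contain at least one code")

    def to_kwargs(self) -> Dict[str, Any]:
        """Return the settings as a plain dict of keyword arguments."""
        return asdict(self)


def build_engine(settings: RapidOCRSettings):
    """Create an engine from validated settings (requires Upsonic)."""
    # Imported lazily so the settings class works without Upsonic installed.
    from upsonic.ocr.layer_1.engines import RapidOCREngine

    # Assumption: the constructor accepts all table parameters as keywords.
    return RapidOCREngine(**settings.to_kwargs())
```

For a noisy, possibly rotated scan, the settings might be `RapidOCRSettings(confidence_threshold=0.5, rotation_fix=True, remove_noise=True)`. Validating up front surfaces a bad threshold or DPI before any model is loaded.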

Supported Languages

English, Chinese (Simplified and Traditional), Japanese, and Korean, plus text in several other scripts, including Tamil, Telugu, Arabic, Cyrillic, and Devanagari.