What is Thai Document OCR? A Complete Beginner's Guide
By Dr. Kobkrit Viriyayudhakorn, CEO & Founder, iApp Technology
Ever wondered how apps can instantly read your Thai ID card or passport? How banks verify your identity from a photo of your documents? The magic behind this is called OCR - Optical Character Recognition. In this guide, we'll explain everything you need to know about Thai Document OCR in simple terms.

What is OCR?
OCR (Optical Character Recognition) is a technology that converts images of text into machine-readable text data. Think of it as teaching a computer to "read" like a human.
Simple Analogy
Imagine you have a photo of a Thai ID card. A human can easily read the name, address, and ID number. OCR technology enables computers to do the same thing - look at the image and extract all the text information automatically.
What Makes Thai OCR Special?
Thai OCR is more challenging than English OCR because:
- Complex Script: Thai has 44 consonants, 32 vowels, and 5 tones
- No Word Spacing: Thai text doesn't have spaces between words
- Stacking Characters: Vowels and tone marks stack above/below consonants
- Mixed Languages: Thai documents often contain both Thai and English text
This is why specialized Thai OCR solutions like iApp's are essential for accurate results.
5 Key Terms You Need to Know
Before diving deeper, let's clarify some OCR jargon that often confuses beginners:
1. Accuracy Rate
Accuracy Rate measures how correctly the OCR system reads text, usually expressed as a percentage.
| Level | Description | Example |
|---|---|---|
| Character-level | Accuracy per character | 98.13% (iApp Thai ID Card OCR) |
| Field-level | Accuracy per data field | 96.82% for account numbers |
| Document-level | Overall document accuracy | 95%+ for clear images |
Why it matters: Higher accuracy means fewer errors and less manual correction needed.
2. Bounding Box
A Bounding Box is a rectangle that identifies where text is located in an image.
Image coordinates: [x1, y1, x2, y2]
Example: [119, 292, 376, 334] = text location
Why it matters: Bounding boxes help you understand exactly where each piece of information was found in the document.
3. Preprocessing
Preprocessing is the preparation of images before OCR analysis. Common steps include:
- Cropping: Removing unnecessary parts of the image
- Rotation: Fixing tilted documents
- Deskewing: Straightening skewed text
- Enhancement: Improving contrast and clarity
Why it matters: Good preprocessing dramatically improves OCR accuracy.
4. Confidence Score
Confidence Score indicates how certain the OCR system is about its reading, usually from 0 to 1 (or 0% to 100%).
{
"id_number": "1234567890123",
"detection_score": 0.98 // 98% confident
}
Why it matters: Low confidence scores signal that results should be verified manually.
5. Structured Data Output
Structured Data Output is the organized format of extracted information, typically JSON.
{
"th_name": "นาย ทดสอบ ตัวอย่าง",
"en_name": "Mr. Test Example",
"id_number": "1-2345-67890-12-3",
"date_of_birth": "01 Jan 1990",
"address": "123 ถนนสุขุมวิท กรุงเทพฯ"
}
Why it matters: Structured data can be directly used in your applications without additional parsing.
Why is Thai Document OCR Important?
1. Digital Transformation
Thailand is rapidly digitizing government and business services. OCR enables:
- Paperless onboarding
- Digital identity verification
- Automated document processing
2. Time and Cost Savings
Manual data entry is slow and expensive:
| Method | Time per Document | Error Rate |
|---|---|---|
| Manual Entry | 2-5 minutes | 1-3% |
| OCR API | 1-2 seconds | Less than 2% |
3. E-KYC Compliance
Thai regulations require businesses to verify customer identities. OCR enables:
- Instant ID verification
- PDPA-compliant data processing
- Anti-fraud measures
4. Improved Customer Experience
Customers expect fast, digital services:
- No more filling out forms manually
- Instant account opening
- Seamless onboarding
What Problems Does Thai Document OCR Solve?

Banking & Finance
- Account Opening: Extract customer information from ID cards
- Loan Applications: Process income documents automatically
- KYC Verification: Verify identity documents instantly
Insurance
- Claims Processing: Read accident reports and medical documents
- Policy Enrollment: Capture customer data from IDs
- Fraud Detection: Verify document authenticity
Healthcare
- Patient Registration: Extract data from national ID cards
- Insurance Claims: Process health insurance documents
- Medical Records: Digitize paper records
Government Services
- Citizen Services: Streamline document submissions
- Tax Processing: Read tax documents and receipts
- License Verification: Verify driver licenses and permits
E-Commerce & Logistics
- Seller Verification: Verify merchant identities
- Address Extraction: Read shipping labels
- Returns Processing: Process return documents
How Does Thai Document OCR Work?
Let's break down the OCR process step by step:
Step 1: Document Capture
The process starts with capturing an image of the document through:
- Smartphone camera
- Scanner
- Uploaded image file
Step 2: Image Preprocessing
The system prepares the image for analysis:
- Auto-rotation: Corrects tilted images
- Cropping: Removes background
- Enhancement: Improves text visibility
- Noise Reduction: Removes artifacts
Step 3: Text Detection
AI identifies where text exists in the image:
- Locates text regions
- Creates bounding boxes
- Identifies field types (name, ID number, etc.)
Step 4: Character Recognition
The core OCR engine reads the text:
- Analyzes each character
- Uses deep learning models trained on Thai text
- Handles both Thai and English characters
Step 5: Post-Processing
The system refines the results:
- Spell checking
- Format validation (e.g., 13-digit ID numbers)
- Data structuring
Step 6: Output Generation
Final structured data is returned as JSON.
How to Use Thai Document OCR
Method 1: Web Demo
Try iApp's OCR directly on our website - no coding required!
- Visit Thai ID Card OCR Demo
- Upload an image
- See results instantly
Method 2: API Integration
For developers, integrate OCR via REST API:
import requests
# Thai ID Card OCR
url = "https://api.iapp.co.th/v3/store/ekyc/thai-national-id-card/front"
files = {"file": open("id_card.jpg", "rb")}
headers = {"apikey": "YOUR_API_KEY"}
response = requests.post(url, headers=headers, files=files)
result = response.json()
print(f"Name: {result['th_name']}")
print(f"ID Number: {result['id_number']}")
print(f"Address: {result['address']}")
Method 3: Mobile SDK
For mobile apps, use our SDK for iOS and Android with built-in camera capture and real-time processing.
Thai Document OCR Examples
Example 1: Thai National ID Card OCR
Input: Photo of Thai National ID Card (front)
Output:
{
"id_number": "1-2345-67890-12-3",
"th_name": "นาง ทดสอบ ตัวอย่าง",
"en_name": "Mrs. Test Example",
"th_dob": "15 ม.ค. 2533",
"en_dob": "15 Jan 1990",
"address": "123/45 ถ.สุขุมวิท แขวงคลองตัน เขตวัฒนา กรุงเทพฯ",
"province": "กรุงเทพมหานคร",
"detection_score": 0.98,
"process_time": 1.2
}
Use Cases: E-KYC, account opening, identity verification
Example 2: Passport OCR
Input: Photo of passport profile page
Output:
{
"type": "P",
"country": "THA",
"surname": "EXAMPLE",
"names": "TEST",
"number": "AA1234567",
"nationality": "THA",
"date_of_birth": "900115",
"sex": "F",
"expiration_date": "300114",
"personal_number": "1234567890123",
"valid_score": 100
}
Use Cases: Travel booking, immigration, international KYC
Example 3: Thai Driver License OCR
Input: Photo of Thai driver license
Output:
{
"license_number": "12345678",
"th_name": "นาย ทดสอบ ตัวอย่าง",
"en_name": "Mr. Test Example",
"date_of_birth": "15/01/1990",
"expiry_date": "15/01/2030",
"license_type": "รถยนต์ส่วนบุคคล",
"province": "กรุงเทพมหานคร"
}
Use Cases: Car rental, ride-sharing verification, age verification
Example 4: Bank Passbook OCR
Input: Photo of Thai bank passbook
Output:
{
"bank_name": "ธนาคารไทยพาณิชย์",
"account_number": "123-4-56789-0",
"account_name": "นาย ทดสอบ ตัวอย่าง",
"bank_branch": "สาขาสยามพารากอน",
"signature_detected": true
}
Use Cases: Payment verification, loan processing, account linking
iApp Technology's Thai Document OCR Services
At iApp Technology, we offer comprehensive OCR solutions for Thai documents:
Thai National ID Card OCR
- Accuracy: 98.13% character-level
- Speed: 1-2 seconds
- Cost: 1.25 IC per request (front)
- Try Demo
Passport OCR
- Accuracy: 95.51%
- Supports: All MRZ passports worldwide
- Cost: 0.75 IC per page
- Try Demo
Thai Driver License OCR
- Fields: License number, name, expiry, type
- Cost: 1.25 IC per request
- Try Demo
Thai Bank Passbook OCR
- Supports: All major Thai banks
- Accuracy: 93% overall
- Cost: 1.25 IC per page
- Try Demo
Additional OCR Services
- Tax Deduction Certificate
- Civil Registration Certificate
- License Plate Recognition
- Power/Water Meter Reading
- CV/Resume Parsing
Getting Started with iApp OCR APIs
Step 1: Create a Free Account
Visit iapp.co.th/register to create your account.
Step 2: Get Your API Key
Go to API Key Management to generate your key.
Step 3: Choose Your OCR Service
Select the document type you need to process from our documentation.
Step 4: Make Your First API Call
curl -X POST "https://api.iapp.co.th/v3/store/ekyc/thai-national-id-card/front" \
-H "apikey: YOUR_API_KEY" \
-F "file=@id_card.jpg"
Step 5: Integrate and Go Live
Use our code examples in Python, JavaScript, PHP, Swift, Kotlin, Java, and Dart.
Best Practices for Thai Document OCR
Image Quality Tips
- Good Lighting: Avoid shadows and glare
- Flat Surface: Keep documents flat
- Full Frame: Capture the entire document
- High Resolution: Minimum 300 DPI recommended
- Clear Focus: Avoid blurry images
Integration Tips
- Handle Errors: Check for error codes in responses
- Validate Confidence: Review low-confidence results manually
- Retry Logic: Implement retries for network failures
- Secure Storage: Don't store sensitive document images
Compliance Tips
- User Consent: Get permission before capturing documents
- Data Minimization: Only extract needed fields
- PDPA Compliance: Follow Thai data protection laws
- Audit Logging: Keep records of OCR operations
Summary
Thai Document OCR transforms how businesses handle document processing in Thailand. Here's what we covered:
- OCR converts document images into structured, machine-readable data
- Key terms: Accuracy Rate, Bounding Box, Preprocessing, Confidence Score, Structured Data
- Applications: Banking, insurance, healthcare, government, e-commerce
- Thai OCR challenges: Complex script, no word spacing, mixed languages
- iApp solutions: ID Card, Passport, Driver License, Bank Passbook OCR
The digital transformation of Thai businesses depends on accurate, fast document processing - and OCR makes it possible.
Ready to Try Thai Document OCR?
Start digitizing your document workflows today:
- Create Free Account - Get started in minutes
- Try Thai ID Card OCR - Our most popular API
- Explore All OCR APIs - Full documentation
- Contact Us - Enterprise solutions
Have questions? Join our Discord community or email us at support@iapp.co.th.
iApp Technology Co., Ltd. Thailand's Leading AI Company