Introducing Kaitom Voice V3 - Next-Generation Thai Text-to-Speech

January 27, 2026 · 4 min read

CEO @ iApp Technology

Kaitom Voice V3 - Next-Generation Thai Text-to-Speech

We're excited to announce Kaitom Voice V3, the next generation of our Thai Text-to-Speech API. This major update brings significant improvements to speech quality, introduces smart text normalization, and simplifies integration with a modern JSON-based API.

What's New in V3

Kaitom Voice V3 represents a complete overhaul of our TTS engine, delivering the most natural-sounding Thai speech synthesis we've ever created.

Smart Text Normalization

V3 automatically handles complex text elements that previously required pre-processing:

Type	Input	Spoken Output
Numbers	`1,234.56`	"หนึ่งพันสองร้อยสามสิบสี่จุดห้าหก"
Dates	`27/01/2569`	"วันที่ยี่สิบเจ็ดมกราคมพ.ศ.สองพันห้าร้อยหกสิบเก้า"
Currency	`฿1,500`	"หนึ่งพันห้าร้อยบาท"
Time	`14:30`	"สิบสี่นาฬิกาสามสิบนาที"
Percentages	`25%`	"ยี่สิบห้าเปอร์เซ็นต์"

IPA Pronunciation Support

For technical terms, brand names, or words requiring specific pronunciation, you can now use IPA (International Phonetic Alphabet) notation:

พูด {IPA: kʰun} ให้ถูกต้อง

This is particularly useful for:

Foreign names and brand names
Technical terminology
Regional dialects or accents
Custom pronunciation preferences

Automatic Language Detection

No more specifying language modes! V3 automatically detects and handles Thai-English mixed content:

Hello and Welcome! ยินดีต้อนรับสู่ iApp Technology ผู้นำด้าน AI ของประเทศไทย

Simplified JSON API

We've modernized the API with a clean JSON interface:

curl -X POST 'https://api.iapp.co.th/v3/store/audio/tts/generate' \
    --header 'apikey: YOUR_API_KEY' \
    --header 'Content-Type: application/json' \
    --data '{"text": "สวัสดีครับ น้องไข่ต้ม เวอร์ชั่น 3"}'

High-Quality Audio Output

44.1 kHz WAV output - Professional-grade audio quality
Extended character limit - Up to 10,000 characters per request
Faster processing - Optimized for speed without sacrificing quality

Quick Start

Python

import requests

url = "https://api.iapp.co.th/v3/store/audio/tts/generate"
headers = {
    "apikey": "YOUR_API_KEY",
    "Content-Type": "application/json"
}
data = {"text": "สวัสดีครับ น้องไข่ต้ม เวอร์ชั่น 3"}

response = requests.post(url, headers=headers, json=data)
with open("output.wav", "wb") as f:
    f.write(response.content)

JavaScript

const response = await fetch("https://api.iapp.co.th/v3/store/audio/tts/generate", {
    method: "POST",
    headers: {
        "apikey": "YOUR_API_KEY",
        "Content-Type": "application/json"
    },
    body: JSON.stringify({ text: "สวัสดีครับ น้องไข่ต้ม เวอร์ชั่น 3" })
});
const blob = await response.blob();

V3 vs V2 Comparison

Feature	V2	V3
API Format	Form Data	JSON
Language Mode	Required (`TH` / `TH_MIX_EN`)	Auto-detected
Text Normalization	Basic	Smart (numbers, dates, currency)
IPA Support	No	Yes
Max Characters	Unlimited	10,000
Audio Quality	Standard	44.1 kHz

Pricing

V3 maintains the same competitive pricing as our previous versions:

Cost: 1 IC per 400 characters
Free trial credits available for new users

Use Cases

E-Learning & Education

Convert educational content into audio lessons with proper pronunciation of numbers, dates, and technical terms.

Chatbots & Virtual Assistants

Create natural-sounding voice responses for Thai chatbots with automatic language handling.

Content Creation

Generate professional voiceovers for videos and podcasts with high-quality audio output.

Accessibility

Make digital content accessible to visually impaired users with clear, natural speech.

IVR Systems

Build interactive voice response systems with smart text normalization for phone numbers, amounts, and dates.

Migration Guide

Migrating from V2 to V3 is straightforward:

Before (V2):

curl -X POST 'https://api.iapp.co.th/v3/store/speech/text-to-speech/kaitom' \
    --header 'apikey: YOUR_API_KEY' \
    --form 'text="สวัสดีครับ"' \
    --form 'language="TH"'

After (V3):

curl -X POST 'https://api.iapp.co.th/v3/store/audio/tts/generate' \
    --header 'apikey: YOUR_API_KEY' \
    --header 'Content-Type: application/json' \
    --data '{"text": "สวัสดีครับ"}'

Key changes:

New endpoint: /v3/store/audio/tts/generate
Content-Type: application/json
Request body: JSON format {"text": "..."}
No language parameter needed

Try It Now

Ready to experience the next generation of Thai TTS?

Interactive Demo - Try V3 directly in your browser
API Documentation - Complete technical reference
Get API Key - Start building today

What's Next

We're continuously improving Kaitom Voice. Upcoming features include:

Additional voice options (coming soon)
SSML support for fine-grained control
Streaming audio output for real-time applications

Feedback

We'd love to hear your feedback on Kaitom Voice V3! Join our community:

Discord: discord.gg/kYcpmdEcS2
Email: sale@iapp.co.th
Phone: 086-322-5858

Kaitom Voice V3 is available now for all iApp API users. Existing V1 and V2 APIs will continue to be supported.

Introducing Kaitom Voice V3 - Next-Generation Thai Text-to-Speech

What's New in V3

Smart Text Normalization

IPA Pronunciation Support

Automatic Language Detection

Simplified JSON API

High-Quality Audio Output

Quick Start

Python

JavaScript

V3 vs V2 Comparison

Pricing

Use Cases

E-Learning & Education

Chatbots & Virtual Assistants

Content Creation

Accessibility

IVR Systems

Migration Guide

Try It Now

What's Next

Feedback

ChindaX

SpeechFlow

ChindaGO

What's New in V3​

Smart Text Normalization​

IPA Pronunciation Support​

Automatic Language Detection​

Simplified JSON API​

High-Quality Audio Output​

Quick Start​

Python​

JavaScript​

V3 vs V2 Comparison​

Pricing​

Use Cases​

E-Learning & Education​

Chatbots & Virtual Assistants​

Content Creation​

Accessibility​

IVR Systems​

Migration Guide​

Try It Now​

What's Next​

Feedback​

What's New in V3

Smart Text Normalization

IPA Pronunciation Support

Automatic Language Detection

Simplified JSON API

High-Quality Audio Output

Quick Start

Python

JavaScript

V3 vs V2 Comparison

Pricing

Use Cases

E-Learning & Education

Chatbots & Virtual Assistants

Content Creation

Accessibility

IVR Systems

Migration Guide

Try It Now

What's Next

Feedback