SUTRA are ultrafast, multilingual, online Generative AI models that can operate in 50+ languages with conversation, search, and visual capabilities.

Multilingual

Supports ultrafast text-generation and instruction-following in multiple languages.

Cost-efficient

Energy and cost efficient as well as easier to maintain and add new language capabilities.

Ultrafast

Lightweight MoE architecture and purpose-built tokenizer enables ultra low latency.

Online

Connected and hallucination-free models that provide factual responses in a conversational tone.

SUTRA models are fine-tuned and aligned with proprietary conversational datasets, ensuring coherent and consistent dialogs.

50+ languages with

accuracy & efficiency

SUTRA surpasses leading models by 20-30% on the MMLU benchmark in comprehending and generating responses across numerous languages.

SUTRA models are fine-tuned and aligned with 150M+ conversational data, ensuring coherent and consistent dialogs.

MMLU scores

HINDI

हिंदी

GPT 3.5

39

LLAMA 3

64

SUTRA

68

HINDI

हिंदी

GPT 3.5

39

SUTRA

68

KOREAN

한국어

HyperClova

54

GPT 3.5

51

SUTRA

67

KOREAN

한국어

HyperClova

54

SUTRA

67

GUJARATI

ગુજરાતી

GPT 4

61

GPT 3.5

35

SUTRA

67

GUJARATI

ગુજરાતી

GPT 4

61

SUTRA

67

JAPANESE

日本語

LLAMA 3

70

SAKANA

62

SUTRA

75

JAPANESE

日本語

LLAMA 3

70

SUTRA

75

ARABIC

العربية

LLAMA 3

60

GPT 3.5

49

SUTRA

67

ARABIC

العربية

LLAMA 3

60

SUTRA

67

Cost-efficient tokenization

for non-English languages

SUTRA's purpose-built tokenizer is more efficient across languages, enabling cost-savings and ultrafast inference for multilingual applications.

Token Counts

GPT 3.5

SUTRA

Token Counts

GPT 3.5

SUTRA

હાય, હું સૂત્ર છું, હું બહુભાષી અને અલ્ટ્રાફાસ્ટ AI મોડેલ છું. તમને મળીને આનંદ થયો.

5.7x fewer tokens than GPT

131

23

હાય, હું સૂત્ર છું, હું બહુભાષી અને અલ્ટ્રાફાસ્ટ AI મોડેલ છું. તમને મળીને આનંદ થયો.

5.7x fewer tokens than GPT

131

23

नमस्ते, मैं सूत्र हूं, मैं एक बहुभाषी और अल्ट्राफास्ट एआई मॉडल हूं। आप से मिल कर अच्छा लगा।

4.1x fewer tokens than GPT

94

23

नमस्ते, मैं सूत्र हूं, मैं एक बहुभाषी और अल्ट्राफास्ट एआई मॉडल हूं। आप से मिल कर अच्छा लगा।

4.1x fewer tokens than GPT

94

23

안녕하세요. 저는 다국어 초고속 AI 모델 SUTRA입니다. 그것은 당신을 만나서 반가워요.

1.8x fewer tokens than GPT

43

24

안녕하세요. 저는 다국어 초고속 AI 모델 SUTRA입니다. 그것은 당신을 만나서 반가워요.

1.8x fewer tokens than GPT

43

24

ஹாய், நான் சூத்ரா, நான் ஒரு பன்மொழி மற்றும் அதிவேக AI மாடல். உங்களை சந்தித்ததில் மகிழ்ச்சி.

5.1x fewer tokens than GPT

117

23

ஹாய், நான் சூத்ரா, நான் ஒரு பன்மொழி மற்றும் அதிவேக AI மாடல். உங்களை சந்தித்ததில் மகிழ்ச்சி.

5.1x fewer tokens than GPT

117

23

مرحبًا، أنا SUTRA، أنا نموذج ذكاء اصطناعي متعدد اللغات وفائق السرعة. تشرفنا.

2.9x fewer tokens than GPT

61

21

مرحبًا، أنا SUTRA، أنا نموذج ذكاء اصطناعي متعدد اللغات وفائق السرعة. تشرفنا.

2.9x fewer tokens than GPT

61

21

हाय, मी सूत्र आहे, मी एक बहुभाषिक आणि अल्ट्राफास्ट AI मॉडेल आहे. तुम्हाला भेटून छान वाटले.

3.8x fewer tokens than GPT

88

23

हाय, मी सूत्र आहे, मी एक बहुभाषिक आणि अल्ट्राफास्ट AI मॉडेल आहे. तुम्हाला भेटून छान वाटले.

3.8x fewer tokens than GPT

88

23

こんにちは、SUTRA です。多言語対応の超高速 AI モデルです。お会い出来て嬉しいです。

1.6x fewer tokens than GPT

40

25

こんにちは、SUTRA です。多言語対応の超高速 AI モデルです。お会い出来て嬉しいです。

1.6x fewer tokens than GPT

40

25

Hi, I’m SUTRA, I am a multilingual and ultrafast AI model. It's nice to meet you.

Similar number of tokens as GPT

26

26

Hi, I’m SUTRA, I am a multilingual and ultrafast AI model. It's nice to meet you.

Similar number of tokens as GPT

26

26

Up-to-date &

hallucination-free

SUTRA-Online are internet connected and hallucination-free models that understand queries, browse the web, and summarize information to provide current answers.


It can answer queries like “Who won the game last night” or “What’s the current stock price” accurately, whereas offline models suffer from knowledge cut-off dates.

Models

SUTRA-Pro

SUTRA-Pro is adept at executing instructions in 50+ languages for conversational use cases and complex tasks. Developed with a rich diversity of instructions from both proprietary and leading open-access datasets, our models excel across a spectrum of multilingual benchmarks, offering unparalleled proficiency in over 50 languages including the nuanced spectrums of Latin, Indic, and Far Eastern languages.

Parameters

150b

Parameters

150b

SUTRA-Light

SUTRA-Light is designed for conversation, summarization, and other tasks in 50+ languages. An ideal choice for multilingual conversations at scale, balancing performance, cost, and speed. SUTRA-Light is highly efficient and optimized for ultra-low latency applications.

Parameters

56b - active 13b

Parameters

56b - active 13b

Time-to-first-token (TTFT)

100-200ms

Time-to-first-token (TTFT)

100-200ms

SUTRA-Online

SUTRA-Online can use knowledge from the internet, and thus leverage the most up-to-date information when forming responses. By understanding queries, browsing the web, and summarizing information, SUTRA-Online can accurately respond to time-sensitive queries, unlocking knowledge beyond its training corpus. This means that SUTRA models can respond to queries like “Who won in yesterday’s India vs England cricket match?” accurately, whereas offline models suffer from knowledge cut-off dates.

Parameters

56b - active 13b

Parameters

56b - active 13b

Knowledge-cutoff

Up-to-date

Knowledge-cutoff

Up-to-date



Detailed SUTRA model spec and sample code are available.

SUTRA for developers

Developer Portal

Access our all-in-one SUTRA resource hub for developers

API Reference

Learn how to start building using simple to use SUTRA API

Playground

Experience the power of SUTRA and compare with other LLMs

SUTRA models are the technology backbone of TWO’s products and services and are available as Model as a Service (MaaS) to other apps and services via usage-based pricing and simple-to-integrate APIs.