Features

Cloud-based live transcription

Cloud-based transcription converts audio to text for active or selected hosts in real time. Text can be distributed as live captions to all participants in the channel.

LLM integration

Integrate speech to text with LLMs for further processing, without impacting RTC performance. Upload transcription text as .vtt files to LLMs like GPT to generate summaries, notes, and more.

Transcribing and labeling simultaneous speakers

Easily label who said what—even with up to 3 simultaneous speakers. Separate transcription for each host ensures accuracy and allows you to choose to transcribe for one specific host.

Captioning for cloud recordings

Transcribe audio to text on video or audio recordings to enable closed captions (CC) on playback or review important discussion items in the transcript.

Multi-language support

Real-time transcription supports all major languages and dialects, and each channel can support audio-to-text transcription for up to two languages simultaneously.

Enterprise-grade security and compliance

Agora is ISO and SOC 2 certified and meets compliance standards for regional privacy laws and industry regulations, including GDPR, CCPA, and HIPAA. Live captions and transcription can be encrypted in the same way as encrypted RTC audio or video.

あなたのビジョン、制限なし。

Interactive Whiteboardを使用すると、カスタムブランディングと豊富な機能を備えたコラボレーションアプリをすばやく構築できます。当社のプラットフォームでは、カスタマイズされた魅力的な学習環境を簡単に作成できます。

柔軟な API は、カスタムブランディングと広範なデジタルホワイトボード機能をサポートします。
リアルタイムの音声通話とビデオ通話、インタラクティブなストリーミング、シグナリングを簡単に統合できます。
ファイルのプリロード、共有、注釈付けによってユーザーの帯域幅を節約し、すべての動的コンテンツを保持できます。

また、HIPAA、GDPR、CCPAへのコンプライアンスにも安心してお使いいただけます。

Instantly transcribe speech to text for live audio and video

Agora’s Real-Time Speech to Text provides accurate live transcription and subtitling services at a low cost.

Reduce cost and increase efficiency

More efficient and cost-effective than traditional client-side live transcription, Agora’s solution by uses advanced technology to remove silence, reduce Word Error Rate (WER), and distribute live captions to all participants in a channel.

Reduce cost and increase efficiency

Get the most accurate results at scale

Cutting-edge AI ensures the highest accuracy even with overlapping speech, regional accents, and poor network conditions. Scale from one-to-one meetings to up to millions of participants with the same accuracy.

Get the most accurate results at scale

Integrate with ease

Agora’s Real-Time Speech to Text is highly integrated with Agora’s network (SD-RTN™), providing global user transcription and real-time text distribution even in poor network environments.

Integrate with ease

以下のレコーディングオプション:

クラウドレコーディング

記録をクラウドに保存、取得、共有します。

ドキュメントに移動

オンプレミス録画

セキュリティと機密保持のため、ローカルサーバーに保存してください。

ドキュメントに移動

Web ページの録画

Web ブラウザーの画面エクスペリエンス全体を記録します。

ドキュメントに移動

アゴラ・メディア・サービス

レコーディング

オーディオストリーム、ビデオストリーム、およびWebページを録画して、アーカイブ、レビュー、または配信します。

クラウドレコーディングドキュメント

オンプレミスレコーディングドキュメンテーション

Web ページレコーディングドキュメント

メディアゲートウェイ

RTMP/SRTプロトコルを使用してメディアストリームをAgora音声およびビデオチャネルに直接プッシュし、メディアストリームの高度なトランスコーディング処理を可能にして配信を容易にします。

ドキュメンテーションに移動

メディアプル

ライブまたは録画したビデオやオーディオコンテンツを取り込み、Agoraチャンネルに直接取り込むことで、Agoraセッションのエンゲージメントをさらに高めましょう。

ドキュメンテーションに移動

メディアプッシュ

オーディオとビデオのストリームを Agora チャンネルからコンテンツ配信ネットワーク (CDN) にプッシュすることで、ハイブリッドなエンゲージメント体験で視聴者を増やしましょう。

ドキュメンテーションに移動

Made for developers

あなたのコード

アゴラ SDK

柔軟な SDK を使用して、最初からエクスペリエンスをカスタマイズできます。

ドキュメントに移動

あなたのコード

アゴラ SDK

AgoraのVideo SDKを使用して、柔軟性とカスタマイズ性を最大限に高めながら、リアルタイム動画を作成してアプリに統合できます。

ドキュメントに移動

コードなし

アプリビルダー

Agoraのアプリビルダーは、コード不要のビジュアルデザイナーを使用して、ビデオを製品にリアルタイムで取り込む最も速くて簡単な方法です。

ドキュメントに移動

ローコード

アゴラ UI キット

ローコードの UI Kit ライブラリを使用して、わずか数行のコードでリアルタイム動画をアプリに追加できます。

ドキュメントに移動

あなたのコード

アゴラ SDK

柔軟な SDK を使用して、最初からエクスペリエンスをカスタマイズできます。

ドキュメントに移動

ローコード

アゴラ UI キット

ローコードのUIKitライブラリを使用すると、わずか数行のコードを使用してリアルタイム通信とストリーミングを統合できます。

ドキュメントに移動

ドキュメンテーション

このプロジェクトでは、Agora API の使用方法を理解するのに役立つ一連の API 例を紹介します。

Platform-agnostic RESTful APIs make it easy to add highly accurate and cost-effective real-time speech-to-text capabilities.

ドキュメントに移動

AgoraコンソールでAIノイズ抑制拡張機能を有効にします。

Activate the Real-Time Speech to Text extension in the Agora Console.

あなたのコード

アゴラ SDK

AgoraのVoice SDKを使用して、最大限の柔軟性と完全なカスタマイズで音声通話を構築および統合できます。

ドキュメントに移動

コードなし

アプリビルダー

Agoraのアプリビルダーは、リアルタイムのボイスチャット、ビデオチャット、ライブストリーミングを製品に追加する最も速くて簡単な方法です。

ドキュメントに移動

あなたのコード

アゴラ SDK

AgoraのInteractive Whiteboard SDKを使用して、最も柔軟で完全なカスタマイズを実現しながら、リアルタイムのビジュアルコラボレーション機能を構築してアプリケーションに統合できます。

ドキュメントに移動

ローコード

ファストボード

事前構築された UI とカスタムプラグインを含める機能により、リアルタイムのビジュアルコラボレーションをより迅速に構築できます。

今すぐ試してみる

セキュリティ、プライバシー、コンプライアンス

アゴラは、ISO/IEC 27001、27017、27018、27701、およびSOC 2のセキュリティ基準の認定を受けており、GDPR、CCAP、COPPA、HIPAAなどのプライバシー規制を満たしています。Agoraは、サービスの提供に必要なインターネットプロトコル（IP）アドレスと運用情報以外のエンドユーザーデータを収集または保存しません。

ISO 27001:2022

ISO 27017:2015

ISO 27018:2019

ISO 27701:2019

ヒパー

GDPR

SOC2 タイプ 1 & 2

CCPA

コッパ

ユースケース

Transcribe speech to text for any real-time application

Securely transcribe and record real-time audio or video and organize recordings and transcripts to speed up workflows.

[すべて表示]

An online classroom with real-time captioning powered by speech-to-text transcription and subtitling.

Education

Give faculty and students real-time captions and analyze them with an LLM to provide lesson summaries and suggestions for further learning.

A live video call with a doctor and speech-to-text transcription services.

Telehealth

Keep secure records of virtual appointments for Minimum Effective Response (MER) and cross-reference telehealth knowledge bases.

A live basketball game showing player soaring through the air and making a slam dunk in front of a packed arena. Overlay text via speech-to-text reads "Unbelievable move! The score is now 68-65."

Events

Empower your event with real-time, accurate notes, ensuring a more accessible, searchable, and engaging event experience.

Live shopping

Use virtual assistants to improve accessibility and reach a wider audience by offering detailed product information, personalized recommendations, and guiding customers through the purchasing process.

A virtual meeting between four people with real-time automated notes and documented outstanding questions and action items via an LLM.

Virtual meetings

Provide real-time automated notes in meetings and document outstanding questions and action items via an LLM.

An influencer on social channel sharing a review of a sandwich with speech-to-text translations into Vietnamese.

Social & metaverse

Eliminate communication barriers for people with different languages or disabilities. Extract conversation for business optimization, advertising, and moderation.

ファストボード

Agoraのインタラクティブホワイトボードを最新のFastboard SDKと簡単に構築して統合できます。これにより、ビルド済みのUIとカスタムプラグインを含めることができるため、まったく同じホワイトボード機能がすべて提供されます。

Request more information

Connect with our experts to answer your questions, discuss requirements, and provide more detail on the ConvoAI Device Kit

FIRST NAME:*

LAST NAME:*

EMAIL ADDRESS:*

COMPANY:*

COUNTRY:*

Thank you for your request. A member of our team will be in touch shortly.

FAQ（よくある質問）

Agoraの会話型AIエンジンは他の音声AIソリューションとどう違いますか？

アゴラは、低遅延の応答とリアルタイムの割り込み処理により、より自然な音声対話を可能にします。また、内蔵のバックグラウンドノイズ抑制、エコーキャンセル、選択的注意ロックにより、どのような環境でもAIがユーザーの声を明確に認識できます。さらに、アゴラのグローバルリアルタイムネットワークにより、世界中どこでも安定した接続と高いパフォーマンスを提供します。

Agoraの会話型AIエンジンで接続可能なLLMは？

アゴラの会話型AIエンジンには、OpenAI互換のLLMを接続できます。具体的には、OpenAIのGPTモデル、Google Gemini、DeepSeek、およびOpenAI互換のカスタムモデルが利用可能です。さらに、今後追加のLLMのサポートも予定されています。

音声AIエージェントの導入に必要な技術は？

音声AIエージェントを実装するには、LLM（大規模言語モデル）とテキスト読み上げ（TTS：Text-to-Speech）サービスをアゴラの会話型AIエンジンに接続する必要があります。これにより、LLMや音声を自由にカスタマイズし、最適な音声AI体験を提供できます。

「カスケードモデル」とは？

カスケードモデルは、

音声→文字変換（STT）

LLMによる処理

文字→音声変換（TTS）
の順でAI応答を生成するプロセスを指します。

Agoraの会話型AIエンジンでLLMを作成できますか？

いいえ、本エンジンは既存のLLMとの音声対話を可能にするものであり、 LLMの作成やトレーニング機能はありません。

TEN

App Builder

フレキシブルクラスルーム

SDK をダウンロード

サポートプランと価格

Real-Time Speech to Text

Real-Time Speech to Text

Features

あなたのビジョン、制限なし。

OpenAI のリアルタイム API の実際の動作をご覧ください

Instantly transcribe speech to text for live audio and video

Reduce cost and increase efficiency

Reduce cost and increase efficiency

Get the most accurate results at scale

Get the most accurate results at scale

Integrate with ease

Integrate with ease

以下のレコーディングオプション:

アゴラ・メディア・サービス

Made for developers

クイックスタートガイド

Made for developers

アゴラ SDK

アゴラ SDK

アプリビルダー

アゴラ UI キット

アゴラ SDK

アゴラ UI キット

ドキュメンテーション

アゴラ SDK

アプリビルダー

アゴラ SDK

ファストボード

Transcribe speech to text for any real-time application

ファストボード

FAQ（よくある質問）

Get started with 300 free minutes

私たちに話してください

デベロッパーリソース