Google ADK (Agent Development Kit)

This guide shows how to integrate the AgenTao SDK with the Google Agent Development Kit (ADK) for complex AI agent behavior, including multi-agent orchestration, long-term memory, and structured tools.

Overview

The integration bridges AgenTao’s bidirectional audio stream with ADK’s Runner and LiveRequestQueue:

Caller audio -> ADK: Incoming phone audio is sent to the LiveRequestQueue as real-time input
ADK audio -> Caller: ADK events containing model audio are forwarded to the caller via send_audio()

Full Example

1 import asyncio
2 import os
3 from google.adk.agents import Agent
4 from google.adk.runners import Runner
5 from google.adk.sessions import InMemorySessionService
6 from google.adk.agents.live_request_queue import LiveRequestQueue
7 from google.adk.agents.run_config import RunConfig, StreamingMode
8 from google.genai import types
9 from agentao_sdk import AgenTaoClient, AgenTaoClientConfig, ActiveCall
10 import agentao_sdk.events as events
11 
12 agent = Agent(
13     name="agentao_agent",
14     model="gemini-2.5-flash-native-audio-preview-12-2025",
15     instruction="You are a helpful AI assistant talking over a phone call.",
16 )
17 session_service = InMemorySessionService()
18 runner = Runner(
19     app_name="agentao_app", agent=agent, session_service=session_service
20 )
21 
22 async def start_adk_session(call: ActiveCall):
23     await call.answer()
24 
25     live_request_queue = LiveRequestQueue()
26 
27     async def stream_to_adk():
28         async for audio_chunk in call.audio_stream():
29             audio_blob = types.Blob(
30                 data=audio_chunk, mime_type="audio/pcm;rate=24000"
31             )
32             live_request_queue.send_realtime(audio_blob)
33 
34     async def receive_from_adk():
35         run_config = RunConfig(
36             streaming_mode=StreamingMode.BIDI,
37             response_modalities=["AUDIO"],
38         )
39         async for event in runner.run_live(
40             user_id="default_user",
41             session_id=call.call_id,
42             live_request_queue=live_request_queue,
43             run_config=run_config,
44         ):
45             if event.interrupted:
46                 await call.clear_send_audio_buffer()
47 
48             if event.content and event.content.parts[0].inline_data:
49                 await call.send_audio(
50                     event.content.parts[0].inline_data.data
51                 )
52 
53     await asyncio.gather(stream_to_adk(), receive_from_adk())
54 
55 async def main():
56     config = AgenTaoClientConfig.sandbox(
57         api_key=os.getenv("WSS_API_KEY"),
58         connector_uuid=os.getenv("WSS_CONNECTOR_UUID"),
59         sample_rate=24000,
60     )
61 
62     async with AgenTaoClient(config) as client:
63         @client.on(events.INCOMING_CALL)
64         async def handle_call(call: ActiveCall):
65             await start_adk_session(call)
66 
67         await client.run_forever()
68 
69 if __name__ == "__main__":
70     asyncio.run(main())

How It Works

ADK Components

Agent - Defines the AI model and system instruction
Runner - Orchestrates agent execution and session management
InMemorySessionService - Stores session state (swap for a persistent store in production)
LiveRequestQueue - Accepts real-time audio input and feeds it to the runner

Stream to ADK

The stream_to_adk() coroutine reads audio chunks from call.audio_stream(), wraps them as types.Blob objects, and sends them to the LiveRequestQueue using send_realtime().

Receive from ADK

The receive_from_adk() coroutine runs runner.run_live() with StreamingMode.BIDI and response_modalities=["AUDIO"]. For each event:

Interruption: When event.interrupted is True, clear_send_audio_buffer() is called to stop queued audio
Model audio: When event.content contains inline_data, the raw audio is forwarded to the caller

Session Management

Each call gets its own session, keyed by call.call_id. This allows ADK to maintain conversation context within a single call. For cross-call memory, use a persistent session service.

When to Use ADK vs. GenAI SDK

Capability	GenAI SDK	ADK
Simple voice AI	Yes	Yes
Multi-agent orchestration	No	Yes
Structured tool calling	Limited	Yes
Session/memory management	Manual	Built-in
Complex agent workflows	No	Yes

Use the GenAI SDK integration for straightforward voice AI. Use ADK when you need structured agents, tools, or multi-agent coordination.

Environment Variables

Variable	Description
`GOOGLE_API_KEY`	Google API key with Gemini API access
`WSS_API_KEY`	AgenTao API key
`WSS_CONNECTOR_UUID`	AgenTao connector UUID

Next Steps

Google GenAI Integration - Simpler integration for basic voice AI
Audio Streaming - Buffer management and interruption handling
Use Cases - Combine ADK with escalation patterns