AI Agent with DeepSeek-R1, AutoGen and MAX Recipe

Recipes

Code / recipes / AI Agent with DeepSeek-R1, AutoGen and MAX

Help us improve and tell us what you’d like us to build next.

Request a recipe topic

README

This recipe demonstrates how to build AI agents using:

DeepSeek-R1-Distill-Qwen-7B model that runs on GPU
AutoGen framework for multi-agent conversations
MAX for efficient model serving and inference
Rich Python library for beautiful terminal interfaces

We'll create two example applications that showcase:

A conversational AI assistant with thought process visibility leveraging DeepSeek thinking process
A collaborative screenplay development system with multiple specialized agents

The patterns demonstrated here can be adapted for various agent-based applications like:

Customer service automation
Educational tutoring systems
Creative writing assistants
Technical support agents
Research assistants

Requirements

Please make sure your system meets our system requirements.

To proceed, ensure you have the pixi CLI installed:

curl -fsSL https://pixi.sh/install.sh | sh

...and updated to the latest version:

pixi self-update

GPU requirements

Important: GPU requirements

This recipe requires a GPU with CUDA 12.5 support. Recommended GPUs:

NVIDIA H100 / H200, A100, A40, L40

Quick start

Download the code for this recipe:

git clone https://github.com/modularml/max-recipes.git
cd max-recipes/deepseek-qwen-autogen-agent

Run the MAX server via in a terminal:

Make sure the port 8010 is available. You can adjust the port settings in pyproject.toml.
```
pixi run server
```

In a new terminal, run either example:

For the chat agent:
```
pixi run chat_agent
```

For the screenplay development team:

pixi run screenplay_agents

The agents will be ready when you see the welcome message in your terminal.

Quick demos

Chat agent with visible thinking

Demo shows:

Starting a conversation with the AI
AI's thinking process displayed in yellow panels
Final responses in green panels
Multiple turns of natural conversation
Example of complex reasoning task

Screenplay development team

Demo shows:

User providing initial scene idea
Screenwriter creating first draft
Story Critic analyzing and improving
Dialogue Expert polishing the scene
Full collaborative workflow between agents

Technical deep dive

Single agent chat implementation

1. Agent configuration

client = OpenAIChatCompletionClient(
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
    base_url=LLM_SERVER_URL,
    api_key=LLM_API_KEY,
    model_info={
        "name": "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
        "family": "deepseek",
        "pricing": {"prompt": 0.0, "completion": 0.0},
    },
    temperature=0.7,  # Adjust for more/less creative responses
    max_tokens=4096,  # Adjust for longer conversation
)

assistant = AssistantAgent(
    name="assistant",
    model_client=client,
    system_message="You are a helpful assistant.",
)

Key features:

Uses DeepSeek-R1 model through MAX
Configurable temperature for response creativity
Adjust max_tokens for longer conversations

2. Conversation management

# Keep track of conversation history
conversation_history = []

# Add user message to history
conversation_history.append(TextMessage(content=user_input, source="user"))

# Get AI response with full context
response = await assistant.on_messages(
    conversation_history,  # Pass entire history
    CancellationToken(),
)
conversation_history.append(response.chat_message)

Benefits:

Maintains context across multiple turns
Enables coherent multi-turn conversations
Preserves conversation state

3. Thinking process visibility

The agent separates its thinking from its final response:

content = response.chat_message.content
parts = content.split('')
if len(parts) > 1:
    thinking = parts[0].strip()
    final_answer = parts[1].strip()
else:
    thinking = "No explicit thinking process shown"
    final_answer = content.strip()

This allows users to see:

The reasoning process
Considered alternatives
Decision-making steps

4. Rich terminal interface

The chat interface uses Rich for enhanced visualization:

console.print(Panel(
    Markdown(thinking),
    border_style="yellow",
    title="[bold]💭 Thinking Process[/bold]",
    title_align="left"
))

console.print(Panel(
    Markdown(final_answer),
    border_style="green",
    title="[bold]🤖 Response[/bold]",
    title_align="left"
))

Features:

Color-coded panels for different content types
Markdown rendering for formatted text
Clear separation of thinking and responses
Status indicators during processing

Multi-agent screenplay development

The screenplay development system uses multiple specialized agents working together through round-robin conversation. Here's a detailed look at the implementation:

1. Specialized agent definitions

screenwriter = AssistantAgent(
    name="screenwriter",
    system_message="""You are an experienced screenwriter who creates engaging movie scenes and dialogue.
First think about the scene carefully. Consider:
- Setting and atmosphere
- Character development
- Plot progression
Then write a brief but vivid scene with clear stage directions and dialogue.""",
    model_client=client,
)

story_critic = AssistantAgent(
    name="story_critic",
    system_message="""You are a story development expert. Review the screenwriter's scene and think about:
- Plot coherence and dramatic tension
- Character motivations and arcs
- Theme development
Then provide:
1. An improved version of the scene
2. A list of specific improvements made and why they work better""",
    model_client=client,
)

dialogue_expert = AssistantAgent(
    name="dialogue_expert",
    system_message="""You are a dialogue specialist. Review both the original and improved scenes, then think about:
- Character voice authenticity
- Subtext and emotional depth
- Natural flow and rhythm
Then provide:
1. A final version with enhanced dialogue
2. A list of specific dialogue improvements made and their impact on the scene""",
    model_client=client,
)

Each agent has a specific role:

Screenwriter: Creates initial scene drafts
Story Critic: Improves plot and structure
Dialogue Expert: Enhances character voices and interactions

2. Team coordination

agent_team = RoundRobinGroupChat(
    agents=[screenwriter, story_critic, dialogue_expert],
    max_turns=3,  # One turn for each agent
    speaker_selection_method="round_robin"
)

The round-robin chat ensures:

Ordered turn-taking between agents
Complete review cycle for each scene
Collaborative improvement process

3. Message processing and display

async for message in stream:
    if hasattr(message, 'content') and hasattr(message, 'source'):
        content = message.content
        source = message.source

        # Split thinking and contribution
        parts = content.split('')
        if len(parts) > 1:
            thinking = parts[0].strip()
            final_answer = parts[1].strip()
        else:
            thinking = "No explicit thinking process shown"
            final_answer = content.strip()

        # Display each agent's contribution
        console.print(Panel(
            Markdown(thinking),
            border_style="yellow",
            title=f"[bold]💭 {source} Thinking[/bold]",
            title_align="left"
        ))

        console.print(Panel(
            Markdown(final_answer),
            border_style="green",
            title=f"[bold]🎬 {source} Contribution[/bold]",
            title_align="left"
        ))

Features:

Real-time display of each agent's process
Clear attribution of contributions
Separation of thinking and final output
Rich formatting for readability

Example workflow

Initial Scene Creation

User: Write a scene about a reunion between old friends
Screenwriter: [Analyzes setting and characters]
Screenwriter: [Creates initial scene with dialogue]

Story Review and Enhancement

Story Critic: [Evaluates dramatic tension]
Story Critic: [Suggests structural improvements]

Dialogue Polish

Dialogue Expert: [Analyzes character voices]
Dialogue Expert: [Enhances dialogue authenticity]

Customization options

You can adapt the screenplay system for different creative tasks:

Add Specialized Agents

technical_advisor = AssistantAgent(
    name="technical_advisor",
    system_message="""You ensure accuracy in specialized scenes...""",
    model_client=client
)

Modify Review Cycles

agent_team = RoundRobinGroupChat(
    agents=[...],
    max_turns=4,  # Add more revision cycles
)

Enhance Agent Specialties

system_message="""You are a genre specialist focusing on:
- Genre conventions
- Typical plot structures
- Character archetypes
..."""

Customizing for your use case

This recipe can be adapted for various applications:

Educational Tutoring
- Add agents for different subjects
- Implement knowledge assessment
- Create personalized learning paths
Technical Support
- Add agents for different technical domains
- Implement troubleshooting workflows
- Create solution verification steps
Research Assistant
- Add agents for literature review
- Implement data analysis
- Create report generation

Troubleshooting

Common issues and solutions:

Server Connection Issues
- Ensure MAX is running (pixi run server)
- Check if the default port 8010 is available
- Verify network connectivity
Agent Response Issues
- Check system messages for clarity
- Adjust max_turns for multi-agent scenarios
- Verify conversation history handling
Performance Issues
- Monitor GPU memory usage
- Adjust batch sizes if needed
- Consider reducing conversation history length

Conclusion

This recipe demonstrates how to:

Build single and multi-agent systems with AutoGen
Use DeepSeek-R1 and examine its thinking process
Create beautiful terminal interfaces with Rich Python library
Implement robust error handling
Enable collaborative agent interactions

The patterns shown here provide a foundation for building your own agent-based applications.

Next steps

Deploy your agents on AWS, GCP or Azure
Explore MAX documentation for more features
Join our Modular Forum and Discord community

Share your agent projects with us using #ModularAI on social media!

DETAILS

THE CODE

deepseek-qwen-autogen-agent

AUTHOR

Ehsan M. Kermani

AVAILABLE TASKS

pixi run server

pixi run chat_agent

pixi run screenplay_agents

PROBLEMS WITH THE CODE?

File an Issue