app-malt: A2A Compatibility #6

Kolleida · 2025-12-27T04:41:58Z

Overview

- Agentbeats green agent end-to-end demo at app-malt/green_agents/. Details can be found here.
- The MALT app now calls LLM endpoints over HTTP via A2A, which supports querying many more LLMs without explicitly coding a new agent.
- New package (netarena) for code shared across apps.
  - Includes the agent querying code.
- The MALT app is now async, which is needed to support A2A and enables evaluating multiple agents concurrently.
- Use uv for dependency management.
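
As a rough illustration of why the async conversion helps with multiple agents, here is a minimal sketch using `asyncio.gather`. The `ask` coroutine is a hypothetical stand-in for a real A2A HTTP call, and the agent names are made up for the example.

```python
import asyncio


async def ask(agent: str, query: str) -> str:
    # Hypothetical stand-in for an A2A request; the sleep mimics network latency.
    await asyncio.sleep(0.01)
    return f"{agent}: {query}"


async def ask_all(agents: list[str], query: str) -> list[str]:
    # gather runs the coroutines concurrently and returns results
    # in the same order as the arguments it was given.
    return await asyncio.gather(*(ask(agent, query) for agent in agents))


answers = asyncio.run(ask_all(["agent-a", "agent-b"], "list nodes"))
print(answers)
```

With a synchronous loop the two calls would run back to back; here they overlap while each waits on I/O.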

(app-malt) -> now an async generator that yields the eval result objects (JSON).
- Does not write to file by default anymore.
- Makes it easier to process results for serving in web server.
- Utils for creating the QA prompts (no Langchain) -> could be reused elsewhere.
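
The async-generator shape described above could look roughly like the sketch below. The names `evaluate_queries`, `fake_agent`, and `collect` are hypothetical and not taken from this PR; the real loop would query an A2A endpoint rather than a stub.

```python
import asyncio
import json
from typing import AsyncIterator, Awaitable, Callable


async def evaluate_queries(
    queries: list[str],
    ask_agent: Callable[[str], Awaitable[str]],
) -> AsyncIterator[dict]:
    """Yield one JSON-serializable result per query instead of writing to a file."""
    for query in queries:
        answer = await ask_agent(query)
        yield {"query": query, "answer": answer}


async def fake_agent(query: str) -> str:
    # Stand-in for a real A2A call (assumption for this sketch).
    await asyncio.sleep(0)
    return query.upper()


async def collect(queries: list[str]) -> list[dict]:
    # The consumer decides what to do with each result: stream, store, or serve.
    return [result async for result in evaluate_queries(queries, fake_agent)]


results = asyncio.run(collect(["list nodes", "count links"]))
print(json.dumps(results))
```

Because results are yielded one at a time, a web server can forward each one as it arrives, and writing to a file becomes an opt-in step for the caller.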

Kolleida and others added 17 commits December 23, 2025 17:04

- Use structured config object for app-malt main loop.
- Query A2A endpoint for LLM answer instead of using dedicated LLM agent classes (convert main to async).

e2db21b
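
For readers unfamiliar with the pattern, a structured config object can be as small as a frozen dataclass. This is only a sketch: the class name `EvalConfig` and every field in it are hypothetical and do not reflect the actual config used by app-malt.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class EvalConfig:
    """Hypothetical settings bundle passed to the main evaluation loop."""

    agent_urls: list[str]
    num_queries: int = 10
    http_kwargs: dict = field(default_factory=dict)


config = EvalConfig(agent_urls=["http://localhost:9009"])
print(config)
```

Passing one object instead of many loose arguments keeps the async entry point's signature stable as options are added.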

Preliminary refactor to support querying various server endpoints. Move code for querying into separate file (for reuse later). Support parsing both synchronous message and task objects (defaults to last text artifact, then last text message).

e563ab6
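
The fallback order described in this commit (last text artifact, then last text message) can be sketched with plain dictionaries. The dict shapes below are a simplified assumption loosely modeled on A2A message and task JSON; the helper names `last_text` and `extract_answer` are hypothetical and this is not the code from the PR.

```python
from typing import Optional


def last_text(holders: list[dict]) -> Optional[str]:
    """Return the last text part found across a list of artifacts or messages."""
    for holder in reversed(holders):
        for part in reversed(holder.get("parts", [])):
            if part.get("kind") == "text":
                return part["text"]
    return None


def extract_answer(response: dict) -> Optional[str]:
    """Prefer the last text artifact, then fall back to the last text message."""
    if response.get("kind") == "message":
        return last_text([response])
    answer = last_text(response.get("artifacts", []))
    if answer is None:
        answer = last_text(response.get("history", []))
    return answer


task = {
    "kind": "task",
    "artifacts": [{"parts": [{"kind": "text", "text": "final answer"}]}],
    "history": [{"parts": [{"kind": "text", "text": "working on it"}]}],
}
print(extract_answer(task))
```

A direct message response is handled by the first branch, so callers do not need to know which of the two response types the agent chose to return.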

Change agent_utils to use modern version of A2AClient to ping A2A server endpoint. Can handle both regular and streaming responses (with text and JSON structured output).

10a48da

Rename AgentServer (and related objects) to AgentClient.

aa8f407

- Evaluation loop now skips agents if it cannot connect (before would fail).
- Each query evaluation result now also includes agent info -> lay groundwork for multiple agent evaluation at once.
- Support for passing HTTP kwargs (e.g. headers) when calling each agent.

c6822fa
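
The three behaviours in this commit (skip unreachable agents, tag each result with agent info, forward HTTP kwargs) fit together roughly as follows. This is a sketch under assumptions: `evaluate_agents`, `reachable`, and `unreachable` are hypothetical names, and the stubs replace real network calls.

```python
import asyncio
from typing import Any, Awaitable, Callable

QueryFn = Callable[..., Awaitable[str]]


async def evaluate_agents(
    agents: dict[str, QueryFn],
    query: str,
    **http_kwargs: Any,
) -> list[dict]:
    """Query every agent, skipping those that cannot be reached."""
    results = []
    for name, ask in agents.items():
        try:
            answer = await ask(query, **http_kwargs)
        except ConnectionError as exc:
            # Skip this agent instead of aborting the whole evaluation.
            print(f"skipping {name}: {exc}")
            continue
        results.append({"agent": name, "query": query, "answer": answer})
    return results


async def reachable(query: str, **http_kwargs: Any) -> str:
    # Stub agent: reports how many headers were forwarded to it.
    return f"ok ({len(http_kwargs.get('headers', {}))} header(s))"


async def unreachable(query: str, **http_kwargs: Any) -> str:
    # Stub agent that simulates a refused connection.
    raise ConnectionError("connection refused")


results = asyncio.run(
    evaluate_agents(
        {"agent-a": reachable, "agent-b": unreachable},
        "list nodes",
        headers={"Authorization": "Bearer <token>"},
    )
)
print(results)
```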

Demo green agent based on Agentbeats tutorial.

e5ada58

Shell script to illustrate how to run green agent demo. Fix issue where code was using raw relative path to find graph topology data, causing errors when scripts not executed from app-malt directory.

e141512
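
A common way to avoid this class of bug is to resolve data files against the module's own location rather than the current working directory. The sketch below shows that idea; `resolve_data_path` and the example paths are hypothetical, and the PR's actual fix may differ.

```python
from pathlib import Path


def resolve_data_path(module_file: str, relative: str) -> Path:
    """Resolve `relative` against the directory containing `module_file`."""
    return (Path(module_file).resolve().parent / relative).resolve()


# In real code the first argument would typically be __file__.
path = resolve_data_path("/tmp/app-malt/main.py", "data/graph.json")
print(path)
```

The result no longer depends on where the script is launched from, only on where the module lives.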

Dependencies with uv.

5f37e65

Command line args for malt_agent to specify host and port to expose on.

8b0cd65
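
A minimal sketch of host and port arguments with `argparse` is shown below. The flag names `--host` and `--port` and the default values are assumptions for illustration; the commit does not spell out the actual names used by malt_agent.

```python
import argparse
from typing import Optional, Sequence


def parse_args(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    parser = argparse.ArgumentParser(description="Serve an agent over HTTP.")
    parser.add_argument("--host", default="127.0.0.1", help="Interface to bind to.")
    # type=int makes argparse reject non-numeric ports before the server starts.
    parser.add_argument("--port", type=int, default=9009, help="Port to listen on.")
    return parser.parse_args(argv)


args = parse_args(["--host", "0.0.0.0", "--port", "8080"])
print(args.host, args.port)
```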

Fix typo when passing port in malt_agent.py example.

b49e1b4

- Define netarena python package to put shared code across apps.
- Rename agent_utils.py -> agent_client.py and move to netarena.
- App-malt now skips queries when agent cannot respond due to network errors.

66a1b1b

- Remove unused cmd line args related to specifying model type.
- Move old LLM calling and prompting code to separate folder.
- .gitignore should now properly ignore misc folders outside repo root.

94b7024

Remove unused arg in sample scenario (green agent).

9d0f540

Fix issue where query latency was not being calculated correctly.

99b958b
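
The commit does not say what was wrong with the latency calculation. For reference, one common way to time an awaited call is to wrap it with a monotonic clock, as in this sketch; `timed` and `slow_answer` are hypothetical names.

```python
import asyncio
import time


async def timed(awaitable):
    """Await `awaitable` and return (result, elapsed_seconds)."""
    start = time.perf_counter()  # monotonic, unaffected by wall-clock changes
    result = await awaitable
    return result, time.perf_counter() - start


async def slow_answer() -> str:
    # Stand-in for an agent query (assumption for this sketch).
    await asyncio.sleep(0.05)
    return "done"


answer, latency = asyncio.run(timed(slow_answer()))
print(f"{answer} in {latency:.3f}s")
```

Starting the clock immediately before the `await` and stopping it immediately after keeps prompt construction and result parsing out of the measurement.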

- Include final average correctness, safety, and latency results as eval artifacts (MALT agent).
- Now do not need to specify agent endpoints separately in scenario.toml (just listing them in participants is good enough).
- Add card-url option in compliance with Agentbeats format.

0b72155
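
Aggregating per-query results into final averages could be done along these lines. The field names `correctness`, `safety`, and `latency` and the function `summarize` are assumptions for this sketch, not the keys used in the actual eval artifacts.

```python
from statistics import mean

METRICS = ("correctness", "safety", "latency")


def summarize(results: list[dict]) -> dict:
    """Average each metric over per-query results; booleans count as 0 or 1."""
    if not results:
        return {key: 0.0 for key in METRICS}
    return {key: mean(float(r[key]) for r in results) for key in METRICS}


summary = summarize(
    [
        {"correctness": True, "safety": True, "latency": 1.0},
        {"correctness": False, "safety": True, "latency": 3.0},
    ]
)
print(summary)
```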

Enhance malt_env to use shared execution namespace for LLM and ground truth code execution, improving isolation and preventing namespace pollution. Update test data with more diverse queries.

227f13f
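
How malt_env implements this is not shown here. One possible reading is that both the LLM code and the ground truth code are executed in a dedicated dictionary built from a common seed instead of the module's globals. The sketch below illustrates that idea; `run_in_namespace` and the sample snippets are hypothetical.

```python
def run_in_namespace(source: str, seed: dict) -> dict:
    """Execute `source` in a fresh dict used as both globals and locals."""
    namespace = dict(seed)  # copy so the caller's dict is never mutated
    exec(source, namespace)
    return namespace


graph = {"nodes": ["a", "b", "c"]}
seed = {"graph": graph}

llm_code = "def count():\n    return len(graph['nodes'])\nresult = count()"
truth_code = "result = 3"

llm_ns = run_in_namespace(llm_code, seed)
truth_ns = run_in_namespace(truth_code, seed)
print(llm_ns["result"] == truth_ns["result"])
```

Passing a single dict to `exec` lets functions defined in the executed code see its other top-level names, while names such as `count` stay out of the host module and out of the other run's namespace.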
