@Kolleida Kolleida commented Dec 27, 2025

Overview

  • Agentbeats green agent end-to-end demo at app-malt/green_agents/. Details can be found here.
  • MALT app now calls LLM endpoints with HTTP via A2A -> supports querying many more LLMs without explicitly coding a new agent.
  • New package (netarena) for code shared across apps.
    • Includes the agent querying code.
  • MALT app now async -> needed to support A2A + enables evaluating multiple agents concurrently
  • Use uv for dependency management.
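As a rough illustration of why the async conversion enables concurrent evaluation, here is a minimal sketch. `query_agent` and `evaluate_agents` are hypothetical names, not functions from this PR, and the network round trip is simulated with a short sleep instead of a real A2A call.

```python
import asyncio


async def query_agent(agent_url: str, prompt: str) -> dict:
    """Hypothetical stand-in for one A2A round trip (no real HTTP is performed)."""
    await asyncio.sleep(0.01)  # simulated network latency
    return {"agent": agent_url, "prompt": prompt, "answer": f"echo: {prompt}"}


async def evaluate_agents(agent_urls: list[str], prompt: str) -> list[dict]:
    # gather() schedules all coroutines concurrently and returns their
    # results in the same order as the inputs.
    return await asyncio.gather(*(query_agent(url, prompt) for url in agent_urls))


if __name__ == "__main__":
    results = asyncio.run(evaluate_agents(["http://agent-a", "http://agent-b"], "ping"))
    for result in results:
        print(result["agent"], "->", result["answer"])
```

With blocking calls the total wait would be roughly the sum of the per-agent latencies; with `asyncio.gather` it is closer to the slowest single agent.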

Kolleida and others added 17 commits December 23, 2025 17:04
- Query A2A endpoint for LLM answer instead of using dedicated LLM agent
classes (convert main to async).
code for querying into separate file (for reuse later). Support parsing
both synchronous message and task objects (defaults to last text
artifact, then last text message).
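The fallback order described above (last text artifact, then last text message) could look roughly like the sketch below. It operates on plain dictionaries, and the field names (`kind`, `parts`, `artifacts`, `history`) are assumptions about the JSON shape of A2A message and task objects; `extract_answer` is a hypothetical helper, not the function added in this PR.

```python
from typing import Optional


def _last_text(parts: list) -> Optional[str]:
    """Return the text of the last text part in a list of parts, if any."""
    for part in reversed(parts):
        if part.get("kind") == "text" and part.get("text"):
            return part["text"]
    return None


def extract_answer(result: dict) -> Optional[str]:
    """Pull an answer string out of a message-like or task-like dict."""
    # A direct (synchronous) message carries its parts at the top level.
    if result.get("kind") == "message":
        return _last_text(result.get("parts", []))

    # For a task, prefer the most recent artifact that contains text...
    for artifact in reversed(result.get("artifacts") or []):
        text = _last_text(artifact.get("parts", []))
        if text is not None:
            return text

    # ...and fall back to the most recent message that contains text.
    for message in reversed(result.get("history") or []):
        text = _last_text(message.get("parts", []))
        if text is not None:
            return text
    return None
```

Returning `None` when no text is found leaves it to the caller to decide whether a missing answer counts as a failed query.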
  (app-malt) -> now an async generator that yields the eval result
objects (JSON)
   - No longer writes to file by default.
   - Makes it easier to process results for serving in web server.
- Utils for creating the QA prompts (no Langchain) -> could be reused
  elsewhere.
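The async generator behavior mentioned above (yielding eval result objects, with file output no longer the default) might be sketched as follows. `run_evaluation` and its `output_path` parameter are hypothetical, and the result dictionaries are placeholders, not the actual MALT result schema.

```python
import asyncio
import json
from pathlib import Path
from typing import AsyncIterator, Optional


async def run_evaluation(
    queries: list[str], output_path: Optional[Path] = None
) -> AsyncIterator[dict]:
    """Yield one JSON-serializable result per query; write to disk only on request."""
    for index, query in enumerate(queries):
        await asyncio.sleep(0)  # placeholder for the awaited agent call
        result = {"index": index, "query": query, "status": "placeholder"}
        if output_path is not None:
            # Opt-in persistence: append one JSON object per line.
            with output_path.open("a", encoding="utf-8") as handle:
                handle.write(json.dumps(result) + "\n")
        yield result


async def collect(queries: list[str]) -> list[dict]:
    # A web server could instead forward each result as soon as it is yielded.
    return [result async for result in run_evaluation(queries)]


if __name__ == "__main__":
    print(asyncio.run(collect(["q1", "q2"])))
```

Because results arrive one at a time, a consumer can stream them to a client without waiting for the whole benchmark to finish.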
…ver endpoint. Can handle both regular and streaming responses (with text and JSON structured output).
  fail).
- Each query evaluation result now also includes agent info, which lays the
  groundwork for evaluating multiple agents at once.
- Support for passing HTTP kwargs (e.g. headers) when calling each
  agent.
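One way to picture the per-agent HTTP kwargs and the agent info attached to each result is the sketch below. `AgentConfig`, `build_request`, and `tag_result` are hypothetical names, the default timeout is an arbitrary illustrative value, and no specific HTTP library is assumed; the function only assembles the keyword arguments that a client call would receive.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class AgentConfig:
    """Hypothetical per-agent settings, including extra HTTP keyword arguments."""
    name: str
    url: str
    http_kwargs: dict[str, Any] = field(default_factory=dict)


def build_request(agent: AgentConfig, payload: dict) -> dict:
    """Assemble keyword arguments for an HTTP POST; per-agent kwargs override defaults."""
    request = {"url": agent.url, "json": payload, "timeout": 30.0}
    request.update(agent.http_kwargs)  # e.g. headers or a custom timeout
    return request


def tag_result(agent: AgentConfig, result: dict) -> dict:
    """Attach agent info so results from several agents can be told apart."""
    return {**result, "agent": {"name": agent.name, "url": agent.url}}


if __name__ == "__main__":
    agent = AgentConfig("demo", "http://localhost:9000",
                        {"headers": {"Authorization": "Bearer <token>"}})
    print(build_request(agent, {"question": "example"}))
    print(tag_result(agent, {"correct": True}))
```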
code was using a raw relative path to find graph topology data, causing
errors when scripts were not executed from the app-malt directory.
- Rename agent_utils.py -> agent_client.py and move to netarena.
- App-malt now skips queries when an agent cannot respond due to network
  errors.
- Move old LLM calling and prompting code to separate folder.
- .gitignore should now properly ignore misc folders outside repo root.
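The skip-on-network-error behavior noted in the list above could be approximated as in this sketch. `evaluate` and the `ask` callable are hypothetical, and the set of exceptions treated as network errors is an assumption; the real client may raise library-specific errors.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable


async def evaluate(
    queries: list[str], ask: Callable[[str], Awaitable[str]]
) -> AsyncIterator[dict]:
    """Yield a result per answered query, skipping queries that hit network errors."""
    for query in queries:
        try:
            answer = await ask(query)
        except (OSError, asyncio.TimeoutError) as exc:
            # ConnectionError is a subclass of OSError, so refused or reset
            # connections are caught here along with timeouts.
            print(f"skipping {query!r}: {exc}")
            continue
        yield {"query": query, "answer": answer}
```

Skipping instead of aborting keeps one unreachable agent from discarding the results already collected for the rest of the run.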
  results as eval artifacts (MALT agent).
- Agent endpoints no longer need to be specified separately in scenario.toml
  (listing them in participants is enough).
- Add card-url option in compliance with Agentbeats format.
… truth code execution, improving isolation and preventing namespace pollution.
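The commit text above is truncated, so the following is only a generic sketch of running code in an isolated namespace, not necessarily the approach taken in this PR. `run_isolated` is a hypothetical helper; it gives each run a fresh dictionary as its global scope so that names defined by one run do not leak into the caller or into later runs.

```python
def run_isolated(source: str, inputs: dict) -> dict:
    """Execute source in a fresh namespace and return the names it defined."""
    namespace = dict(inputs)  # copy so the caller's dict is never mutated
    exec(source, namespace)   # exec adds __builtins__ to the namespace itself
    return {key: value for key, value in namespace.items() if key != "__builtins__"}


if __name__ == "__main__":
    first = run_isolated("result = x + 1", {"x": 1})
    print(first["result"])  # the value computed inside the isolated run

    # A second run starts clean: names from the first run are not visible.
    second = run_isolated("seen = 'result' in globals()", {})
    print(second["seen"])
```

This isolates names only. It is not a security sandbox, since the executed code still has access to builtins and imports.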

Update test data with more diverse queries.