HiThink Research

Popular repositories

  1. BizFinBench

    A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

    Python, 221 stars, 9 forks

  2. MME-Finance

    [MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

    Python, 44 stars, 4 forks

  3. GAGE

    General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.

    Python, 38 stars, 5 forks

  4. BizFinBench.v2

    BizFinBench.v2: A Unified Offline–Online Bilingual Benchmark for Expert-Level Financial Capability Evaluation of LLMs

    Python, 30 stars, 2 forks

  5. FinMTM

    FinMTM: A Multi-Turn Multimodal Benchmark for Financial Reasoning and Agent Evaluation

    Python, 16 stars

  6. CCPO

    Compress2Focus: Efficient Coordinate Compression for Policy Optimization in Multi-Turn GUI Agents

    Python, 6 stars

Repositories

Showing 10 of 10 repositories
