←返回资讯列表
tau2-bench - Python项目推荐
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
GitHub Trending··1 分钟阅读
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
项目统计:⭐ 634 stars | 🍴 152 forks📈 今日新增 5 stars
编程语言:Python
标签:ai, benchmark, conversational-agents, language-model-agent, llm
开源协议:MIT
GitHub 链接:https://github.com/sierra-research/tau2-bench 项目主页:https://arxiv.org/abs/2506.07982
#AI#科技#资讯
分享: