feat: force-think mode for fusion models such as deepseek-v3.1 qwen3#358
feat: force-think mode for fusion models such as deepseek-v3.1 qwen3#358Robin7831 wants to merge 2 commits intolabring:mainfrom
Conversation
|
dingyi seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it. |
|
dingyi seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it. |
|
是分离think吗,可以看看已实现的 thinksplit 插件 https://github.com/labring/aiproxy/tree/main/core/relay/plugin/thinksplit |
|
|
控制思考使用UnmarshalGeneralThinkingFromNode,部分adaptor已经实现了某种通用参数转模型的思考参数。比如ali的qwen |
支持使用vllm等框架部署的思考-非思考融合模型在思考模式下的规范化输出