Zijian Zeng

[2] viXra:2604.0059 replaced on 2026-04-26 07:19:04 (75 unique-IP downloads)

Reducing Credit Assignment Variance via Counterfactual Reasoning Paths

Authors: Fei Ding, Yongkang Zhang, Yeling Peng, Youwei Wang, Guoxiong Zhou, Zijian Zeng
Category: Artificial Intelligence

[1] viXra:2604.0058 submitted on 2026-04-15 20:10:37 (44 unique-IP downloads)

Design Conditions for Intra-Group Learning of Sequence-Level Rewards: Token Gradient Cancellation

Authors: Fei Ding, Yongkang Zhang, Youwei Wang, Zijian Zeng
Category: Artificial Intelligence