Tags
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests (paper rewrite edition) | 2026-03-01 | Competitive Programming, Synthetic Data, SFT-then-RL, Dual-Verification, Code LLM
Scaling Agentic Verifier for Competitive Coding (paper rewrite edition) | 2026-03-01 | Competitive Programming, Verifier, Agent, Test-time Scaling, Code LLM
EvoCodeBench: A Human-Aligned Evaluation Benchmark for "Inference-Time Self-Evolving" Code Systems | 2026-03-01 | Competitive Programming, Benchmark, Self-Evolving Agent, Multilingual, Human-Referenced Metrics
CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions | 2026-03-01 | Competitive Programming, Adversarial Testing, Benchmark, LLM, RL