arxiv:2603.14473
Ming Zhang
konglongge
·
AI & ML interests
LLMs
Recent Activity
liked a dataset 30 days ago
llmeval-fdu/LLMEval-Logic upvoted a paper about 1 month ago
LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening submitted a paper about 1 month ago
LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening