Artificial Intelligence Paper Rankings

AI-estimated scientific impact ranking of the latest arXiv Artificial Intelligence preprints.

200 papers (2208 total)
51350 matches
1

Simulating clinical interventions with a generative multimodal model of human physiology

Guy Lutsker, Gal Sapir +6

1719
107
100%
Apr 30, 2026
2

End-to-end autonomous scientific discovery on a real optical platform

Shuxing Yang, Fujia Chen +6

1691
89
95.5%
Apr 29, 2026
3

MIMIC: A Generative Multimodal Foundation Model for Biomolecules

Siavash Golkar, Jake Kovalic +6

1685
151
96%
Apr 27, 2026
4

Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims

Fan Ma, Yuntian Liu +6

1661
177
95.5%
May 4, 2026
5

Machine Collective Intelligence for Explainable Scientific Discovery

Gyoung S. Na, Chanyoung Park

1658
92
87%
Apr 30, 2026
6

AI scientists produce results without reasoning scientifically

Martiño Ríos-García, Nawaf Alampara +6

1653
227
90.3%
Apr 20, 2026
7

Towards a General Intelligence and Interface for Wearable Health Data

Girish Narayanswamy, Maxwell A. Xu +6

1643
43
79.1%
May 21, 2026
8

Generative structure search for efficient and diverse discovery of molecular and crystal structures

Yifang Qin, Yu Shi +4

1636
90
75.6%
Apr 30, 2026
9

A Collective Variational Principle Unifying Bayesian Inference, Game Theory, and Thermodynamics

Djamel Bouchaffra, Faycal Ykhlef +2

1629
79
78.5%
Apr 30, 2026
10

IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures

David Gringras

1624
279
85.7%
Apr 9, 2026
11

AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot

Joydeep Biswas, Sheila Schoepp +6

1622
240
84.2%
Apr 15, 2026
12

Hodoscope: Unsupervised Monitoring for AI Misbehaviors

Ziqian Zhong, Shashwat Saxena +1

1607
215
81.4%
Apr 13, 2026
13

Value-Conflict Diagnostics Reveal Widespread Alignment Faking in Language Models

Inderjeet Nair, Jie Ruan +1

v2
1605
59
76.3%
Apr 22, 2026
14

Emotion Concepts and their Function in a Large Language Model

Nicholas Sofroniew, Isaac Kauvar +6

1599
255
80%
Apr 9, 2026
15

SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment

Joseph Breda, Fadi Yousif +6

1597
42
71.4%
May 5, 2026
16

Unbiased Prevalence Estimation with Multicalibrated LLMs

Fridolin Linder, Thomas Leeper +4

1596
40
72.5%
Apr 23, 2026
17

Heterogeneous Scientific Foundation Model Collaboration

Zihao Li, Jiaru Zou +6

1593
41
68.3%
Apr 30, 2026
18

Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics

Moritz Firsching, Paul Lezeau +6

1592
34
73.5%
May 13, 2026
19

Epistemic Blinding: An Inference-Time Protocol for Auditing Prior Contamination in LLM-Assisted Analysis

Michael Cuccarese

1589
336
83%
Apr 7, 2026
20

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?

Mohamed Elfeki, Tu Trinh +6

1589
228
79.8%
Apr 10, 2026
21

Containment Verification: AI Safety Guarantees Independent of Alignment

Royce Moon, Lav R. Varshney

1589
25
92%
May 9, 2026
22

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

Zhiqin Yang, Yonggang Zhang +4

1589
34
61.8%
May 20, 2026
23

Subliminal Transfer of Unsafe Behaviors in AI Agent Distillation

Jacob Dang, Brian Y. Xie +1

1588
50
76%
Apr 16, 2026
24

Brief chatbot interactions produce lasting changes in human moral values

Yue Teng, Qianer Zhong +3

1587
50
74%
Apr 23, 2026
25

Bounding the Black Box: A Statistical Certification Framework for AI Risk Regulation

Natan Levy, Gadi Perl

1586
44
77.3%
Apr 23, 2026
26

The Power of Power Law: Asymmetry Enables Compositional Reasoning

Zixuan Wang, Xingyu Dang +2

1586
40
67.5%
Apr 24, 2026
27

Polysemantic Experts, Monosemantic Paths: Routing as Control in MoEs

Charles Ye, Bo Yuan +1

1585
91
75.8%
Apr 20, 2026
28

BioMiner: A Multi-modal System for Automated Mining of Protein-Ligand Bioactivity Data from Literature

Jiaxian Yan, Jintao Zhu +6

1585
66
75.8%
Apr 23, 2026
29

Auditable Agents

Yi Nian, Aojie Yuan +3

1583
153
76.5%
Apr 7, 2026
30

KISS - Knowledge Infrastructure for Scientific Simulation: A Scaffolding for Agentic Earth Science

Ziwei Li, Liujun Zhu +6

1582
22
90.9%
May 18, 2026
31

Using large language models for embodied planning introduces systematic safety risks

Tao Zhang, Kaixian Qu +5

v2
1581
44
70.5%
Apr 20, 2026
32

Context Over Content: Exposing Evaluation Faking in Automated Judges

Manan Gupta, Inderjeet Nair +2

1578
44
77.3%
Apr 16, 2026
33

Numerical Instability and Chaos: Quantifying the Unpredictability of Large Language Models

Chashi Mahiul Islam, Alan Villarreal +3

1578
83
75.9%
Apr 14, 2026
34

Prospective multi-pathogen disease forecasting using autonomous LLM-guided tree search

Sarah Martinson, Michael P. Brenner +4

1577
18
83.3%
May 15, 2026
35

Discovering Novel LLM Experts via Task-Capability Coevolution

Andrew Dai, Boris Meinardus +3

1576
54
72.2%
Apr 16, 2026
36

Towards Understanding Specification Gaming in Reasoning Models

Kei Nishimura-Gasparian, Robert McCarthy +1

1576
51
60.8%
May 4, 2026
37

The Accountability Horizon: An Impossibility Theorem for Governing Human-Agent Collectives

Haileleol Tibebu

1574
90
75.6%
Apr 9, 2026
38

How Independent are Large Language Models? A Statistical Framework for Auditing Behavioral Entanglement and Reweighting Verifier Ensembles

Chenchen Kuai, Jiwan Jiang +6

1571
109
75.2%
Apr 8, 2026
39

RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair

Jagadeesh Rachapudi, Pranav Singh +3

1571
62
75.8%
Apr 14, 2026
40

How LLMs Are Persuaded: A Few Attention Heads, Rerouted

Xiangkun Sun, Lingkai Kong +3

1571
25
88%
May 10, 2026
41

Process Reward Agents for Steering Knowledge-Intensive Reasoning

Jiwoong Sohn, Tomasz Sternal +3

1571
78
76.9%
Apr 10, 2026
42

Towards Faster Language Model Inference Using Mixture-of-Experts Flow Matching

Aihua Li

1571
70
75.7%
Apr 16, 2026
43

Participatory provenance as representational auditing for AI-mediated public consultation

Sachit Mahajan

1571
42
66.7%
Apr 22, 2026
44

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Meng Chu, Xuan Billy Zhang +6

1570
38
76.3%
Apr 24, 2026
45

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Haozhe Wang, Cong Wei +4

1569
65
76.9%
Apr 13, 2026
46

Conditional Attribute Estimation with Autoregressive Sequence Models

Erica Stutz, Giacomo Marino +3

1569
25
92%
May 13, 2026
47

Detecting Safety Violations Across Many Agent Traces

Adam Stein, Davis Brown +3

1567
86
76.7%
Apr 13, 2026
48

OLLM: Options-based Large Language Models

Shashank Sharma, Janina Hoffmann +1

1566
53
75.5%
Apr 21, 2026
49

Model Spec Midtraining: Improving How Alignment Training Generalizes

Chloe Li, Sara Price +2

1565
29
75.9%
May 3, 2026
50

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Guanting Dong, Junting Lu +6

1565
53
71.7%
Apr 20, 2026
51

How Adversarial Environments Mislead Agentic AI?

Zhonghao Zhan, Huichi Zhou +4

1565
78
69.2%
Apr 20, 2026
52

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

Shaden Alshammari, Kevin Wen +6

1563
52
76.9%
Apr 20, 2026
53

The Capability Paradox: How Smarter Auditors Make Multi-Agent Systems Less Secure

Qiqi Liu, Thorsten Holz +2

1563
21
85.7%
May 17, 2026
54

Advancing Mathematics Research with AI-Driven Formal Proof Search

George Tsoukalas, Anton Kovsharov +6

1563
21
90.5%
May 21, 2026
55

Hidden Biases in Conditioning Autoregressive Models

Francois Pachet, Pierre Roy

1563
99
76.8%
Apr 9, 2026
56

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Zhaorun Chen, Xun Liu +6

1562
23
91.3%
May 6, 2026
57

When Reasoning Traces Become Performative: Step-Level Evidence that Chain-of-Thought Is an Imperfect Oversight Channel

Wenkai Li, Fan Yang +3

1562
24
91.7%
May 12, 2026
58

ResearchEVO: An End-to-End Framework for Automated Scientific Discovery and Documentation

Zhe Zhao, Haibin Wen +5

1561
165
71.5%
Apr 7, 2026
59

Resolving the bias-precision paradox with stochastic causal representation learning for personalized medicine

Peisong Zhang, Manqiang Peng +6

1561
24
91.7%
May 7, 2026
60

Characterizing Model-Native Skills

Feiyang Kang, Mahavir Dabas +2

1561
48
62.5%
Apr 19, 2026
61

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics

Shuyue Stella Li, Rui Xin +6

1560
33
69.7%
May 5, 2026
62

Introspection Adapters: Training LLMs to Report Their Learned Behaviors

Keshav Shenoy, Li Yang +5

v2
1560
32
75%
Apr 18, 2026
63

Hallucination as Exploit: Evidence-Carrying Multimodal Agents

Guijia Zhang, Hao Zheng +1

1558
20
65%
May 18, 2026
64

CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators

Nicolás Astorga, Anita Kriz +1

1558
21
95.2%
May 9, 2026
65

Causal Bias Detection in Generative Artificial Intelligence

Drago Plecko

1556
22
90.9%
May 12, 2026
66

Attractor Geometry of Transformer Memory: From Conflict Arbitration to Confident Hallucination

Qiyao Liang, Risto Miikkulainen +1

1556
21
95.2%
May 7, 2026
67

The Geometry of Forgetting: Temporal Knowledge Drift as an Independent Axis in LLM Representations

Rania Elbadry, Ahmed Heakl +5

1555
18
94.4%
May 9, 2026
68

Recursive Multi-Agent Systems

Xiyuan Yang, Jiaru Zou +6

1555
39
61.5%
Apr 28, 2026
69

Fusion-fission forecasts when AI will shift to undesirable behavior

Neil F. Johnson, Frank Yingjie Huo

1554
22
90.9%
May 14, 2026
70

Unleashing LLMs in Bayesian Optimization: Preference-Guided Framework for Scientific Discovery

Xinzhe Yuan, Zhuo Chen +5

1553
20
85%
May 18, 2026
71

Separable Expert Architecture: Toward Privacy-Preserving LLM Personalization via Composable Adapters and Deletable User Proxies

Chris Schneider, Philipp Schoenegger +1

1552
41
63.4%
Apr 23, 2026
72

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Joachim Baumann, Vishakh Padmakumar +4

1551
48
70.8%
Apr 22, 2026
73

Extracting Search Trees from LLM Reasoning Traces Reveals Myopic Planning

Sixing Chen, Ji-An Li +4

1551
33
75.8%
May 7, 2026
74

SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

Satwik Pandey, Suresh Raghu +1

1551
94
73.4%
Apr 7, 2026
75

LACE: Lattice Attention for Cross-thread Exploration

Yang Li, Zirui Zhang +2

1551
66
72.7%
Apr 16, 2026
76

Position: Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment

Tanav Singh Bajaj, Nikhil Singh +2

1550
34
76.5%
May 1, 2026
77

Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization

Yu Li, Sizhe Tang +1

1550
85
70.6%
Apr 8, 2026
78

A Versatile AI Agent for Rare Disease Diagnosis and Risk Gene Prioritization

Tianyu Liu, Wangjie Zheng +6

1549
20
85%
May 7, 2026
79

Agentic Discovery of Exchange-Correlation Density Functionals

Titouan Duston, Jiashu Liang +6

1549
20
90%
May 6, 2026
80

JURY-RL: Votes Propose, Proofs Dispose for Label-Free RLVR

Xinjie Chen, Biao Fu +5

1548
41
58.5%
Apr 28, 2026
81

Transferable Human Mobility Network Reconstruction with neuroGravity

Jinming Yang, Shaoyu Huang +5

1548
41
68.3%
Apr 26, 2026
82

From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models

Ling Shi, Xinwei Wu +6

1548
40
57.5%
Apr 28, 2026
83

CIVeX: Causal Intervention Verification for Language Agents

Fabio Rovai

1548
19
94.7%
May 9, 2026
84

Bias by Necessity: Impossibility Theorems for Sequential Processing with Convergent AI and Human Validation

Jikun Wu, Dongxin Guo +1

1547
20
90%
May 9, 2026
85

PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations

Yang Zhang, Jiangyuan Zhao +6

1547
30
73.3%
Apr 30, 2026
86

QED: An Open-Source Multi-Agent System for Generating Mathematical Proofs on Open Problems

Chenyang An, Qihao Ye +2

v2
1547
40
57.5%
Apr 27, 2026
87

TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction

Tej Sanibh Ranade

1546
37
70.3%
May 18, 2026
88

SMCEvolve: Principled Scientific Discovery via Sequential Monte Carlo Evolution

Jiachen Jiang, Huminhao Zhu +1

1546
29
62.1%
May 14, 2026
89

Data Language Models: A New Foundation Model Class for Tabular Data

Eda Erol, Giuliano Pezzoli +1

1545
19
94.7%
May 7, 2026
90

LLM Reasoning Is Latent, Not the Chain of Thought

Wenshuo Wang

1544
61
67.2%
Apr 17, 2026
91

ASH: Agents that Self-Hone via Embodied Learning

Benjamin Schneider, Xavier Schneider +2

1544
19
89.5%
May 14, 2026
92

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

Yucheng Shi, Zhenwen Liang +4

1544
18
88.9%
May 14, 2026
93

MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning

Maria Nesterova, Mikhail Kolosov +6

1544
140
70.7%
Apr 7, 2026
94

Process Matters more than Output for Distinguishing Humans from Machines

Milena Rmus, Mathew D. Hardy +2

1543
24
91.7%
May 7, 2026
95

Attributing Emergence in Million-Agent Systems

Ling Tang, Jilin Mei +6

1542
20
90%
May 12, 2026
96

PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments

Ruoqi Liu, Imran Q. Mohiuddin +6

1541
37
81.1%
May 4, 2026
97

Rollout Cards: A Reproducibility Standard for Agent Research

Charlie Masters, Ziyuan Liu +1

1541
26
92.3%
May 12, 2026
98

Alignment Imprint: Zero-Shot AI-Generated Text Detection via Provable Preference Discrepancy

Junxi Wu, Kailin Huang +5

1541
34
76.5%
Apr 18, 2026
99

Verifiable Process Rewards for Agentic Reasoning

Huining Yuan, Zelai Xu +6

1541
20
90%
May 11, 2026
100

Remembering More, Risking More: Longitudinal Safety Risks in Memory-Equipped LLM Agents

Ahmad Al-Tawaha, Shangding Gu +3

1541
25
80%
May 18, 2026
101

Thinking in Text and Images: Interleaved Vision-Language Reasoning Traces for Long-Horizon Robot Manipulation

Jinkun Liu, Haohan Chi +6

1541
38
71.1%
May 1, 2026
102

Orchard: An Open-Source Agentic Modeling Framework

Baolin Peng, Wenlin Yao +6

1540
21
85.7%
May 14, 2026
103

State-Centric Decision Process

Sungheon Jeong, Ryozo Masukawa +3

1540
22
95.5%
May 12, 2026
104

Read the Paper, Write the Code: Agentic Reproduction of Social-Science Results

Benjamin Kohler, David Zollikofer +3

1539
53
56.6%
Apr 23, 2026
105

Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover

Jui-Hui Chung, Hongzhou Lin +3

1539
59
66.1%
Apr 9, 2026
106

Distribution-Aware Algorithm Design with LLM Agents

Saharsh Koganti, Priyadarsi Mishra +2

1539
19
89.5%
May 13, 2026
107

Unlocking LLM Creativity in Science through Analogical Reasoning

Andrew Shen, Shaul Druckmann +1

1539
23
91.3%
May 11, 2026
108

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code

Yuze Zhao, Junpeng Fang +6

1539
22
90.9%
May 19, 2026
109

Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners

Botos Csaba, Sreejan Kumar +6

1539
18
94.4%
May 8, 2026
110

Geometric Metrics for MoE Specialization: From Fisher Information to Early Failure Detection

Dongxin Guo, Jikun Wu +1

1539
33
72.7%
Apr 16, 2026
111

From Prompts to Protocols: An AI Agent for Laboratory Automation

Angelos Angelopoulos, James F. Cahoon +1

1538
24
91.7%
May 15, 2026
112

Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs

Xiaozhe Li, Xinyu Fang +6

1537
22
86.4%
May 9, 2026
113

Self-Correction as Feedback Control: Error Dynamics, Stability Thresholds, and Prompt Interventions in LLMs

Aofan Liu, Jingxiang Meng

v2
1536
29
75.9%
Apr 24, 2026
114

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Daniel Zheng, Ingrid von Glehn +6

1536
20
95%
May 7, 2026
115

SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Yuxuan Chen, Changwei Lv +6

1535
22
81.8%
May 21, 2026
116

To See the Unseen: on the Generalization Ability of Transformers in Symbolic Reasoning

Nevena Lazić, Liam Fowl +2

1535
41
70.7%
Apr 23, 2026
117

D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery

Hanane Nour Moussa, Yifei Li +6

v2
1535
43
55.8%
Apr 30, 2026
118

Reasoning Can Be Restored by Correcting a Few Decision Tokens

Changshuo Shen, Leheng Sheng +3

1535
20
85%
May 16, 2026
119

The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?

Zhaoyang Zhang, Run Shao +5

1535
20
85%
May 10, 2026
120

Policy-Invisible Violations in LLM-Based Agents

Jie Wu, Ming Gong

1535
56
67.9%
Apr 14, 2026
121

Seirênes: Adversarial Self-Play with Evolving Distractions for LLM Reasoning

Chi Zhang, Haibo Qiu +4

1535
20
85%
May 12, 2026
122

When Attention Closes: How LLMs Lose the Thread in Multi-Turn Interaction

Vardhan Dongre, Joseph Hsieh +4

1535
21
90.5%
May 13, 2026
123

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Tianle Wang, Zhaoyang Wang +5

1535
25
88%
May 7, 2026
124

SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie, Dongyun Xue +6

1534
49
67.3%
Apr 16, 2026
125

Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

Mingkai Deng, Jinyu Hou +5

1534
22
95.5%
May 21, 2026
126

FormalScience: Scalable Human-in-the-Loop Autoformalisation of Science with Agentic Code Generation in Lean

Jordan Meadows, Lan Zhang +1

1534
29
75.9%
Apr 24, 2026
127

Reasoning Fails Where Step Flow Breaks

Xiaoyu Xu, Yulan Pan +5

1534
51
64.7%
Apr 8, 2026
128

Reasoning Structure Matters for Safety Alignment of Reasoning Models

Yeonjun In, Wonjoong Kim +2

1534
43
67.4%
Apr 21, 2026
129

SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

Tianshi Zheng, Rui Wang +3

1534
47
57.4%
May 2, 2026
130

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Alberto G. Rodríguez Salgado

1534
19
84.2%
May 13, 2026
131

To Whom Do Language Models Align? Measuring Principal Hierarchies Under High-Stakes Competing Demands

Fangyi Yu, Nabeel Seedat +2

1534
20
90%
May 12, 2026
132

MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models

Jason Z Wang

1533
30
76.7%
Apr 15, 2026
133

PROMETHEUS: Automating Deep Causal Research Integrating Text, Data and Models

Sridhar Mahadevan

1533
21
95.2%
May 13, 2026
134

CT Open: An Open-Access, Uncontaminated, Live Platform for the Open Challenge of Clinical Trial Outcome Prediction

Jianyou Wang, Youze Zheng +6

1533
30
76.7%
Apr 17, 2026
135

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Bowen Ye, Rang Li +6

1533
204
75.5%
Apr 7, 2026
136

SAVE: A Generalizable Framework for Multi-Condition Single-Cell Generation with Gene Block Attention

Jiahao Li, Jiayi Dong +4

1533
31
74.2%
Apr 18, 2026
137

Agentick: A Unified Benchmark for General Sequential Decision-Making Agents

Roger Creus Castanyer, Pablo Samuel Castro +1

1533
17
94.1%
May 7, 2026
138

Ulterior Motives: Detecting Misaligned Reasoning in Continuous Thought Models

Sharan Ramjee

1533
29
75.9%
Apr 25, 2026
139

The Evaluation Differential: When Frontier AI Models Recognise They Are Being Tested

Varad Vishwarupe, Nigel Shadbolt +2

1533
21
90.5%
May 12, 2026
140

Missingness-MDPs: Bridging the Theory of Missing Data and POMDPs

Joshua Wendland, Markel Zubia +6

1532
25
92%
May 12, 2026
141

CLEF: EEG Foundation Model for Learning Clinical Semantics

Peng Cao, Ali Mirzazadeh +3

1532
20
85%
May 11, 2026
142

A Foundation Model for Zero-Shot Logical Rule Induction

Yin Jun Phua

1531
20
90%
May 6, 2026
143

Seeing Through Experts Eyes A Foundational Vision Language Model Trained on Radiologists Gaze and Reasoning

Kinhei Lee, Peiyuan Jing +6

1530
29
75.9%
Apr 15, 2026
144

TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints

Zabir Al Nazi, Shubhashis Roy Dipta

1530
19
84.2%
May 13, 2026
145

Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack

Hao Wang, Hanchen Li +4

1530
19
89.5%
May 12, 2026
146

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Yujun Wu, Dongxu Zhang +6

v2
1529
33
78.8%
Apr 30, 2026
147

FibQuant: Universal Vector Quantization for Random-Access KV-Cache Compression

Namyoon Lee, Yongjune Kim

1529
19
89.5%
May 12, 2026
148

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Qihan Ren, Peng Wang +6

1529
53
66%
Apr 8, 2026
149

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

Víctor Gallego

1529
29
79.3%
Apr 25, 2026
150

MCPHunt: An Evaluation Framework for Cross-Boundary Data Propagation in Multi-Server MCP Agents

Haonan Li, Tianjun Sun +2

1529
48
60.4%
Apr 30, 2026
151

Stability Implies Redundancy: Delta Attention Selective Halting for Efficient Long-Context Prefilling

Yujie Chen, Tailai Chen +5

1529
41
65.9%
Apr 20, 2026
152

Fully Open Meditron: An Auditable Pipeline for Clinical LLMs

Xavier Theimer-Lienhard, Mushtaha El-Amin +6

1529
16
81.2%
May 15, 2026
153

Learning to Hand Off: Provably Convergent Workflow Learning under Interface Constraints

Jiayu Li, Enpei Zhang +3

1528
22
86.4%
May 18, 2026
154

Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems

Ye Yu, Heming Liu +4

1528
43
65.1%
Apr 23, 2026
155

Poly-EPO: Training Exploratory Reasoning Models

Ifdita Hasan Orney, Jubayer Ibn Hamid +6

v2
1528
28
78.6%
Apr 19, 2026
156

Geometry over Density: Few-Shot Cross-Domain OOD Detection

Shawn Li, You Qin +5

v2
1527
16
87.5%
May 5, 2026
157

SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions

Ashima Suvarna, Kendrick Phan +3

1526
67
70.1%
Apr 9, 2026
158

Multi-Agent Orchestration for High-Throughput Materials Screening on a Leadership-Class System

Thang Duc Pham, Harikrishna Tummalapalli +6

1526
49
67.3%
Apr 9, 2026
159

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

Kerri Prinos, Lilianne Brush +6

1526
27
81.5%
May 4, 2026
160

From Debate to Decision: Conformal Social Choice for Safe Multi-Agent Deliberation

Mengdie Flora Wang, Haochen Xie +6

1526
58
63.8%
Apr 9, 2026
161

When Can Human-AI Teams Outperform Individuals? Tight Bounds with Impossibility Guarantees

Dongxin Guo, Jikun Wu +1

1526
19
94.7%
May 9, 2026
162

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

Xing Zhang, Guanghui Wang +5

1526
44
68.2%
Apr 13, 2026
163

Remember the Decision, Not the Description: A Rate-Distortion Framework for Agent Memory

Mingxi Zou, Zhihan Guo +6

1526
22
86.4%
May 11, 2026
164

Can Large Language Models Reinvent Foundational Algorithms?

Jian Zhao, Haoren Luo +4

1525
111
65.8%
Apr 7, 2026
165

The Compliance Trap: How Structural Constraints Degrade Frontier AI Metacognition Under Adversarial Pressure

Rahul Kumar

1525
34
76.5%
May 4, 2026
166

CoDaS: AI Co-Data-Scientist for Biomarker Discovery via Wearable Sensors

Yubin Kim, Salman Rahman +6

1525
30
76.7%
Apr 16, 2026
167

XDecomposer: Learning Prior-Free Set Decomposition for Multiphase X-ray Diffraction

Hanyu Gao, Bin Cao +3

1525
20
95%
May 7, 2026
168

AIBuildAI: An AI Agent for Automatically Building AI Models

Ruiyi Zhang, Peijia Qin +3

1525
33
75.8%
Apr 15, 2026
169

State Contamination in Memory-Augmented LLM Agents

Yian Wang, Agam Goyal +2

1525
26
84.6%
May 16, 2026
170

The Query Channel: Information-Theoretic Limits of Masking-Based Explanations

Erciyes Karakaya, Ozgur Ercetin

1525
29
79.3%
Apr 17, 2026
171

Imperfect World Models are Exploitable

Logan Mondal Bhamidipaty, Esmeralda S. Whitammer +3

v2
1524
22
77.3%
May 15, 2026
172

IoT-Brain: Grounding LLMs for Semantic-Spatial Sensor Scheduling

Zhaomeng Zhou, Lan Zhang +4

1523
46
65.2%
Apr 9, 2026
173

Correct Is Not Enough: Training Reasoning Planners with Executor-Grounded Rewards

Tianyang Han, Hengyu Shi +4

v2
1523
20
80%
May 5, 2026
174

Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories

Peiyang Liu, Zhirui Chen +5

1523
59
67.8%
Apr 13, 2026
175

Quantifying and Understanding Uncertainty in Large Reasoning Models

Yangyi Li, Chenxu Zhao +1

1523
55
65.5%
Apr 15, 2026
176

Generative Recursive Reasoning

Junyeob Baek, Mingyu Jo +4

v2
1523
19
68.4%
May 19, 2026
177

Geometric Routing Enables Causal Expert Control in Mixture of Experts

Ivan Ternovtsii, Yurii Bilak

1523
41
51.2%
Apr 15, 2026
178

Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems

Xi-Wei Pan, Shi-Wen An +1

1522
80
66.2%
Apr 13, 2026
179

Self-Programmed Execution for Language-Model Agents

Luke J. O'Connor

1522
19
89.5%
May 7, 2026
180

FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

Shivanshu Shekhar, Sagnik Mukherjee +2

1522
72
66.7%
Apr 8, 2026
181

OptimusKG: Unifying biomedical knowledge in a modern multimodal graph

Lucas Vittor, Ayush Noori +6

1522
30
73.3%
Apr 29, 2026
182

The Two Boundaries: Why Behavioral AI Governance Fails Structurally

Alan L. McCann

1522
31
80.6%
Apr 30, 2026
183

From Holo Pockets to Electron Density: GPT-style Drug Design with Density

Jiahao Chen, Letian Gao +5

1522
22
86.4%
May 9, 2026
184

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

Yizhe Chi, Deyao Hong +6

1522
57
66.7%
Apr 14, 2026
185

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play

Roger Creus Castanyer, Geoffrey Bradway +4

1521
20
80%
May 16, 2026
186

Do Self-Evolving Agents Forget? Capability Degradation and Preservation in Lifelong LLM Agent Adaptation

Ye Yu, Xiaopeng Yuan +4

1521
20
90%
May 10, 2026
187

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

Bo Yin, Qi Li +1

1520
21
81%
May 12, 2026
188

δ-mem: Efficient Online Memory for Large Language Models

Jingdi Lei, Di Zhang +6

1520
20
80%
May 12, 2026
189

ECG-WM: A Physiology-Informed ECG World Model for Clinical Intervention Simulation

Zhikang Chen, Yue Wang +5

1519
20
80%
May 17, 2026
190

Von Neumann Networks

Shekhar S. Chandra

1519
22
86.4%
May 7, 2026
191

Ex Ante Evaluation of AI-Induced Idea Diversity Collapse

Nafis Saami Azad, Raiyan Abdul Baten

1519
21
85.7%
May 7, 2026
192

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Ali Hatamizadeh, Yejin Choi +1

1518
18
88.9%
May 21, 2026
193

NeuroMAS: Multi-Agent Systems as Neural Networks with Joint Reinforcement Learning

Haoran Lu, Luyang Fang +2

1518
20
80%
May 16, 2026
194

Safety Geometry Collapse in Multimodal LLMs and Adaptive Drift Correction

Jiahe Guo, Xiangran Guo +6

1518
25
84%
May 18, 2026
195

Quantifying the human visual exposome with vision language models

Christian Rominger, Andreas R. Schwerdtfeger +6

1517
37
64.9%
May 5, 2026
196

Contextual Agentic Memory is a Memo, Not True Memory

Binyan Xu, Xilin Dai +1

1517
52
69.2%
Apr 30, 2026
197

Reinforcing VLAs in Task-Agnostic World Models

Yucen Wang, Rui Yu +6

1517
23
82.6%
May 12, 2026
198

Breaking Winner-Takes-All: Cooperative Policy Optimization Improves Diverse LLM Reasoning

Haoxuan Chen, Tianming Liang +2

1517
20
85%
May 12, 2026
199

MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs

Junwei Liao, Haoting Shi +6

1517
18
88.9%
May 8, 2026
200

Beyond Fixed Benchmarks and Worst-Case Attacks: Dynamic Boundary Evaluation for Language Models

Haoxiang Wang, Da Yu +1

1517
19
84.2%
May 7, 2026
Win-rate scores from pairwise comparisons with 95% confidence intervals. Papers compared using full-text deep analysis.
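
The note above mentions 95% confidence intervals on win rates, but the page does not say how they are computed. As a rough illustration only, here is a minimal Python sketch that assumes a Wilson score interval on a binomial win rate (wins divided by matches). The choice of interval and the function name `wilson_interval` are assumptions made for this example, not the site's documented method.

```python
import math


def wilson_interval(wins: int, matches: int, z: float = 1.96) -> tuple[float, float]:
    """Return (lower, upper) bounds of the Wilson score interval for a win rate.

    Hypothetical helper: the ranking page does not state which interval it uses.
    z = 1.96 corresponds to a two-sided 95% confidence level.
    """
    if matches <= 0:
        raise ValueError("matches must be positive")
    if not 0 <= wins <= matches:
        raise ValueError("wins must be between 0 and matches")

    p = wins / matches                      # observed win rate
    denom = 1 + z * z / matches             # shared normalising term
    center = (p + z * z / (2 * matches)) / denom
    half = z * math.sqrt(p * (1 - p) / matches + z * z / (4 * matches * matches)) / denom

    # Clamp to [0, 1] to guard against floating-point overshoot at the extremes.
    return max(0.0, center - half), min(1.0, center + half)


# Rank 2 in the list: 89 matches at a 95.5% win rate, which corresponds to 85 wins.
low, high = wilson_interval(85, 89)
print(f"win rate {85 / 89:.1%}, 95% CI [{low:.1%}, {high:.1%}]")
```

Under this assumption, entries with few matches get noticeably wider intervals than entries with hundreds of matches, which is worth keeping in mind when comparing papers whose scores differ by only a few points.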