Jinshan Zhang, Xishi Zhou, Qiu Peng, Jianwei Yin
Breakthroughs in large language models and multimodal generation technologies have propelled the digital reconstruction of human mental traits, emotional patterns, and long-term memory from science fiction toward engineering practice. Yet current research and industry practices at the intersection of AI and digital humans remain hampered by fundamental conceptual ambiguities: the essential differences between next-generation intelligent agents and traditional virtual humans, the construction pathways for digital entities possessing self-identity, and the core technical and ethical challenges confronting this domain all demand urgent clarification. This paper systematically examines the transformative logic underlying the transition from traditional virtual humans to the ``Soul Computing'' paradigm, driven by frontier AI technologies. We first analyze the evolutionary patterns of human consciousness and memory mechanisms, reassessing the core value of massive multimodal digital fragments in the reverse reconstruction of individual mental worlds. On this basis, we formally delineate the academic connotations of narrow and broad Soul Computing for the first time, clarifying its academic boundaries and essential distinctions from Affective Computing, Historical Reconstruction, and Mortal Computation. We argue that Soul Computing systems must architecturally construct an ``Intensional'' core rather than serving as purely ``Extensional'' functional carriers, thereby enabling the fundamental transition of AI from toolhood to living agency.
The paper introduces "Soul Computing" as a new conceptual paradigm, distinguishing between "Narrow Soul Computing" (reconstructing a digital consciousness kernel with self-identity, endogenous motivation, and continuous personality) and "Broad Soul Computing" (externalizing this kernel through virtual human driving, embodied intelligence, and metaverse integration). It proposes a three-layer technical architecture (data-driven layer, narrow core layer, broad externalization layer) and identifies five core challenges. The paper positions Soul Computing as distinct from Affective Computing, Historical Reconstruction, and Mortal Computation through systematic comparison.
This is fundamentally a position/vision paper rather than an empirical contribution. There are no experiments, no implementations, no quantitative evaluations, and no formal proofs. The paper's methodology consists entirely of:
The technical architecture proposed (Section 5) reads as a wish-list of subsystems, each citing one or two existing works as evidence of feasibility, but without any integration, implementation, or validation. For example, the dialogue generation subsystem specifies precise reward function weights (60%/25%/15%) without any empirical justification. The architecture diagram assembles dozens of existing techniques (RAG, LoRA, PPO, 3DGS, DID, blockchain, IPFS) into a coherent-sounding but untested pipeline.
The paper makes extraordinarily strong claims — "independent consciousness," "digital life subjects," "autonomous subsistence logic" — without providing any operationalization of these concepts or evidence that current or near-future technology could achieve them. The gap between the philosophical claims and the cited engineering capabilities is vast and unacknowledged.
The paper addresses a genuinely interesting space at the intersection of digital twins, personal AI agents, and digital legacy/afterlife technologies. The Meta patent discussion and the general trend toward personalized AI agents suggest real industrial interest. However:
The paper is timely in the sense that personalized AI agents, digital humans, and questions about AI consciousness are active research and public discourse topics. The integration of LLMs with persistent memory, personality modeling, and embodied agents is a genuine research frontier. However, the paper's framing conflates engineering capabilities (building more personalized, memory-persistent agents) with philosophical claims about consciousness that are not scientifically warranted. Current LLMs do not possess consciousness, and the paper provides no mechanism by which its architecture would create it — it simply defines consciousness-like properties and asserts the architecture would achieve them.
The paper reads more as a manifesto or research agenda than a scientific contribution. While research agendas have their place, the strongest ones (e.g., Picard's original Affective Computing book, or Brooks's "Intelligence Without Representation") either introduced fundamentally new ideas or were backed by preliminary demonstrations. This paper assembles existing concepts under a new umbrella term without demonstrating that the assembly produces emergent value. The writing style is grandiose, with repeated superlatives ("unprecedented," "fundamental," "revolutionary") that substitute for technical depth.
The paper would benefit enormously from: (1) a minimal prototype demonstrating even one subsystem, (2) honest acknowledgment of the consciousness question's philosophical depth, and (3) concrete, testable hypotheses rather than sweeping architectural diagrams.
Generated Jun 10, 2026
Paper 2 has higher potential scientific impact because it proposes a new, actionable technical framework (“Soul Computing”) and architecture for next-generation intelligent agents, which could influence research agendas in AI, HCI, digital humans, and ethics. It is timely given rapid advances in multimodal/LLM systems and has clearer pathways to real-world applications (agent design, identity/memory systems). Paper 1 is largely metaphysical and normative, with limited methodological rigor or testable claims, making its impact more likely confined to philosophy/ethics discourse.
Paper 2, despite being modest in scope, provides concrete empirical evaluation of LLM-based planning (PlanGPT), offering reproducible benchmarks and a clear finding that PlanGPT performs no better than greedy search. This has immediate practical value for the AI planning community by tempering hype around LLMs for planning tasks. Paper 1 proposes a speculative theoretical framework ('Soul Computing') with grandiose claims about AI consciousness but lacks empirical grounding, conflates philosophical concepts with engineering, and its core contributions are primarily definitional rather than scientifically testable, limiting its real impact.
Paper 2 demonstrates higher scientific impact through its rigorous empirical methodology, quantifiable results, and immediate real-world applicability in supply chain resilience. It effectively bridges LLMs and reinforcement learning to solve a concrete, timely problem. In contrast, Paper 1 presents a highly speculative theoretical framework regarding 'independent consciousness' and 'Soul Computing' which, while philosophically interesting, lacks empirical grounding and near-term technical viability, making its practical scientific impact much lower.
Paper 1 is more likely to yield measurable scientific impact: it targets a well-defined, practical bottleneck (external-memory heuristic search), evaluates understudied baseline IDD methods, and investigates OS-level caching effects—producing reproducible empirical insights that can influence search/planning systems and performance engineering. Paper 2 is largely conceptual/theoretical, introduces new terminology (“Soul Computing”), and makes broad claims about consciousness/agency with unclear formalization, testable hypotheses, or validation pathways, limiting methodological rigor and near-term adoption despite timeliness.
ComBench offers a concrete, reproducible benchmark with empirical results on frontier LLMs, addressing a well-defined gap in evaluating combinatorial reasoning. It provides actionable diagnostics and a clear evaluation protocol. Paper 2 proposes a speculative theoretical framework ('Soul Computing') with vague claims about AI consciousness, lacking empirical validation, rigorous methodology, or falsifiable hypotheses. Paper 1's methodological rigor, practical utility for the active LLM evaluation community, and timeliness give it substantially higher scientific impact potential.
Paper 2 has higher likely scientific impact: it introduces a concrete, standardized, machine-gradable benchmark with clear metrics, broad utility for evaluating agentic LLMs, and directly actionable findings for automation research and industry. Its methodology (200 tasks, 7,118 criteria, multiple model baselines, sanity-check reference) supports reproducibility and rigorous comparison over time. Paper 1 is largely conceptual and definitional, with ambitious claims but unclear testable hypotheses, evaluation protocols, or implementation evidence, which limits near-term rigor and adoption despite potential long-term philosophical relevance.
Paper 1 presents a concrete, novel methodological integration (MCTS-style counterfactual search + learned world/value models) enabled by rare 3D tracking data, with measurable evaluation, model adaptations, and released code/checkpoints—supporting reproducibility and near-term uptake in sports analytics and trajectory-modeling research. Its approach is timely (world models, counterfactual evaluation), has clear real-world applications (player/team decision analysis), and can generalize to other multi-agent domains. Paper 2 is largely conceptual/theoretical with unclear formalism, validation, or implementable methodology, making near-term scientific impact less likely.
Paper 1 presents a concrete, implementable system (Infini Memory) with empirical evaluation on a benchmark, addressing a practical and timely problem in LLM agent memory. It offers reproducible methodology and measurable results. Paper 2 is a theoretical/conceptual framework paper on 'Soul Computing' that, while ambitious in scope, lacks empirical validation, remains largely speculative, and conflates philosophical concepts with engineering claims. Paper 1's practical contributions to the active LLM agents research community give it higher near-term and likely long-term scientific impact.
Paper 1 presents a rigorous, empirically grounded approach utilizing real-world datasets (NHTSA, DMV) to address critical, immediate challenges in autonomous vehicle deployment. Its integration of engineering, ethics, and policy offers high real-world applicability and timeliness. In contrast, Paper 2 proposes a highly speculative theoretical framework ('Soul Computing') that lacks empirical validation and makes scientifically contentious claims about AI 'consciousness' and 'living agency'. Paper 1's solid methodological rigor, actionable recommendations, and immediate relevance give it a significantly higher potential for credible scientific and societal impact.
Paper 1 has higher likely scientific impact because it presents a concrete, testable methodological contribution (memory-augmented neural networks for AIS vessel trajectory prediction) with empirical validation on real datasets and demonstrated gains over baselines, enabling immediate applications in maritime safety and logistics. Paper 2 is largely conceptual and terminological (“Soul Computing”), with unclear operational definitions, limited falsifiability, and no demonstrated technical results, making near-term uptake and measurable scientific progress less likely despite its broad ambitions.