Haimeng Zhao, Alexander Zlokapa, Hartmut Neven, Ryan Babbush, John Preskill, Jarrod R. McClean, Hsin-Yuan Huang
Broadly applicable quantum advantage, particularly in classical data processing and machine learning, has been a fundamental open problem. In this work, we prove that a small quantum computer of polylogarithmic size can perform large-scale classification and dimension reduction on massive classical data by processing samples on the fly, whereas any classical machine achieving the same prediction performance requires exponentially larger size. Furthermore, classical machines that are exponentially larger yet below the required size need superpolynomially more samples and time. We validate these quantum advantages in real-world applications, including single-cell RNA sequencing and movie review sentiment analysis, demonstrating four to six orders of magnitude reduction in size with fewer than 60 logical qubits. These quantum advantages are enabled by quantum oracle sketching, an algorithm for accessing the classical world in quantum superposition using only random classical data samples. Combined with classical shadows, our algorithm circumvents the data loading and readout bottleneck to construct succinct classical models from massive classical data, a task provably impossible for any classical machine that is not exponentially larger than the quantum machine. These quantum advantages persist even when classical machines are granted unlimited time or if BPP=BQP, and rely only on the correctness of quantum mechanics. Together, our results establish machine learning on classical data as a broad and natural domain of quantum advantage and a fundamental test of quantum mechanics at the complexity frontier.
This paper addresses one of the most fundamental open problems in quantum computing: whether quantum machines can provide exponential advantages for processing classical data. The authors prove that a polylogarithmic-size quantum computer can perform classification, dimension reduction, and linear system solving on massive classical datasets, whereas any classical machine achieving comparable performance requires exponentially larger memory. The key innovation is quantum oracle sketching, an algorithm that constructs coherent quantum queries from streaming classical data samples by applying incremental quantum rotations on the fly—each sample is processed once and immediately discarded. This elegantly resolves the long-standing tension between quantum algorithms' need for coherent oracle access and the classical nature of real-world data, without requiring QRAM. Combined with interferometric classical shadows for efficient readout, the framework enables end-to-end construction of compact classical models from massive data.
The paper is exceptionally rigorous, with a 144-page appendix containing detailed proofs. The theoretical framework rests on several pillars:
Broad applicability: Unlike prior quantum advantages restricted to contrived or specialized problems, this work targets genuinely ubiquitous tasks—SVMs, PCA, and linear systems—that underpin machine learning, scientific computing, and data analysis across industries.
Memory wall relevance: The results directly address the "memory wall" problem in modern computing, where memory capacity has become the primary bottleneck (growing 410× slower than model parameters). This positions quantum computing as a solution to a pressing practical constraint.
Paradigm shift for quantum ML: The paper effectively rebuts widespread skepticism about quantum advantages for classical data by demonstrating that (1) QRAM is unnecessary, (2) noisy/correlated data can be handled, (3) dequantized algorithms still retain exponential space advantages, and (4) the readout bottleneck is solvable via interferometric classical shadows.
Fundamental physics implications: The authors frame their results as a test of quantum mechanics at the complexity frontier—experimental confirmation would probe the physical reality of exponentially large Hilbert spaces, analogous to Bell inequality tests for nonlocality.
The paper arrives at a critical juncture: quantum error correction is becoming practical, memory constraints dominate AI scaling, and the field has been seeking concrete, broadly applicable quantum advantages beyond cryptanalysis and simulation. The work directly addresses the community's need for provable, practical quantum utility.
This is a landmark contribution that fundamentally advances our understanding of quantum advantages for classical data processing. The combination of algorithmic innovation (quantum oracle sketching), rigorous unconditional lower bounds, and practical demonstrations on real datasets represents a rare achievement in quantum computing research. The work opens a genuinely new frontier by establishing that memory-efficient classical data processing is a natural domain of quantum advantage.
Generated Apr 10, 2026
Paper 2 claims a provable exponential quantum advantage for processing massive classical data using fewer than 60 logical qubits. By circumventing the notorious quantum data loading bottleneck, it unlocks quantum machine learning for broad, real-world domains like biology and natural language processing. While Paper 1 presents a highly rigorous algorithm for battery chemistry, its massive fault-tolerant resource requirements make it a longer-term goal. Paper 2's broader applicability, lower qubit requirements, and resolution of a fundamental open problem in quantum ML grant it a higher potential scientific impact.
Paper 1 demonstrates a fundamental breakthrough by proving and validating an exponential quantum advantage for classical data processing and machine learning. Its impact spans multiple fields, with real-world applications in biology and natural language processing using resources that are remarkably small (<60 logical qubits). While Paper 2 offers a mathematically optimal protocol for quantum channel estimation, its applications are largely confined to quantum characterization and learning theory. Paper 1's broader applicability, resolution of a major open problem, and cross-disciplinary relevance give it a significantly higher potential scientific impact.
Paper 2 establishes a provable exponential quantum advantage for classical data processing and machine learning—a fundamental open problem. It demonstrates broad applicability (single-cell RNA-seq, sentiment analysis), requires only ~60 logical qubits (near-term relevance), and the advantage persists even if BPP=BQP, relying only on quantum mechanics' correctness. This has transformative implications across quantum computing, complexity theory, and machine learning. Paper 1, while a solid engineering advance in electro-optic transduction using lithium tantalate, represents an incremental materials/platform improvement over existing lithium niobate approaches with modest efficiency gains.
Paper 2 establishes a provable exponential quantum advantage for classical data processing and machine learning—a fundamental open problem. It demonstrates broad applicability across real-world tasks (single-cell RNA-seq, sentiment analysis) with concrete resource estimates (<60 logical qubits), and the advantages hold unconditionally (even if BPP=BQP). This addresses a more foundational and widely impactful question than Paper 1's engineering advance in semi-device-independent randomness amplification. Paper 2's breadth of impact across quantum computing, complexity theory, and machine learning, combined with its theoretical depth and practical relevance, gives it substantially higher potential impact.
Paper 2 addresses the fundamental open problem of quantum advantage for classical data processing, proving exponential quantum advantages in machine learning with real-world validation (RNA sequencing, sentiment analysis). Its results are broadly applicable, require only ~60 logical qubits, and persist even if BPP=BQP. The practical implications for near-term quantum computing and machine learning give it enormous cross-disciplinary impact. Paper 1 is theoretically elegant, establishing fermions as more nonlocal than bosons, but its impact is narrower, primarily advancing foundational quantum information theory without immediate practical applications.
Paper 2 has higher potential impact due to a broadly applicable, complexity-theoretic exponential quantum advantage for massive classical-data tasks (classification and dimension reduction), addressing a central open problem in quantum computing. It claims strong separations in machine size/sample complexity, introduces a general technique (quantum oracle sketching + classical shadows) mitigating data-loading/readout bottlenecks, and reports real-world validations with <60 logical qubits, spanning ML, bioinformatics, and NLP. Paper 1 is novel and practical for secure networked quantum metrology, but its scope is narrower and likely affects a more specialized community.
Paper 1 establishes a provable exponential quantum advantage for classical machine learning by circumventing the notorious data loading bottleneck. While Paper 2 presents a highly significant experimental milestone in fault-tolerant quantum computing, Paper 1 solves a fundamental theoretical open problem with immediate, broad real-world applicability (e.g., biology, NLP). Demonstrating an exponential reduction in required computational size using fewer than 60 logical qubits represents a paradigm-shifting breakthrough with massive cross-disciplinary impact, giving it a higher potential scientific impact overall.
Paper 2 establishes exponential quantum advantage for classical data processing and machine learning—a fundamental open problem with enormous breadth of impact. It provides provable separations that hold regardless of computational complexity assumptions (even if BPP=BQP), validates results on real-world datasets (RNA sequencing, sentiment analysis), and demonstrates practical relevance with fewer than 60 logical qubits. Paper 1, while rigorous and insightful for quantum reservoir computing, addresses a more niche topic (continuous-variable QRC with Kerr nonlinearity) with narrower applicability. Paper 2's implications span quantum computing, ML, and foundations of quantum mechanics.
Paper 2 has higher potential impact due to its claimed broadly applicable, provable exponential quantum advantage for classical-data ML tasks, addressing a central open problem with cross-field relevance (quantum complexity, algorithms, and applied ML). If correct, the “polylog-size quantum vs exponentially larger classical” separation plus real-world validations would be transformative and timely for near-term quantum computing. Paper 1 is methodologically solid and important for QLDPC logical processing, but its impact is more specialized to fault-tolerant architecture and code surgery, with narrower breadth than a general quantum advantage framework.
Paper 1 targets a central, longstanding question—broad quantum advantage for classical data processing—and claims exponential separations with minimal assumptions, plus demonstrations on real datasets with <60 logical qubits. If valid, it would reshape expectations for near-term and fault-tolerant quantum ML, influence complexity theory, and provide a new experimental testbed at the “complexity frontier,” giving broad cross-field impact and high timeliness. Paper 2 is methodologically strong and practically important for fault-tolerant compilation, but is a more incremental (though valuable) advance within quantum compiling and resource estimation.