CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models
Abhilash Durgam, Nyle Siddiqui, Jeffrey A. Chan-Santiago, Qiushi Fu, Elakkat D. Gireesh, Mubarak Shah
Abstract
Electroencephalography (EEG) is a critical, non-invasive method to monitor electrical brain activity. EEGs can span anywhere from a couple seconds to multiple hours, posing a major hurdle for existing deep learning methods due to two major factors: (1) existing EEG models are predominantly built upon the attention mechanism, incurring quadratic scaling as the sequence length increases, and (2) raw EEG signals must be processed in a sliding-window fashion due to fixed-length input requirements, preventing global understanding of the entire signal. To this extent, we propose CaMBRAIN - the first Causal, Mamba-based state space model (SSM) capable of real-time inference of EEG signals, arguing that bidirectional approaches are needlessly expensive given the causal, unidirectional nature of EEG. However, training such a model is non-trivial, as crucial EEG events can be extremely brief - within fractions of a second - yet separated by long intervals spanning minutes. Current EEG methods use self-supervised objectives that optimize for signal reconstruction, but these are not well suited for streaming SSMs; they fail to explicitly train the hidden state to retain the salient long-range context needed for streaming inference. We therefore introduce a multi-stage self-supervised training pipeline specifically tailored to encourage long-range memory retention and strong performance on EEG signals, while preserving the linear-time complexity of state space models. CaMBRAIN achieves state-of-the-art (SOTA) results across 3 different EEG datasets with >10x higher throughput than existing models, enabling the first model capable of long-range, continuous inference of variable-length EEG signals.
AI Impact Assessments
(1 models)Scientific Impact Assessment: CaMBRAIN
1. Core Contribution
CaMBRAIN introduces a causal, unidirectional state space model (based on Mamba-3) for continuous, streaming EEG inference. The key insight is that EEG is inherently a causal, temporal signal, making bidirectional processing both unnecessary and computationally wasteful. The paper makes three interconnected contributions: (1) a persistent hidden-state architecture that eliminates sliding-window redundancy, (2) a three-stage self-supervised training pipeline designed to encourage long-range memory retention in the hidden state, and (3) demonstration that this approach achieves competitive or state-of-the-art results with >10× throughput improvement.
The training pipeline is arguably the most novel element. Stage 1 combines autoregressive next-token prediction with masked reconstruction under causal constraints. Stage 2 introduces a JEPA-style student-teacher framework where the student operates on masked inputs and must predict both masked and future latent representations from the teacher — critically, this trains the hidden state to retain predictive, long-range context rather than merely reconstruct local signals. This distinction between reconstruction-based and representation-based objectives for streaming SSMs is a meaningful conceptual contribution.
2. Methodological Rigor
The experimental design is reasonable, covering four datasets (TUAR, TUAB, MAT, CHB-MIT) spanning artifact detection, abnormality classification, mental stress, and seizure detection. The paper reports standard metrics (AUROC, AUPR) and provides three-seed averages with standard deviations in the appendix.
Strengths in rigor:
Weaknesses in rigor:
3. Potential Impact
Clinical deployment: The 4.97ms per-patch latency and 720KB persistent state make CaMBRAIN genuinely deployable for bedside EEG monitoring, ICU seizure detection, and wearable BCI applications. The 1.23 GFLOPs/s sustained compute is 6-158× below competing models — this is a practically meaningful efficiency gap.
Paradigm shift in EEG processing: The paper makes a convincing case that treating EEG as a continuous stream rather than independent windows is fundamentally more appropriate. If validated at scale, this could reshape how clinical EEG systems are designed.
Broader SSM applications: The training pipeline (reconstruction → representation-level JEPA) for causal SSMs could transfer to other streaming physiological signals (ECG, EMG, continuous glucose monitoring).
4. Timeliness & Relevance
The paper is well-timed: Mamba-3 is very recent (ICLR 2026), and the EEG foundation model space is rapidly evolving (LUNA, REVE, CBraMod all from 2024-2025). The gap between foundation model capabilities and real-time clinical deployment is a genuine bottleneck. The JEPA-style training for SSMs also connects to active research threads in self-supervised learning.
The causal argument is particularly relevant as clinical EEG monitoring is inherently real-time — you cannot wait for future context when a patient is seizing.
5. Strengths & Limitations
Key strengths:
Key limitations:
Overall Assessment
CaMBRAIN presents a well-motivated and architecturally sound approach to a genuine clinical need. The combination of causal SSM architecture with a JEPA-style training pipeline for long-range memory retention is the paper's strongest intellectual contribution. The efficiency improvements are substantial and practically meaningful. However, the inconsistent performance across benchmarks (strong on TUAR/MAT/CHB-MIT, weaker on TUAB), the smaller pretraining corpus, and the unresolved failure mode of persistent state accumulating misleading context temper the impact. The work represents a solid advance in EEG modeling methodology, particularly for streaming applications, but falls short of being a definitive solution.
Generated May 28, 2026
Comparison History (20)
CaMBRAIN introduces a fundamentally new architecture paradigm for EEG processing—causal SSMs enabling real-time continuous inference—which addresses a critical bottleneck in clinical neuroscience. Its contributions span architectural innovation (first causal Mamba-based EEG model), a novel multi-stage self-supervised training pipeline for long-range memory retention, and practical clinical applicability with >10x throughput gains. The breadth of impact across neuroscience, clinical monitoring, and deep learning for time-series is substantial. Paper 1, while solid, is more incremental—refining DPO for multimodal reasoning hallucination—within an already crowded space of LLM alignment methods.
CaMBRAIN introduces a fundamentally new architecture for EEG processing that addresses critical limitations (quadratic scaling, fixed-length inputs) with a novel causal SSM approach and custom training pipeline. It achieves SOTA across 3 datasets with 10x throughput gains, enabling real-time continuous EEG monitoring with clear clinical applications. Paper 1, while identifying an important LLM security threat (sleeper attacks), is more incremental within the adversarial AI safety space, extending known attack paradigms to multi-interaction settings. Paper 2's cross-disciplinary impact (ML + neuroscience + clinical medicine) and practical applicability give it broader potential impact.
CaMBRAIN introduces a fundamentally new architecture paradigm for EEG processing—the first causal SSM for real-time continuous EEG inference—addressing critical scalability limitations of attention-based models. It achieves SOTA across 3 datasets with >10x throughput gains, enabling practical real-time clinical monitoring of variable-length signals. This has broad impact across neuroscience, clinical medicine, and BCI applications. Paper 1, while addressing an important safety concern (detection-to-abstention gap), is more incremental in scope, primarily refining LLM reasoning behavior. CaMBRAIN's architectural innovation and direct clinical applicability give it broader and more transformative potential impact.
CaMBRAIN addresses a fundamental challenge in EEG processing with a novel architecture (causal SSM for streaming EEG) and a tailored training pipeline for long-range memory retention. It achieves SOTA across multiple datasets with 10x throughput gains, enabling real-time continuous inference—a first in the field with clear clinical applications (brain monitoring, seizure detection). Paper 1 proposes a useful systems-level architectural layer for multi-agent LLM serving with solid engineering contributions, but its impact is more incremental and narrowly scoped to LLM infrastructure optimization. Paper 2's broader cross-disciplinary impact (ML + neuroscience + clinical medicine) and methodological novelty give it higher potential impact.
Paper 2 addresses fundamental architectural bottlenecks in deep learning for clinical time-series. By adapting state space models (Mamba) for real-time, continuous EEG inference, it enables >10x throughput and solves fixed-window limitations. This breakthrough has profound, durable implications for Brain-Computer Interfaces and continuous healthcare monitoring. While Paper 1 is highly timely and offers valuable explainability for GenAI detection, its long-term scientific impact is constrained by the inherently adversarial and fleeting nature of AI text detection.
Paper 1 addresses a critical bottleneck in real-time, long-sequence clinical data processing by adapting state space models (Mamba) for EEG inference. Its combination of high real-world applicability in healthcare, massive efficiency gains (>10x throughput), and addressing long-range memory retention gives it a broader potential impact across both AI and neurotechnology. Paper 2 offers strong theoretical contributions to operations research, but Paper 1's impact is likely to be more immediate and widespread due to its relevance to medical diagnostics and deep learning architecture scaling.
Paper 1 presents a methodological breakthrough in bio-signal processing by applying Mamba-based State Space Models to EEG data, solving the quadratic scaling and fixed-window limitations of current transformer models. With demonstrated SOTA results across three datasets and a 10x throughput improvement, its potential for immediate, high-impact real-world applications in clinical monitoring and BCIs is immense. While Paper 2 offers a highly relevant ethical critique of LLMs, Paper 1 introduces a rigorously tested, novel architecture likely to drive substantial quantitative follow-up research and technological adoption in machine learning and neuroscience.
Paper 2 addresses a fundamental bottleneck in continuous EEG processing by leveraging state space models to achieve linear scaling and real-time inference. This methodological innovation significantly advances medical monitoring and brain-computer interfaces, fields where long-range temporal dependencies are critical but computationally prohibitive with traditional attention mechanisms. While Paper 1 offers highly practical system-level improvements for speech translation, Paper 2's breakthrough in handling streaming, variable-length biological signals promises a deeper and more transformative impact across clinical applications and neuroscience.
Paper 1 introduces a groundbreaking application of State Space Models to EEG, solving the quadratic scaling bottleneck of attention models for continuous, long-horizon biological signals. Achieving >10x throughput and real-time inference has profound real-world applications in clinical neurology and brain-computer interfaces. While Paper 2 provides a valuable empirical analysis of training instabilities in multi-agent LLM workflows, Paper 1 represents a more significant architectural leap. Its ability to efficiently process variable-length clinical data positions it to have a broader and more transformative scientific impact in medical AI.
Paper 1 addresses a fundamental bottleneck in continuous physiological signal processing by enabling real-time, global inference of EEG data without the quadratic scaling of attention mechanisms. Its >10x throughput improvement and SOTA results present a major breakthrough with broad applications across healthcare, neuroscience, and brain-computer interfaces. In contrast, Paper 2 applies similar sequence-modeling techniques to a narrower, domain-specific problem (AV-pedestrian interactions), making Paper 1's methodological and practical contributions more widely impactful.
Paper 2 has higher potential impact due to a clearer methodological and systems innovation (causal Mamba/SSM for streaming EEG with linear-time scaling plus a tailored multi-stage self-supervised objective), strong real-world applicability (real-time continuous EEG monitoring), and broad relevance across ML, neuroscience, and medical devices. The claimed SOTA across multiple datasets and >10x throughput suggests solid empirical rigor and immediate deployability. Paper 1 introduces an important evaluation lens for LLM organizational “process alignment,” but its impact is narrower, more context-dependent, and primarily conceptual/measurement-focused with less direct generalization to high-stakes deployment performance gains.
Paper 2 exposes a critical flaw in LLM evaluation, demonstrating that standard accuracy metrics in CoT distillation mask deteriorating reasoning quality in high-stakes domains. This challenges fundamental assumptions in AI safety and evaluation, offering broad implications across the entire deep learning community. While Paper 1 presents an innovative and highly efficient SSM approach for EEG monitoring, Paper 2's findings address a pressing, widespread issue in the rapidly expanding field of LLM reasoning, likely triggering a broader paradigm shift in model evaluation.
Paper 1 addresses a fundamental limitation in processing long-sequence physiological data, introducing a novel architectural approach (causal SSMs) that enables real-time, continuous EEG monitoring. This has profound implications for clinical neurology, brain-computer interfaces, and neuroscience. In contrast, Paper 2 focuses on optimizing diffusion models for mobile hardware, which, while highly practical for consumer tech, offers less fundamental scientific innovation and a narrower interdisciplinary impact.
Paper 2 introduces a novel State Space Model approach to overcome significant bottlenecks in long-sequence, real-time EEG processing. By achieving SOTA results and a >10x throughput increase over attention-based models, it offers substantial methodological innovation with broad applications in clinical monitoring and brain-computer interfaces. Paper 1, while useful, presents a more incremental and domain-specific improvement in dental CT image reconstruction.
Paper 1 likely has higher scientific impact due to broader relevance and timeliness: ensuring faithfulness in LLM-mediated agentic XAI is a pressing, cross-domain problem affecting many ML deployments. Its explicit verification framework is conceptually novel and generalizable, and the open-world benchmark could become a community standard for evaluating model-specific faithfulness. Paper 2 is strong and application-critical (real-time EEG), but its impact may be narrower to biosignal/time-series modeling and depends on robustness/clinical validation beyond benchmark SOTA.
Paper 1 offers a highly innovative approach by adapting state space models to real-time, continuous EEG monitoring. Its ability to solve long-range dependency issues while achieving >10x higher throughput and linear-time complexity provides a significant leap for neurotechnology and clinical diagnostics. While Paper 2 tackles relevant constraints in LLM agents, Paper 1 demonstrates a more direct, high-impact real-world application with strong methodological rigor and clear state-of-the-art advancements in healthcare and biomedical engineering.
CaMBRAIN introduces a fundamentally new architecture for EEG processing using causal state space models with a novel multi-stage self-supervised training pipeline, achieving SOTA results with >10x throughput improvement. It addresses a critical bottleneck in clinical neuroscience (real-time, continuous EEG monitoring) with broad medical applications. Paper 2, while insightful about MLLM explainability, is primarily an empirical evaluation study with narrower scope. CaMBRAIN's combination of architectural innovation, practical clinical utility, and methodological contributions (streaming SSMs for biosignals) gives it substantially higher impact potential across both ML and healthcare domains.
While Paper 1 offers a significant breakthrough in real-time EEG processing with state space models, Paper 2 tackles one of the most pressing and field-defining challenges in artificial intelligence: scalable oversight and AI alignment. By providing statistical guarantees and finite-time bounds for controlling autonomous agents without relying on heuristic assumptions, Paper 2 promises a broader, more critical impact on the safe deployment of advanced, agentic AI systems across all domains.
Paper 1 has higher likely scientific impact: it advances sequence modeling for long, streaming biomedical time series by adapting causal state space models (Mamba/SSM) and proposing a tailored self-supervised pipeline to retain sparse, long-range clinical events—an approach broadly applicable beyond EEG (other biosignals, sensors) and timely given SSM interest. Its claims of linear-time inference and large throughput gains address a fundamental scalability bottleneck. Paper 2 is impactful for industry and applied ML/marketing, but is more domain-specific and its core innovation (utility-weighted contrastive loss) is a narrower methodological step.
Paper 1 bridges novel state space models with continuous physiological signal processing, solving a fundamental bottleneck in EEG analysis. By enabling real-time, continuous inference with >10x throughput, it has profound implications for clinical monitoring and brain-computer interfaces. While Paper 2 offers a valuable improvement in LLM alignment, Paper 1 demonstrates broader interdisciplinary impact, higher methodological innovation in adapting SSMs for streaming bio-signals, and clear real-world medical applications.