It was always going to be a tough…
In a discovery that’s changing how scientists view nocturnal predators, researchers have captured the first direct evidence of a bat hunting and eating a bird midflight. According to a new study published in Science detailed how Europe’s…
The worldwide construction of data centers illustrates how AI development has advanced through scaling – expanding data volume, model size, and computational resources. LLM performance improves according to power-law scaling when all three expand together. (1) Furthermore, scaling model size and dataset size in tandem has been shown to be near-optimal. (2)
Yet progress has required more than scale: cleaning and curation have been equally crucial. GPT-3 demonstrated the power of large-scale training with filtered Common Crawl, (3) and T5 achieved major improvements by building the C4 dataset after aggressively removing duplicates and low-quality text. (4) PaLM 2 also reported the significant impact of data quality. (5) On the other hand, concerns have been raised about the potential exhaustion of high-quality web text, (6) and partly motivated by applications such as edge AI (the concept of running AI on devices or chips), efficiency efforts such as Mixture-of-Experts (MoE) and compact models are also being pursued in parallel. (7,8) In short, AI has advanced through scaling and curation, while efficiency has also evolved along a complementary path.
By contrast, neuroscience has yet to fully address the ‘limits of data accumulation’ or the ‘optimization of modeling.’ Advances in optical methods, such as two-photon calcium imaging, now enable large-scale simultaneous measurements at single-neuron resolution. Recently, functional data spanning multiple fields of view have been integrated with EM connectomics, yielding analyses on the order of 75,000 neurons in total – marking major progress. (9)
The greater challenge, however, lies in behavioral and environmental diversity. Laboratory experiments still focus largely on ‘screen-based stimuli’ and ‘controlled tasks.’ Although natural scene stimuli are increasingly employed, (10) real-world contexts are far harder to reproduce. Consider sudden crowd surges in a train station, unexpected issues at immigration control, or nighttime evacuation after a major earthquake with power outages and aftershocks. Such scenarios are common in life, but even if reproduced and recorded, the resulting datasets would be rare and highly specialized. Thus, it becomes essential to examine how such data – naturally incorporating individual differences – can be meaningfully connected to others.
This contextual diversity makes integration particularly difficult. Unlike web text, which is relatively static and independent at scale, neural time-series data are strongly influenced by arousal, attention, individuality, apparatus, and surrounding environment. Therefore, standardized and shareable frameworks (NWB, BIDS, DANDI, OpenNeuro), (11) together with detailed metadata such as illumination, arousal state, and behavioral logs, are indispensable.
As a realistic step, I have proposed adding a ‘ten-minute spontaneous activity’ segment to each experiment. Spontaneous activity provides a statistical foundation less constrained by specific tasks or environments, reflecting arousal, attention, and individuality while serving as the substrate for task-evoked activity. This has been supported by findings from both human fMRI and mouse research. (12)
Moreover, resting brain activity exhibits scale-free long-range correlations lasting minutes to tens of minutes. (13) A ten-minute window thus captures the key temporal scales while remaining feasible as a unifying standard across laboratories. Longer recordings are, of course, preferable, but a two-step strategy – first establishing a ten-minute baseline and then extending it for refinement – is the most pragmatic approach.
Attaching this “ten-minute spontaneous activity” baseline forms the “adhesive dots” (as described in a previous article), enabling cross-comparison across studies (Fig.1).
This is not an abstract ideal: the role of resting-state structure as a foundation for interpreting and predicting task responses has been empirically demonstrated. (12)
The definition of spontaneous activity differs across species and paradigms. In humans, it is typically defined as an ‘eyes-open, fixation-rest state,’ whereas in animals it is categorized as ‘head-fixed, task-free’ or ‘freely moving without tasks.’ Importantly, spontaneous activity is not a static point, but a fluctuating dynamic influenced by arousal and microenvironmental factors. Thus, detailed metadata are indispensable. Notably, the diversity within spontaneous activity is far smaller than the vast diversity of tasks and environments.
This – the “Principle of External Complexity” – highlights that in situations like crowded trains or large gatherings, where one brain is surrounded by dozens or even thousands of other brains, environmental complexity can easily exceed an individual’s internal complexity, making neural data integration difficult. Focusing first on the limited variability of spontaneous activity provides AI with a practical intermediate target for translation and alignment.
The key lesson from LLMs is that breakthroughs emerged not from scaling alone but at the convergence of scaling, curation, and efficiency. In neuroscience, progress likewise requires addressing not only the expansion of neuron counts but also the challenge of behavioral and environmental diversity. As a preparatory step, standardizing the inclusion of a “ten-minute spontaneous activity” segment in each experiment – curated and shared as adhesive dots – would provide a common foundation for integration. This article has emphasized data-side strategies; the next will examine how AI can serve as the glue, through representational mapping and transformation learning, to connect fragmented datasets into a coherent understanding.
Audio Pro has unveiled the W-generation multi-room speaker systems, bringing together cutting-edge engineering, elegant design and new software.
“We’ve worked hard on refreshing our entire range, listening to consumer feedback while…
Purdue’s “In Our STEM ERA” Makerspace is among four Purdue Brand Studio campaigns that earned top honors at Ragan’s PR Daily 2025 Social Media & Digital Awards and Content Marketing…
By Rafiq Vayani
DUBAI: The Middle East Arthritis Foundation (MEAF), a non-profit dedicated to promoting health and hope, successfully hosted…
CNN International is launching a multiplatform show, called “CNN Creators,” from its new facility in Doha‘s Media City Qatar.
The new 30-minute weekly CNN International show — which will air on Thursdays at 11:30 a.m. ET — will…
ANNAPOLIS, Md. (Oct. 11, 2025)–The Morgan State men’s tennis team continued play on Saturday in the Gold…
Every week on Mondays, we like to take a look at what has been earning a lot of Followers on Steam to get a good idea of what’s been growing in the industry. This gives us a lot of information,…