Published:
Frontier models no longer agree on how to handle attention, yet they are converging on the same design goals while decisive gains increasingly come from training.
I am Bojian Yin, an Associate Professor at the Institute of Automation, Chinese Academy of Sciences. My research lies at the intersection of deep learning, brain-inspired intelligence, and foundational AI. I study the mathematical mechanisms that make intelligent learning efficient, adaptive, and robust.
More broadly, I want to bring the efficiency and adaptability of biological intelligence to modern AI. Today's models learn slowly, forget what they have seen, and demand enormous resources; I search for the mathematical principles behind learning and reasoning that could let machines learn the way brains do, continually, efficiently, and robustly, and turn those principles into real algorithms and systems. This pursuit has led to two first-author papers in Nature Machine Intelligence.
My research focuses on the mathematical and brain-inspired principles of learning, and on turning them into robust, generalizable algorithms for large language models and agentic systems. My work spans several interconnected areas:
Feel free to reach out if you're interested in collaborating or just chatting about learning algorithms, brain-inspired AI, or LLMs and agents.
Published:
Frontier models no longer agree on how to handle attention, yet they are converging on the same design goals while decisive gains increasingly come from training.
Published:
A reflection on external time, internal time, and how Selective Update lets neural networks decouple sequence length from meaningful state changes.
Published:
A reflection on scaling laws, entropy, structure, and why AI may still need a deeper intelligence equation.
Published:
前沿模型在如何处理注意力上并未达成一致,却正在收敛到相同的设计目标;真正拉开差距的因素越来越来自训练。
Published:
从人的时间感与神经网络的时间结构出发,理解 Selective Update 如何让外部时间和内部时间脱钩。
Published:
从负熵、scaling law 与结构出发,思考 AGI 的资源边界、学习机制和下一轮结构跃迁。
I have worked across academia, national research institutes, and neuromorphic hardware startups, with experience in algorithm design, mathematical modeling, FPGA and embedded deployment, SDK optimization, and interdisciplinary collaboration.