You do not think in sentences first. You think in structures.
Language is sequential — about 40 bits per second. Vision is parallel — about 100,000,000. The most important interface we have is the one we forget we are using.
Every era invents a new way to see.
Twelve milestones. Each one expanded what a single human mind could externalize — and so what civilization could compute on.
Eight people who did their thinking on paper — not in language.
The pattern is consistent: the drawing is not a record of the thought. The drawing is the thought. Language arrives later, to translate it for the rest of us.
Seven layers between the retina and the city.
Just as civilization is a layered OS, cognition is one too. Each upper layer rides on the discrimination capacity of the layer below; each lower layer is invisible to the layer above.
Ten interfaces for thinking with both hands.
Every one of these is a posture for the mind. The pen on a tablet is a different brain than the pen on Notion. Pick deliberately.
Civilization is visualized information.
Watch the same content pass through five compressions: prose → diagram → system → simulation → built reality. Each step gives up specificity and gains operability.
Machines learn to see — and to render back.
Six families of systems. Together they make the loop closed: a thought is sketched in your head, drawn by a model, evaluated by your eye, redrawn. The drawing-as-thinking loop is now a wire.
Foundational texts and figures cited above.
The next leap in intelligence is not faster language. It is programmable perception.
If 99% of what your brain is doing every second is vision, and the major engineering frontier in front of us is teaching machines to share that visual layer, then the next century of cognition will not be argued in sentences. It will be drawn.
“Language is sequential. Vision is parallel.”“语言是序列。视觉是并行。”