Cornell Tech - AI-Generated Images Map Visual Functions in the Brain

By Jim Schnabel, Weill Cornell Medicine

Researchers at Weill Cornell Medicine, Cornell Tech and Cornell’s Ithaca campus have demonstrated the use of artificial-intelligence (AI)-selected natural images and AI-generated synthetic images as neuroscientific tools for probing the visual processing areas of the brain. The goal is to apply a data-driven approach to understand how vision is organized while potentially removing biases that may arise when looking at responses to a more limited set of researcher-selected images.

In the study, published Oct. 23 in Communications Biology, the researchers had volunteers look at images that had been selected or generated based on an AI model of the human visual system. The images were predicted to maximally activate several visual processing areas. Using functional magnetic resonance imaging (fMRI) to record the brain activity of the volunteers, the researchers found that the images did activate the target areas significantly better than control images.

The researchers also showed that they could use this image-response data to tune their vision model for individual volunteers, so that images generated to be maximally activating for a particular individual worked better than images generated based on a general model.

“We think this is a promising new approach to study the neuroscience of vision,” said study senior author Amy Kuceyeski, professor of mathematics in radiology and of mathematics in neuroscience in the Feil Family Brain and Mind Research Institute at Weill Cornell Medicine.

The study was a collaboration with the laboratory of Mert Sabuncu, professor of electrical and computer engineering at Cornell Engineering and at Cornell Tech, and of electrical engineering in radiology at Weill Cornell Medicine. The study’s first author was Dr. Zijin Gu, who was a doctoral student co-mentored by Sabuncu and Kuceyeski at the time of the study.

Making an accurate model of the human visual system, in part by mapping brain responses to specific images, is one of the more ambitious goals of modern neuroscience. Researchers have found, for example, that one visual processing region may activate strongly in response to an image of a face, whereas another may respond to a landscape. Scientists must rely mainly on noninvasive methods in pursuit of this goal, given the risk and difficulty of recording brain activity directly with implanted electrodes. The preferred noninvasive method is fMRI, which essentially records changes in blood flow in small vessels of the brain – an indirect measure of brain activity – as subjects are exposed to sensory stimuli or otherwise perform cognitive or physical tasks. An fMRI machine can read out these tiny changes in three dimensions across the brain, at a resolution on the order of cubic millimeters.

For their own studies, Kuceyeski and Sabuncu and their teams used an existing dataset comprising tens of thousands of natural images, with corresponding fMRI responses from human subjects, to train an AI-type system called an artificial neural network (ANN) to model the human brain’s visual processing system. They then used this model to predict which images, across the dataset, should maximally activate several targeted vision areas of the brain. They also coupled the model with an AI-based image generator to generate synthetic images to accomplish the same task.
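As a rough illustration of the selection step described above, the sketch below fits a simple predictive model and ranks candidate images by predicted response. It is not the study's method: the data are simulated, each "image" is a hypothetical 20-number feature vector, and closed-form ridge regression stands in for the artificial neural network. The names `fit_ridge` and `select_images` are invented for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 500 "images" as 20-dim feature vectors, and one
# simulated regional response per image (a real study would use fMRI data).
n_images, n_features = 500, 20
features = rng.normal(size=(n_images, n_features))
true_weights = rng.normal(size=n_features)
responses = features @ true_weights + 0.1 * rng.normal(size=n_images)


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression: w = (X^T X + alpha I)^-1 X^T y."""
    n = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n), X.T @ y)


def select_images(predicted, k):
    """Return indices of the k highest predictions and the k closest to the mean."""
    order = np.argsort(predicted)
    top_k = order[-k:][::-1]  # strongest predicted activators, best first
    avg_k = np.argsort(np.abs(predicted - predicted.mean()))[:k]
    return top_k, avg_k


# Fit on a training split, then rank held-out candidate images.
train, cand = slice(0, 400), slice(400, 500)
weights = fit_ridge(features[train], responses[train])
predicted = features[cand] @ weights
top_idx, avg_idx = select_images(predicted, k=10)

# Compare simulated responses to the two selected sets.
print("mean response, predicted maximal activators:", responses[cand][top_idx].mean())
print("mean response, predicted average activators:", responses[cand][avg_idx].mean())
```

In this toy setup, the "average activator" set plays the role of the control images: a useful model should yield a clearly higher measured response for the first set than for the second.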

“Our general idea here has been to map and model the visual system in a systematic, unbiased way, in principle even using images that a person normally wouldn’t encounter,” Kuceyeski said.

The researchers enrolled six volunteers and recorded their fMRI responses to these images, focusing on the responses in several visual processing areas. The results showed that, for both the natural images and the synthetic images, the predicted maximal activator images, on average across the subjects, did activate the targeted brain regions significantly more than a set of images that were selected or generated to be only average activators. This supports the general validity of the team’s ANN-based model and suggests that even synthetic images may be useful as probes for testing and improving such models.

In a follow-on experiment, the team used the image and fMRI-response data from the first session to create separate ANN-based visual system models for each of the six subjects. They then used these individualized models to select or generate predicted maximal-activator images for each subject. The fMRI responses to these images showed that, at least for the synthetic images, there was greater activation of the targeted visual region, a face-processing region called FFA1, compared to the responses to images based on the group model. This result suggests that AI and fMRI can be useful for individualized visual-system modeling, for example to study differences in visual system organization across populations.
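One way to picture the individualization step is as a group model adjusted with a single subject's data. The sketch below is an assumption-laden toy, not the team's procedure: the data are simulated, the model is linear, and the adjustment is a ridge regression shrunk toward the group weights. The function `personalize` and all variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n_features = 20

# Hypothetical group-level weights and one subject whose true weights differ.
group_weights = rng.normal(size=n_features)
subject_weights = group_weights + 0.5 * rng.normal(size=n_features)

# Simulated first-session data for this subject (image features and responses).
X = rng.normal(size=(100, n_features))
y = X @ subject_weights + 0.1 * rng.normal(size=100)
X_train, y_train, X_test, y_test = X[:60], y[:60], X[60:], y[60:]


def personalize(X, y, w_group, alpha=1.0):
    """Ridge regression shrunk toward w_group instead of toward zero."""
    residual = y - X @ w_group
    delta = np.linalg.solve(X.T @ X + alpha * np.eye(X.shape[1]), X.T @ residual)
    return w_group + delta


def mse(w, X, y):
    """Mean squared prediction error of weights w on data (X, y)."""
    return float(np.mean((X @ w - y) ** 2))


individual_weights = personalize(X_train, y_train, group_weights)
mse_group = mse(group_weights, X_test, y_test)
mse_individual = mse(individual_weights, X_test, y_test)
print(f"group model MSE: {mse_group:.3f}, individualized MSE: {mse_individual:.3f}")
```

Shrinking toward the group weights rather than toward zero means a subject with little data stays close to the group model, while a subject with more data can move further from it.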

The researchers are now running similar experiments using a more advanced version of the image generator, called Stable Diffusion.

The same general approach could be useful in studying other senses such as hearing, they said. Kuceyeski also hopes ultimately to study the therapeutic potential of this approach.

“In principle, we could alter the connectivity between two parts of the brain using specifically designed stimuli, for example to weaken a connection that causes excess anxiety,” she said.

Many Weill Cornell Medicine physicians and scientists maintain relationships and collaborate with external organizations to foster scientific innovation and provide expert guidance. The institution makes these disclosures public to ensure transparency. For this information, see the profile for Amy Kuceyeski.

Jim Schnabel is a freelance writer for Weill Cornell Medicine.

This story originally appeared in the Cornell Chronicle.

