Chatting with your cells: Natural-language AI for single-cell data analysis !









Using sophisticated RNA sequencing technology, biomedical researchers can measure the activity of our genes across millions of single cells, creating detailed maps of tissues, organs, and diseases. Analyzing these datasets requires a rare combination of skills: a deep understanding of the biology, and the ability to develop computer code that turns data into insights.


What if we could equip biomedical researchers with an AI assistant that sees the data, supports the analysis, knows about the biology, and is easy to talk to? This could give scientists a virtual, AI-based colleague with both biological and bioinformatics expertise to support them in their research.

Toward this goal, researchers led by Christoph Bock, Principal Investigator at the CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences and Professor at the Medical University of Vienna, have developed CellWhisperer.

CellWhisperer is an AI method and software tool that links gene expression with descriptive text across more than a million biological samples. It provides an AI chat box to investigate complex biology in English language, unburdened by the complexities of computer code.

This study, published in Nature Biotechnology, demonstrates how AI creates a new way for scientists to interact with their data when studying the biological foundations of diseases.


From genes to text—and vice versa

CellWhisperer uses multimodal deep learning on gene activity profiles and matched biological text, which the authors curated from public databases with the help of AI models. Combining these two data modalities, it becomes possible to search massive datasets with text-based queries such as "Show me immune cells from the inflamed colon of patients with autoimmune diseases."

The CellWhisperer multimodal AI further integrates a large language model that was trained to emulate discussions between biologists and bioinformaticians when analyzing data.

Chatting with CellWhisperer thus sounds a bit like talking to a bioinformatics colleague, relying on CellWhisperer's view of the biological data and the biological knowledge of the large language model. For example, users can ask CellWhisperer about genes that are active in cells of interest, and let the model comment on potential biological implications.

CellWhisperer is built into a user-friendly web frontend based on the popular CELLxGENE browser and freely accessible online.

"By training on experimental data of 20,000 studies from the last two decades, CellWhisperer learned about the biological roles of genes and cells," explains co-first author Moritz Schaefer, a former Postdoctoral Researcher in Bock's research group at CeMM and now at Stanford University.

"This way, CellWhisperer is prepared to analyze new single-cell RNA sequencing data from many areas, making biomedical data exploration easier and more exciting."


Discover the latest in science, tech, and space with over 100,000 subscribers who rely on Phys.org for daily insights. Sign up for our free newsletter and get updates on breakthroughs, innovations, and research that matter—daily or weekly.


A step toward AI research agents

To illustrate CellWhisperer's potential for biological discovery, the team applied it to single-cell RNA sequencing data of human embryonic development. With basic queries such as "heart" or "brain," the model identified developmental time points, cell populations, and marker genes associated with human organ formation. Many of these markers matched known developmental genes, while others pointed to previously overlooked candidates.

"CellWhisperer is not just making biomedical research easier, it helps me understand what is going on in the cells that I am studying," says Peter Peneder, co-first author at the St. Anna Children's Cancer Research Institute.

"Science is teamwork, and with CellWhisperer, an AI research assistant has joined our team. CellWhisperer really helps with exploratory research—getting a first impression of a new dataset and figuring out where to dig deeper. It supports and empowers us as human scientists," emphasizes Bock.



International Research Data Analysis Excellence Awards

Theme: Exploring Recent Research and Advancements in Research Data Analysis

The International Research Data Analysis Excellence Awards celebrate groundbreaking contributions in the field of research data analysis. This year’s theme, "Exploring Recent Research and Advancements in Research Data Analysis," highlights the latest innovations, methodologies, and transformative applications that drive scientific discovery and practical solutions.

By recognizing outstanding researchers, teams, and organizations, these awards aim to:

Honor Excellence – Acknowledge remarkable achievements in data-driven research and innovation.


Visit Our Website : researchdataanalysis.com
Nomination Link : researchdataanalysis.com/award-nomination
Registration Link : researchdataanalysis.com/award-registration
member link : researchdataanalysis.com/conference-abstract-submission
Awards-Winners : researchdataanalysis.com/awards-winners
Contact us : rda@researchdataanalysis.com

Get Connected Here:
==================
Facebook : www.facebook.com/profile.php?id=61550609841317
Twitter : twitter.com/Dataanalys57236
Pinterest : in.pinterest.com/dataanalysisconference
Blog : dataanalysisconference.blogspot.com
Instagram : www.instagram.com/eleen_marissa

Comments