New software processes huge amounts of single-cell data

Comprehensive analysis of large gene-expression datasets

13-Feb-2018 - Germany

Scientists at the Helmholtz Zentrum München have developed a program that helps manage enormous datasets. The software, named Scanpy, is a candidate for analyzing the Human Cell Atlas.

“It’s about analyzing gene-expression data of a large number of individual cells,” explains lead author Alex Wolf of the Institute of Computational Biology (ICB) at Helmholtz Zentrum München. He developed Scanpy together with his colleague Philipp Angerer in the Machine Learning Group of Prof. Dr. Dr. Fabian Theis. In addition to his position at Helmholtz Zentrum, Theis is also a professor of mathematical modelling of biological systems at the Technical University of Munich. “New technical advances generate several orders of magnitude more data with a correspondingly greater information content,” Theis says. “However, the historically evolved software infrastructure for gene-expression analysis simply wasn’t designed to cope with the new challenges. New analytic methods are therefore needed.”

The race for the Human Cell Atlas

According to Theis, a major international research project could also benefit from the software. A team of international scientists is compiling a reference database, called the Human Cell Atlas, which holds data on the gene activity of all human cell types. “For this project, and in a growing number of other projects in which databases are combined, it is important to have scalable software,” says Theis. It is therefore no surprise that Scanpy is currently a candidate for helping to analyze the Human Cell Atlas.

“The publication of Scanpy marks the first software that allows comprehensive analysis of large gene-expression datasets with a broad range of machine-learning and statistical methods,” explains Wolf, describing the achievement. “The software is already being used by a number of groups around the world, notably at the Broad Institute of Harvard University and the Massachusetts Institute of Technology, MIT.”

Technologically, the application breaks new ground: whereas biostatistics programs are traditionally written in the programming language R, Scanpy is based on Python, the dominant language in the machine learning community. Another new feature is that graph-based algorithms lie at the heart of Scanpy. The usual approach regards cells as points in a coordinate system within gene-expression space; Scanpy's algorithms instead use a graph-like coordinate system. Rather than characterizing a single cell by the expression values of thousands of genes, the system characterizes each cell by identifying its closest neighbors, very much like the connections in social networks. In fact, to identify cell types, Scanpy uses the same algorithms as Facebook does for identifying communities.
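The neighbor-graph idea can be illustrated with a minimal sketch. This is not Scanpy's code or API: the function names `knn_graph` and `connected_components`, the toy expression values, and the choice of `k` are all assumptions made for illustration. For simplicity, groups of cells are found here as connected components of the neighbor graph, which is a much cruder stand-in for the community-detection algorithms the article refers to.

```python
import math


def knn_graph(cells, k):
    """Build an undirected k-nearest-neighbor graph (hypothetical helper).

    `cells` is a list of expression vectors; the result maps each cell
    index to the set of cell indices it is connected to.
    """
    graph = {i: set() for i in range(len(cells))}
    for i, cell in enumerate(cells):
        # Rank all other cells by Euclidean distance in expression space.
        others = sorted(
            (j for j in range(len(cells)) if j != i),
            key=lambda j: math.dist(cell, cells[j]),
        )
        for j in others[:k]:
            graph[i].add(j)
            graph[j].add(i)  # make the edge undirected
    return graph


def connected_components(graph):
    """Label every node with the index of its connected component.

    Simplified stand-in for community detection, not the algorithm
    used by Scanpy.
    """
    labels = {}
    current = 0
    for start in graph:
        if start in labels:
            continue
        labels[start] = current
        stack = [start]
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            for neighbor in graph[node]:
                if neighbor not in labels:
                    labels[neighbor] = current
                    stack.append(neighbor)
        current += 1
    return [labels[i] for i in range(len(graph))]


# Toy data (assumed): six cells, two genes, two well-separated groups.
cells = [
    (0.0, 0.1), (0.2, 0.0), (0.1, 0.3),
    (5.0, 5.1), (5.2, 4.9), (4.9, 5.3),
]
graph = knn_graph(cells, k=2)
labels = connected_components(graph)
print(labels)  # → [0, 0, 0, 1, 1, 1]
```

Once the graph is built, each cell is described only by its links to neighboring cells, not by its full expression vector, which is the shift in representation the paragraph above describes.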

Original publication

Wolf, A. et al.; "Scanpy: large-scale single-cell gene expression data analysis"; Genome Biology; 2018

https://www.bionity.com/en/news/1153440/new-software-processes-huge-amounts-of-single-cell-data.html
