GEN, STAT cite CZI datasets and models as advancements toward AI-based virtual cell models

Scatterplot of 800,000 human organoid cells forming color-coded tissue clusters, visualized in CZI’s CELLxGENE tool.
A look inside CZ CELLxGENE: each color shows a different tissue type from more than 800,000 human organoid cells, helping power AI models like TranscriptFormer.

As the scientific community progresses toward AI-based virtual cell models, openly sharing datasets and models is paramount for the field to advance. Two articles in ​​GEN and STAT cover the announcement of a publicly available Perturb-seq dataset from AI drug developer Xaira Therapeutics, co-founded by Nobel Laureate and CZI grantee Dr. David Baker, and also credit CZI as a leader in openly sharing observational data through its CZ CELLxGENE platform. Along with other publicly available datasets, CZ CELLxGENE was used to train TranscriptFormer, a cross-species generative AI model built by CZI that further expands the field’s capacity to understand and simulate biology. The articles also mention CZI’s Billion Cells Project, an effort to generate an unprecedented one billion cell dataset to fuel rapid progress in AI model development for biology.

With leaders like CZI, Xaira and others opening up large-scale, high-quality datasets and tools, the research community is closer than ever to unlocking how cells behave — and how to treat disease at the cellular level.

###

About the Chan Zuckerberg Initiative
The Chan Zuckerberg Initiative was founded in 2015 to help solve some of society’s toughest challenges — from eradicating disease and improving education, to addressing the needs of our local communities. Our mission is to build a better future for everyone. For more information, please visit chanzuckerberg.com.

News

  • Science: Laser-boosted microscopes could reveal new drug targets, sharpen views inside cells

    Enhanced cryo–electron microscopy promises to bring previously elusive proteins into view

  • C&EN: A new, ultrabright laser powers a cryo-EM advance 15 years in the making

    Phase contrast enables structures of smaller proteins, could drive in-cell proteomics

  • Microscope Breakthrough Will Open Unprecedented View into Our Cells

    Biohub and UC Berkeley show that the laser phase plate, a revolutionary device with a laser 100 million times brighter than the Sun, dramatically improves images obtained through cryo-electron microscopy, giving scientists a new window into the molecular underpinnings of disease