Basic Biology

  • $ 0.00 0 items

Introduction to genetics

Introduction to genetics

Genetics is a field of biology that studies how traits are passed from parents to their offspring. The passing of traits from parents to offspring is known as heredity, therefore, genetics is the study of heredity. This introduction to genetics takes you through the basic components of genetics such as DNA, genes, chromosomes and genetic inheritance.

Genetics is built around molecules called DNA. DNA molecules hold all the genetic information for an organism. It provides cells with the information they need to perform tasks that allow an organism to grow, survive and reproduce. A gene is one particular section of a DNA molecule that tells a cell to perform one specific task.

Heredity is what makes children look like their parents. During reproduction, DNA is replicated and passed from a parent to their offspring. This inheritance of genetic material by offspring influences the appearance and behavior of the offspring. The environment that an organism lives in can also influence how genes are expressed.

DNA - introduction to genetics

A DNA molecule is a nucleic acid, one of the four molecules of life . It comes in the form of a long, linear molecule referred to as a strand. Each strand of DNA is bonded to a second strand of DNA to form a DNA double helix. In eukaryotic cells , DNA is found in the nucleus as a tightly coiled double helix.

DNA molecules are replicated during cell division. When a cell divides, the two new cells contain all the same DNA that the original cell had.

In sexual reproduction with two parents, half of the DNA of the offspring is provided by each of the parents. The genetic material of a child is made from 50% of their mother’s DNA and 50% their father’s DNA.

A gene is a specific segment of a DNA molecule that holds the information for one specific protein. DNA molecules have a unique code for each gene which codes for their specific protein. Some organisms can have more than 100,000 different genes so they will have 100,000 unique sequences of DNA ‘code’.

Genes are the basic unit of heredity. The genes of an individual are determined by their parent or parents. A bacteria that is born by one parent cell splitting into two cells and has the exact same genes as their one parent cell.

Eye color - introduction to genetics

Physical traits such as eye color or height are often determined by the combination of multiple genes. The environment an individual lives in also impacts how genes are expressed.

Chromosomes

A chromosome is a structure made from tightly packed strands of DNA and proteins called histones. Strands of DNA are tightly wrapped around the histone proteins and form into long worm-shaped structures called ‘chromatids’. Two chromatids join together to form a chromosome.

Chromosomes are formed in the nucleus of a cell when a cell is dividing. It is possible to see chromosomes under an ordinary light microscope if the cell is in the right stage of cell division.

The number of chromosomes varies between species. Humans have 46 chromosomes. Some species can have many more than 100 chromosomes while others can have as little as two.

Genetic inheritance

Inheritance is the backbone of genetics and is an important topic to cover in an introduction to genetics. Long before DNA had been discovered and the word ‘genetics’ had been invented, people were studying the inheritance of traits from one generation to the next.

Genetic inheritance occurs both in sexual reproduction and asexual reproduction . In sexual reproduction, two organisms contribute DNA to produce a new organism. In asexual reproduction, one organism provides all the DNA and produces a clone of themselves. In either, genetic material is passed from one generation to the next.

Experiments performed by a monk named Gregor Mendel provided the foundations of our current understanding of how genetic material is passed from parents to their offspring.

Last edited: 31 August 2020

Want to learn more?

Visit these pages on our website: Genetics |  DNA replication | Genetic Inheritance | Transcription & translation

Campbell Biology | Biology Resources

This is world’s #1 textbook for beginning biologists and has been hugely valuable to me over the years. This is the resource that I recommend above anything else for aspiring biologists.

Introduction to cells

eBook - $2.95

Also available from Amazon , Book Depository and all other good bookstores.

What does DNA stand for?

Know the answer? Why not test yourself with our quick 20 question quiz

MRS GREN

Library Home

Introduction to Genetics

essay introduction about genes

Natasha Ramroop Singh, Kamloops, British Columbia

Copyright Year: 2009

Publisher: Thompson Rivers University

Language: English

Formats Available

Conditions of use.

Attribution-NonCommercial-ShareAlike

Learn more about reviews.

Reviewed by James Langeland, Professor, Kalamazoo College on 1/30/23

This text does what it claims to do. It provides an introductory overview of a broad swath of genetics. read more

Comprehensiveness rating: 4 see less

This text does what it claims to do. It provides an introductory overview of a broad swath of genetics.

Content Accuracy rating: 4

No glaring errors. One could always nitpick any text book.

Relevance/Longevity rating: 3

The text is relevant, but not particularly unique in any sense. One could find virtually the same information in any number of genetics textbooks, presented in largely the same way. A major problem here is that the filed is presented more or less historically with many of the experiments and concepts being described having little to no relevance to genetics today. This is a problem with many texts so I do not single this one out.

Clarity rating: 4

As with many open source texts, this one suffers from substandard figures, which directly influences clarity. The words on the age are fine, but the adage is true-a picture can be worth a thousand words. The mainstream publishers spent a lot of money on figures and it shows--they can be really good.

Consistency rating: 4

No comments here.

Modularity rating: 4

There seem to be appropriate and logical chapter and section breaks.

Organization/Structure/Flow rating: 3

The flow is the same as nearly any other genetics textbook. It suffers from a rigid historical framework. Better than most at Muller's morphs however!

Interface rating: 5

No problems here. I do really like the integrated you tube links. I did not dive into the content of those videos (beyond the scope of my review), but the fact that they are there in abundance is a good use of the open source approach.

Grammatical Errors rating: 5

No problems here.

Cultural Relevance rating: 3

No comment.

A very timely section on SARS-Cov-2 at the end! Rich with study questions and answers. Genetics is and should be very problem based, so this is good. I appreciate what is being offered here and I understand the market. There is nothing "wrong" with this textbook. There is also no wow factor that would cause me to adopt it at this time.

Table of Contents

  • Chapter 1- Mendel's First Law and Meiosis
  • Chapter 2- Mendel's Second Law: Independent Assortment
  • Chapter 3- The Cell Cycle and Mitosis
  • Chapter 4- Pedigree Analysis
  • Chapter 5- The Complementation Test
  • Chapter 6- Alleles at a Single Locus
  • Chapter 7- The Central Dogma- Mutations and Biochemical Pathways
  • Chapter 8- Gene Interactions 
  • Chapter 9- Linkage and Recombination Frequency
  • Chapter 10- Sex Chromosomes & Sex Linkage
  • Chapter 11- Recombination Mapping of Gene Loci
  • Chapter 12- Physical Mapping of Chromosomes and Genomes
  • Chapter 13- Genes and COVID-19 Susceptibility in Humans 

Ancillary Material

About the book.

Genetics, otherwise known as the Science of Heredity, is the study of biological information, and how this information is stored, replicated, transmitted and used by subsequent generations. The study of genetics can be sub-divided into three main areas: Transmission Genetics, Molecular Genetics, and Population Genetics. In this Introductory text, the focus is on Transmission or Classical Genetics, which deals with the basic principles of heredity and the mechanisms by which traits are passed from one generation to the next. The work of Gregor Mendel is central to Transmission Genetics; as such, there is a discussion about the pioneering work performed by him along with Mendel’s Laws, as they pertain to inheritance. Other aspects of Classical Genetics are covered, including the relationship between chromosomes and heredity, the arrangement of genes on chromosomes, and the physical mapping of genes.

About the Contributors

Natasha Ramroop Singh , Thompson Rivers University

Contribute to this Page

  • Biology Article

Genetics is the branch of biology that deals with the study of heredity and its biological process. It also involves the study of genes, genomes and the cell cycle.

What is Genetics?

Genetics is termed as the study to understand the functioning of inheritance of traits from parents to offspring. The groundwork on which heredity stands is known as inheritance. It is defined as the procedure by which characteristics are handed down from one generation to the other. Gregor Johann Mendel is known as the “Father of Modern Genetics” for his discoveries on the basic principles of heredity .

Variation, as the name suggests is the amount of dissimilarity that exists between children and their parentages. It can be determined to keep in view the behaviouristic, cytological, physiological, and morphological characters of individuals fitting into similar species.

Some of the major reasons that variation are

  • Genetic/Chromosomal rearrangement.
  • Mutated genes due to the influence of the ecosystem.
  • Crossing over.

Let us have a detailed look at genetics notes to learn about genes and the principle of inheritance.

Genetics

Law of Inheritance by Gregor Mendel

Garden Pea (Pisum Sativum) was the plant that Mendel experimented on for 7 years to get to the point to propose the laws of inheritance in live creatures. Mendel carefully chose seven distinct characteristics of Pisum Sativum for the investigation concerning hybridization. Mendel used true-breeding lines i.e. those that go through constant self-pollination and display steady characteristic inheritance.

Also Read:    Mendel’s Laws of Inheritance

Principles of Inheritance

When Mendel observed the monohybrid cross he proposed two laws of inheritance-

Law of Dominance – Distinct elements termed as factors control the characteristics. These factors at all times exist as a couple. One of the constituent genes of the couple dominates over the former.

Law of Segregation – Alleles don’t blend and the two characteristics are recuperated all through the gamete formation (in the F2 generation). The characters are apart from each other and pass on to diverse gametes. Comparable types of gametes are produced by Homozygous and Heterozygous produces diverse sorts of a gamete with varied characteristics.

Also Refer:  Principles of Heredity

Incomplete Dominance

It is the discovery that was done after Mendel’s work. Incomplete dominance is the situation in which both the alleles do not display a dominant trait resulting in a fine combination or a midway amid the characteristics of the alleles.

Explore more:  Incomplete dominance

Codominance

When two alleles lack the dominant-recessive association and thus the duo affects the creature together.

Law of Independent Assortment

The separation of one set of characteristics is autonomous of the other set of characters when they are pooled in a hybrid.

The Chromosomal Theory of Inheritance

Both genes and chromosomes exist in sets of two. The homologous chromosome contains the two alleles of a gene pair in the homologous sites. The coupling and split of a set of chromosomes will cause a split in the set of genes (factor) they carry. This united knowledge is termed the Chromosomal Theory of Inheritance.

Sex Determination

A particular nuclear arrangement was perceived by Henking. He perceived that this particular nuclear arrangement was found in only fifty per cent of sperms. He termed this body as x. Later, it was observed that the ova which only obtained the X chromosome matured and were born as females and those that didn’t receive only X chromosomes were born as males. Thus, the X- chromosome was termed a sex chromosome and the remaining ones were termed autosomes.

The occurrence due to which a modification in DNA happens and causes a variation in the phenotype and genotype of a creature is termed a Mutation .

Explore more:  Determination Of Sex

Genetic Disorders

Disorders of a Mendelian nature include:

  • Haemophilia.
  • Sickle Cell Anaemia.
  • Phenylketonuria.

Disorders of a chromosomal nature include:

  • Down’s syndrome.
  • Klinefelter’s Syndrome.
  • Turners Syndrome.

Explore more:  Chromosomal Abnormalities

Learn more in detail about Genetics, its importance, applications and other related topics at BYJU’S Biology

Quiz Image

Put your understanding of this concept to test by answering a few MCQs. Click ‘Start Quiz’ to begin!

Select the correct answer and click on the “Finish” button Check your score and answers at the end of the quiz

Visit BYJU’S for all Biology related queries and study materials

Your result is as below

Request OTP on Voice Call

BIOLOGY Related Links

Leave a Comment Cancel reply

Your Mobile number and Email id will not be published. Required fields are marked *

Post My Comment

essay introduction about genes

Register with BYJU'S & Download Free PDFs

Register with byju's & watch live videos.

Encyclopedia Britannica

  • History & Society
  • Science & Tech
  • Biographies
  • Animals & Nature
  • Geography & Travel
  • Arts & Culture
  • Games & Quizzes
  • On This Day
  • One Good Fact
  • New Articles
  • Lifestyles & Social Issues
  • Philosophy & Religion
  • Politics, Law & Government
  • World History
  • Health & Medicine
  • Browse Biographies
  • Birds, Reptiles & Other Vertebrates
  • Bugs, Mollusks & Other Invertebrates
  • Environment
  • Fossils & Geologic Time
  • Entertainment & Pop Culture
  • Sports & Recreation
  • Visual Arts
  • Demystified
  • Image Galleries
  • Infographics
  • Top Questions
  • Britannica Kids
  • Saving Earth
  • Space Next 50
  • Student Center

initial proposal of DNA structure

Who discovered the structure of DNA?

Endoplasmic reticulum. cell biology

Our editors will review what you’ve submitted and determine whether to revise the article.

  • National Center for Biotechnology Information - The Structure and Function of DNA
  • Khan Academy - DNA
  • Healthline - DNA explained and explored
  • Biology LibreTexts - DNA
  • Live Science - DNA: Definition, Structure & Discovery
  • Genetic Home Reference - What Is DNA?
  • National Human Genome Research Institute - Deoxyribonucleic Acid (DNA) fact sheet
  • Nature - What Rosalind Franklin truly contributed to the discovery of DNA’s structure
  • DNA - Children's Encyclopedia (Ages 8-11)
  • DNA - Student Encyclopedia (Ages 11 and up)

initial proposal of DNA structure

What does DNA do?

Deoxyribonucleic acid (DNA) is an organic chemical that contains genetic information and instructions for protein synthesis . It is found in most cells of every organism. DNA is a key part of reproduction in which genetic heredity occurs through the passing down of DNA from parent or parents to offspring.

What is DNA made of?

DNA is made of nucleotides . A nucleotide has two components: a backbone, made from the sugar deoxyribose and phosphate groups, and nitrogenous bases, known as cytosine , thymine , adenine , and guanine . Genetic code is formed through different arrangements of the bases.

The discovery of DNA’s double-helix structure is credited to the researchers James Watson and Francis Crick , who, with fellow researcher Maurice Wilkins , received a Nobel Prize in 1962 for their work. Many believe that Rosalind Franklin should also be given credit, since she made the revolutionary photo of DNA’s double-helix structure, which was used as evidence without her permission.

Can you edit DNA?

Gene editing today is mostly done through a technique called Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), adopted from a bacterial mechanism that can cut out specific sections in DNA. One use of CRISPR is the creation of genetically modified organism (GMO) crops.

What’s the difference between DNA and RNA?

DNA is the master blueprint for life and constitutes the genetic material in all free-living organisms. RNA uses DNA to code for the structure of proteins synthesized in cells . Learn more about the differences between DNA and RNA.

Recent News

DNA , organic chemical of complex molecular structure that is found in all prokaryotic and eukaryotic cells and in many viruses . DNA codes genetic information for the transmission of inherited traits.

genetically modified humans

A brief treatment of DNA follows. For full treatment, see genetics: DNA and the genetic code .

Learn how Francis Crick and James Watson revolutionized genetics by discerning DNA's structure

The chemical DNA was first discovered in 1869, but its role in genetic inheritance was not demonstrated until 1943. In 1953 James Watson and Francis Crick , aided by the work of biophysicists Rosalind Franklin and Maurice Wilkins , determined that the structure of DNA is a double-helix polymer , a spiral consisting of two DNA strands wound around each other. The breakthrough led to significant advances in scientists’ understanding of DNA replication and hereditary control of cellular activities.

essay introduction about genes

Each strand of a DNA molecule is composed of a long chain of monomer nucleotides . The nucleotides of DNA consist of a deoxyribose sugar molecule to which is attached a phosphate group and one of four nitrogenous bases : two purines ( adenine and guanine ) and two pyrimidines ( cytosine and thymine ). The nucleotides are joined together by covalent bonds between the phosphate of one nucleotide and the sugar of the next, forming a phosphate-sugar backbone from which the nitrogenous bases protrude. One strand is held to another by hydrogen bonds between the bases; the sequencing of this bonding is specific—i.e., adenine bonds only with thymine, and cytosine only with guanine.

Explore Paul Rothemund's DNA origami and its future application in medical diagnostics, drug delivery, tissue engineering, energy, and the environment

The configuration of the DNA molecule is highly stable, allowing it to act as a template for the replication of new DNA molecules, as well as for the production ( transcription ) of the related RNA (ribonucleic acid) molecule. A segment of DNA that codes for the cell’s synthesis of a specific protein is called a gene .

DNA replicates by separating into two single strands, each of which serves as a template for a new strand. The new strands are copied by the same principle of hydrogen-bond pairing between bases that exists in the double helix. Two new double-stranded molecules of DNA are produced, each containing one of the original strands and one new strand. This “semiconservative” replication is the key to the stable inheritance of genetic traits.

essay introduction about genes

Within a cell, DNA is organized into dense protein-DNA complexes called chromosomes . In eukaryotes , the chromosomes are located in the nucleus , although DNA also is found in mitochondria and chloroplasts . In prokaryotes , which do not have a membrane-bound nucleus, the DNA is found as a single circular chromosome in the cytoplasm . Some prokaryotes, such as bacteria , and a few eukaryotes have extrachromosomal DNA known as plasmids , which are autonomous , self-replicating genetic material. Plasmids have been used extensively in recombinant DNA technology to study gene expression.

Finding prehistoric family ties with modern DNA

The genetic material of viruses may be single- or double-stranded DNA or RNA. Retroviruses carry their genetic material as single-stranded RNA and produce the enzyme reverse transcriptase , which can generate DNA from the RNA strand. Four-stranded DNA complexes known as G-quadruplexes have been observed in guanine-rich areas of the human genome .

  • BiologyDiscussion.com
  • Follow Us On:
  • Google Plus
  • Publish Now

Biology Discussion

Essay on DNA: Meaning, Features and Forms | Genetics

essay introduction about genes

ADVERTISEMENTS:

In this article we will discuss about:- 1. Meaning of DNA 2. Features of DNA 3. Molecular Structure 4. Components 5. Forms.

  • Essay on the Forms of DNA

Essay # 1. Meaning of DNA:

A nucleic acid that carries the genetic information in the cell and is capable of self-replication and RNA synthesis is referred to as DNA. In other words, DNA refers to the molecules inside cells that carry genetic information and pass it from one generation to the next. The scientific name for DNA is deoxyribonucleic acid.

Essay # 2. Features of DNA:

The main features of DNA are given below:

i. Location:

In eukaryotes, DNA is found both in nucleus and cytoplasm. In the nucleus it is a major component of chromosome, whereas in cytoplasm it is found in mitochondria and chloroplasts. In prokaryotes, it is found in the cytoplasm.

ii. Structure:

Mostly the DNA structure is double stranded in both eukaryotes and prokaryotes. However, in some viruses DNA is single stranded. DNA is a double stranded molecule held together by weak bonds between base pairs of nucleotides. The four nucleotides in DNA contain the bases: adenine (A), guanine (G), cytosine (C), and thymine (T).

iii. Shape:

In eukaryotes, the DNA is of linear shape. In prokaryotes and mitochondria; the DNA is circular.

iv. Replication:

The DNA is capable of self-replication. This is the only chemical which has self-replicating capacity. The DNA replicates in semi-conservative manner. In eukaryotes, DNA replicates during the S-Phase of the cell cycle.

There are different forms of DNA such as A, B, C, D and Z DNA. The DNA may be linear or circular. It may be double stranded or single stranded. It may be right handed or left handed. The DNA may be repetitive or unique. It may be nuclear or cytoplasmic.

vi. Functions:

DNA plays important role in various ways. It is used in transcription i.e. synthesis of mRNA which in turn is used in protein synthesis. It carries genetic information from one generation to the next generation. DNA stores and transmits the genetic information in cells.

It forms the basis for genetic code. The genes are made of DNA and are responsible for passing on traits from generation to generation. DNA contains the genetic instructions for the development and functioning of living organisms. Thus it is the substance of heredity.

Essay # 3. Molecular Structure of DNA :

The double helical structure of DNA was identified by Watson and Crick in 1953. This brilliant research work resulted in significant breakthrough in understanding the gene function. This structure has been verified in many different ways and is universally accepted. James Watson and Francis Crick were awarded Nobel prize in 1958 for this significant contribution in the field of molecular biology.

The important features of their model of DNA are as follows:

1. Two helical polynucleotide chains are coiled around a common axis, the chains run in opposite directions.

2. The purine and pyrimidine bases are on the inside of the helix whereas the phosphate and deoxyribose units are situated on the outside. Also the planes of the base residues are perpendicular to the helix axis. While the planes of the sugar residues are almost at right angles to those of the bases.

Watson Crick Model

3. The diameter of the helix is 2 nm. Adjacent bases are separated by 0.34 nm along the helix axis. Hence the helix repeats itself every 10 residues on each chain at intervals of 3.4 nm.

4. The two chains are held together by hydrogen bonds formed between pairs of bases. Pairing is highly specific. Adenine pairs with thymine, guanine always pairs with cytosine. A = T, G = C.

5. The sequence of bases along the polynucleotide chain is not restricted. The precise sequence of bases carries the genetic information.

6. The sugar-phosphate backbones of the two DNA strands wind around the helix axis like the railing of a spiral staircase.

7. The bases of the individual nucleotides are on the inside of the helix, stacked on top of each other like the steps of a spiral staircase.

Essay # 4. Components of DNA :

DNA molecule is a polymer which is composed of several thousand pairs of nucleotide monomers. Union of several nucleotides together leads to the formation of polynucleotide chain. The monomer units of DNA are nucleotides, and the polymer is known as a “polynucleotide.” Each nucleotide consists of a 5-carbon sugar (deoxyribose), a nitrogen containing base attached to the sugar, and a phosphate group.

There are three components of DNA, viz:

(1) Nitrogenous bases,

(2) Deoxyribose sugar, and

(3) Phosphate group.

These are briefly discussed below.

i. Nitrogenous Bases :

Nucleotides are also known as nitrogenous bases or DNA bases. Nitrogenous base are of two types, viz. pyrimidines and purines.

Pyrimidines:

Main features of pyrimidines are given below:

(i) These are single ring structures,

(ii) These are of two types namely cytosine and thymine,

(iii) They occupy less space in DNA structure,

(iv) Pyrimidine is linked with deoxyribose sugar at position 3.

Main features or purines are given below:

(i) They are double ring compounds.

(ii) They are of two types, viz. adenine and guanine.

(iii) They occupy more space in DNA structure.

(iv) Deoxyribose sugar is linked at position 9 of purine.

Thus, in DNA there are four different types of nitrogenous bases, viz. adenine (A), guanine (G), cytosine (C) and thymine (T). In RNA, the pyrimidine base thymine is replace by uracil.

Base Pairing:

The purine and pyrimidine bases always pair in a definite fashion. Adenine will always pair with thymine and guanine with cytosine. Adenine and thymine are joined by double hydrogen bonds while guanine and cytosine are joined by triple hydrogen bonds. However, these bonds are weak which help in separation of DNA strands during replication.

ii. Deoxyribose Sugar :

This is a pentose sugar having five carbon atoms. The four carbon atoms are inside the ring and the fifth one is with CH 2 group. This has three OH groups on 1, 3 and 5 carbon positions. Hydrogen atoms are attached to carbon atoms one to four. In RNA, the sugar ribose is similar to deoxyribose except that it has OH group on carbon atom 2 instead of H group.

iii. Phosphate :

The phosphate molecule is arranged in an alternate manner to deoxyribose molecule. Thus there is deoxyribose on both sides of phosphate. The phosphate is joined with carbon atom 3 of deoxyribose at one side and with carbon atom 5 of deoxyribose on the other side.

Nucleosides and Nucleotides :

A combination of deoxyribose sugar and nitrogenous base is known as nucleoside and a combination of nucleoside and phosphate is called nucleotide.

Nucleoside = Deoxyribose Sugar + Nitrogenous base

Nucleotide = Deoxyribose + Nitrogenous base + Phosphate

Thus, a nucleotide is a nucleoside with one or more phosphate groups covalently attached to it. Nucleosides differ from nucleotides in that they lack phosphate groups. The four different nucleosides of DNA are deoxyadenosine (dA), deoxyguanosine (dG), deoxycytosine (dC), and deoxythymidine (dT).

DNA Backbone :

The DNA backbone is a polymer with an alternating sugar-phosphate sequence. The deoxyribose sugars are joined at both the 3′-hydroxyl and 5′-hydroxyl groups to phosphate groups in ester links, also known as “phosphodiester” bonds.

Essay # 5. Forms of DNA :

Depending upon the nucleotide base per turn of the helix, pitch of the helix, tilt of the base pair and humidity of the sample, the DNA can be observed in four different forms namely, A, B, C and D. The comparison of A, B and Z forms of DNA is presented in Table 15.1.

i. B-form :

This is the same form of DNA proposed by Watson and Crick.

Main features of B form of DNA are given below:

1. This is the most common form of DNA.

2. It is observed when humidity is 92% and salt concentration is high.

3. The coiling is in the right direction.

4. The number of base is 10 per turn of helix.

5. The pitch is 3.4 nm.

6. The sugar phosphate linkage is normal.

7. The helix is narrower and more elongated than A form.

8. The major groove is wide which is easily accessible to proteins.

9. The minor groove is narrow.

10. The conformation is favored at high water concentrations.

11. The base pairing is nearly perpendicular to helix axis.

12. The sugar puckering is C2′-endo.

ii. A-form :

1. This form is observed when the humidity of the sample is 75%.

2. The coiling is in the right direction.

3. The number of bases is 10.7 per turn of helix.

4. The pitch is 2.8 nm.

5. The sugar phosphate linkage is normal.

6. The major groove is deep and narrow which is not easily accessible to proteins.

Comparison of A,B and Z Forms of DNA

7. The minor groove is wide and shallow which is accessible to proteins, but information content is lower than major groove.

8. The helix is shorter and wider than B form.

9. The conformation is favored at low water concentrations.

10. The base pairs are tilted to helix axis.

11. The sugar puckering is C3′-endo.

iii. Z-form :

1. The helix has left-handed coiling pattern.

2. The number of bases is 12 per turn of helix.

3. The pitch is 4.5 nm.

4. The sugar phosphate linkage is zigzag.

5. The major “groove” is not really a groove.

6. The minor groove is narrow.

7. The helix is narrower and more elongated than A or B form.

8. The base pairing is nearly perpendicular to helix axis.

9. The conformation is favored by high salt concentrations.

10. The sugar puckering is C: C2 endo, G : C2 exo.

Related Articles:

  • Structural Features of DNA | Genetics
  • Watson-Crick Model of DNA| Genetics

Essay , Biology , Genetics , Molecular Genetics , Nucleic Acid , DNA

  • Anybody can ask a question
  • Anybody can answer
  • The best answers are voted up and rise to the top

Forum Categories

  • Animal Kingdom
  • Biodiversity
  • Biological Classification
  • Biology An Introduction 11
  • Biology An Introduction
  • Biology in Human Welfare 175
  • Biomolecules
  • Biotechnology 43
  • Body Fluids and Circulation
  • Breathing and Exchange of Gases
  • Cell- Structure and Function
  • Chemical Coordination
  • Digestion and Absorption
  • Diversity in the Living World 125
  • Environmental Issues
  • Excretory System
  • Flowering Plants
  • Food Production
  • Genetics and Evolution 110
  • Human Health and Diseases
  • Human Physiology 242
  • Human Reproduction
  • Immune System
  • Living World
  • Locomotion and Movement
  • Microbes in Human Welfare
  • Mineral Nutrition
  • Molecualr Basis of Inheritance
  • Neural Coordination
  • Organisms and Population
  • Photosynthesis
  • Plant Growth and Development
  • Plant Kingdom
  • Plant Physiology 261
  • Principles and Processes
  • Principles of Inheritance and Variation
  • Reproduction 245
  • Reproduction in Animals
  • Reproduction in Flowering Plants
  • Reproduction in Organisms
  • Reproductive Health
  • Respiration
  • Structural Organisation in Animals
  • Transport in Plants
  • Trending 14

Privacy Overview

CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

web counter

Nature vs. Nurture Essay

  • To find inspiration for your paper and overcome writer’s block
  • As a source of information (ensure proper referencing)
  • As a template for you assignment

Nature is the influence of genetics or hereditary factors in determining the individual’s behavior. In other words, it is how natural factors shape the behavior or personality of an individual. In most cases, nature determines the physical characteristics which in effect influence the behavior of an individual. Physical characteristics such as physical appearance, type of voice and sex which are determined by hereditary factors influences the way people behave.

Nurture on the other is the upbringing of an individual according to the environmental conditions. That is, the way individuals are socialized. Basically, nurture is the influence of environmental factors on an individual’s behavior.

According to this paradigm, an individual’s behavior can be conditioned depending on the way one would like it to be. Often, individuals’ behaviors are conditioned by the socio-cultural environmental factors. It is because of socio-cultural environmental conditions that the differences in the behavior of individuals occur.

Nature determines individual traits that are hereditary. In other words, human characteristics are determined by genetic predispositions which are largely natural. Hereditary traits are normally being passed from the parents to the offspring. They include characteristics that determine sex and physical make up. According to natural behaviorists, it is the genes that will determine the physical trait an individual will have. These are encoded on the individuals DNA.

Therefore, behavioral traits such as sexual orientation, aggression, personality and intelligence are also encoded in the DNA. However, scientists believe that these characteristics are evolutionary. That is, they change over time depending on the physical environment adaptability. Evolutionary scientists argue that changes in genes are as a result of mutations which are caused by environmental factors. Thus, natural environment determines individual characteristics which are genetically encoded in the DNA.

Conversely, individuals possess traits that are not naturally determined. These are characteristics that are learnt rather than being born with. These are traits which largely determined by the socio-cultural environmental factors or the way the individuals are socialized within the society depending on the societal values.

These traits are learnt as an individual develops and can easily be changed by the socio-cultural environment where the individual is currently staying. These characteristics include temperament, ability to master a language and sense of humor. Behavioral theorists believe that these traits can be conditioned and altered much like the way animal behavior can be conditioned.

From the discussion it can be deduced that individuals’ traits are determined by hereditary genes and at the same time can be natured. There are those traits that cannot be changed in an individual no matter what condition the person is exposed to. These traits are inborn and embed within the individual hereditary factors.

In most cases, they constitute the physical characteristics of an individual. They also determine the physical behaviors such as walking style, physical appearance and eating habits. At the same time there are learned characteristics which are normally being conditioned by the socio-cultural values. Individuals learn these traits from the way they are socialized within the immediate social or cultural environment. In other words, such behaviors are conditioned by the cultural values encouraged by the immediate society.

In conclusion, nature vs. nurture debate still remains controversial. However, all agree that nature and nurture play a crucial role in determining an individual’s behavior. Nature is associated with heredity roles in determining the individuals characteristics where as nurture is associated with the role of socio-cultural environment in determining the individuals behavior.

  • Human Development: Adolescence as the Most Important Age Range
  • Pieces of the Personality Puzzle: Individual Psychology Theory
  • A Middle Ground of the Nature vs. Nurture Debate
  • The Nature-Nurture Controversy
  • The Difficult Issue of Nature vs. Nurture
  • Practical aspects of the field of speech and language development
  • Conformity, Groupthink, and Bystander Apathy
  • Students Drinking Behavior at HBCU'S
  • Seduction and Flirtation Devices
  • Motivation Theories in Business Environment
  • Chicago (A-D)
  • Chicago (N-B)

IvyPanda. (2018, November 6). Nature vs. Nurture. https://ivypanda.com/essays/nature-vs-nurture/

"Nature vs. Nurture." IvyPanda , 6 Nov. 2018, ivypanda.com/essays/nature-vs-nurture/.

IvyPanda . (2018) 'Nature vs. Nurture'. 6 November.

IvyPanda . 2018. "Nature vs. Nurture." November 6, 2018. https://ivypanda.com/essays/nature-vs-nurture/.

1. IvyPanda . "Nature vs. Nurture." November 6, 2018. https://ivypanda.com/essays/nature-vs-nurture/.

Bibliography

IvyPanda . "Nature vs. Nurture." November 6, 2018. https://ivypanda.com/essays/nature-vs-nurture/.

Pardon Our Interruption

As you were browsing something about your browser made us think you were a bot. There are a few reasons this might happen:

  • You've disabled JavaScript in your web browser.
  • You're a power user moving through this website with super-human speed.
  • You've disabled cookies in your web browser.
  • A third-party browser plugin, such as Ghostery or NoScript, is preventing JavaScript from running. Additional information is available in this support article .

To regain access, please make sure that cookies and JavaScript are enabled before reloading the page.

  • Sponsored Article

Introduction to Gene Ontology

Gene ontology is an expanding knowledgebase that provides scientists with universal definitions to describe the functions of genes and their products. .

Uzma Rentia

Uzma Rentia is a third-year medical student at George Washington University. She has a background in bioinformatics techniques and has published work on ovarian cancer epigenetics, researched CAR T-cell therapies...

View full profile.

Learn about our editorial policies.

This article was review by Paul Sternberg , PhD from the California Institute of Technology and the Gene Ontology Consortium (GOC).

Stay up to date on the latest science with Brush Up Summaries.

A scientist in a lab coat analyzes data on a monitor

What Is Gene Ontology?

The Gene Ontology (GO) database is an effort to create an explicitly defined dictionary to describe the roles of gene products and their relationships to each other. 1,2   “It is essentially a vocabulary that allows scientists or biologists across all domains of life to describe what they are seeing… Every term is defined and has a relation to other terms,” said Paul Sternberg , a molecular biologist and geneticist at the California Institute of Technology and a principal investigator with the Gene Ontology Consortium (GOC) . 3 The GOC consists of a group of scientists dedicated to developing and maintaining the GO. Through the GOC-run GO, all biologists have a universal vocabulary to discuss the function of genes, even if they are working with vastly different species.

What Information Is Included in an Individual Ontology?  

The most basic unit of a specific ontology is a “ term ,” which refers to an attribute used to describe a specific gene function or product. 3 GO terms are themselves composed of several core elements.

  • An alphanumeric ID, which serves as a unique identifier
  • A human-readable name
  • The domain to which it belongs (molecular function, cellular component, or biological process)
  • A definition of the function along with a cited source
  • Its relationship to other terms in the ontology

All of the information held in a GO about a certain term falls under one of three broader biological domains . 4

  • Molecular Function: Activities conducted by gene products at the molecular level.
  • Cellular Component: The cellular entity in which the gene product carries out its function, or the larger complex the gene product is a part of.
  • Biological Process: The overarching process to which the gene product contributes.

Components of a gene ontology: A gene ontology becomes less specific as you move from the bottom to the top. The key base term for an individual ontology is at the bottom of the graph. As you move up, terms that describe the role and function of the base term and subsequent terms are added in layers that become progressively more general. These terms are connected by color-coded arrows, called edges, which indicate the relationships between terms.

Specific ontologies are loosely hierarchal structures that graph the relationships between these terms. They are represented using directed acyclic graphs (DAGs), a system in which general “parent” terms give rise to more specific “child” terms, like a flowchart. The terms are called “nodes,” and their relationships are described using “edges.” Common edge phrases include the following.

  • Is_a: This indicates that a “child” node is a subclass of a “parent” node. For example: the endoplasmic reticulum is an  organelle.
  • Part_of: This indicates a “child” node is a component of a “parent” node. In these designations, the “child” cannot exist without the “parent.” For example, the endoplasmic reticulum is part of the cell.
  • Regulates: This indicates that one process directly modifies another.   For example, certain terms may regulate transcription or regulate apoptosis. Edges that specify positive or negative regulations also exist.
  • Has_part: This indicates that a “parent” node absolutely requires the “child” node to exist or function. For example, the electron transport chain has part Coenzyme Q (CoQ). 

Blue circles arranged in five rows connected by wavy blue lines.

Gene Ontology annotations 

Another vital aspect of GO are annotation s, which establish a connection between a GO term (a function) and the associated gene or gene product it describes. 5 In simpler terms, GO annotations utilize GO terms to attribute a function to a gene or gene product. For instance, tumor protein 53 ( TP53 ), breast cancer gene 1 ( BRCA1 ), ABL proto-oncogene 1 ( ABL1 ), and vascular endothelial growth factor ( VEGF ) are all genes or gene products. The GO terms included in their respective annotations could encompass DNA-binding transcription factor activity (GO:0003700), DNA repair (GO:0006281), protein tyrosine kinase activity (GO:0004713, and positive regulation of angiogenesis (GO:0045766). To ensure accuracy, annotations must reference published scientific literature that backs the association between a GO term and a gene or gene product. 

Gene Ontology Knowledgebase 

Altogether, the standard ontologies, GO-CAM, and annotations make up the GO Knowledgebase. As of April 2024, the GO Knowledgebase contained 42,255 GO terms, 7,671,375 annotations, 1,536,921 gene products, and 5,404 species. 7 Included species range from model organisms such as Schizosaccharomyces pombe (fission yeast), Danio rerio (zebrafish), and Saccharomyces cerevisiae (brewer's or baker's yeast) to lesser-known organisms such as Gallus gallus (red junglefowl) and Dictyostelium discoideum (an amoeba).

The enormous range of terms, annotations, gene products, and represented species illustrates the significant expansion GO has undergone since its creation in 1998 . 1 It continues to grow to this day; ontologies are considered “dynamic” in that they grow and shift as scientific knowledge expands. Members of the GO Consortium collaborate with genomic databases, such as UniProt, MGI, and Reactome, to constantly update, review, and revise the information included in the Knowledgebase. The Consortium also welcomes community feedback for revisions and submissions for new annotations.

Using GO for research

Scientists mostly use GO to analyze the results of high-throughput data, such as next-generation sequencing and microarray results. Most often, GO is used in descriptive “-omics” papers that compare the genomic, proteomic, or transcriptomic differences between two cell types or conditions. Most of these experiments will yield thousands upon thousands of genes, making it unreasonable to sift through individual genes and provide a function for each.  In that case, analysis tools are employed to initially narrow down the results to a subset of critical genes exhibiting significant differential expression. Subsequently, researchers utilize GO-specific programs to infer conclusions regarding represented pathways and processes. Two foundational GO analyses are as follows.

  • Over Representation Analysis (ORA) / Enrichment Analysis : 8 This is one of the most common and simple GO analyses conducted. This analysis assigns inputted genes of interest to pathways and calculates whether each pathway is more or less represented than would be expected. This first requires placing differential expression results in the context of the original raw inputs. For example, say an analysis started with 1000 genes, 100 of which were related to apoptosis. If 100 significantly differentially expressed genes were identified, 10 would be expected to be related to apoptosis. A deviation from that expected number would be flagged, but it could also happen purely by chance. To determine which over- or under-representations are truly statistically significant, GO tools use statistical models such as hypergeometric distribution, Fisher’s exact test, or chi-square test to calculate a p-value. 
  • Functional Class Scoring (FCS) : 9 Functional class scoring refers to a class of methodologies, but the most well-known one is gene set enrichment analysis (GSEA). This approach takes ORA analysis further by ranking genes based on their differential expression value and direction of change. Next, an enrichment score for various pathways is determined based on where their associated genes fall on the ranked list. Unlike ORA, GSEA is largely unbiased because it takes all genes as inputs, not just those that were significantly differentially expressed. 

Commonly Used Tools for Gene Ontology Analysis 

There are a variety of Web-, R-, Python-, and even Java-based tools to conduct ORA, FCS, and other analyses. Below is a table describing some of the commonly used tools in GO analysis, though is by no means an exhaustive list

Table: Commonly-used GO analysis tools 

R package for analysis of both coding and non-coding genomic data

Supports both ORA and GSEA, also allows for comparison of gene lists from different experimental conditions or time points

Outputs include a variety of visually appealing graphs that integrate the tidy interface

Includes databases outside of GO, such as the Kyoto Encyclopedia of Genes and Genomes (KEGG)

Requires knowledge of R programming

Web-based tool for ORA/enrichment analysis 

Also allows for retrieval of orthologous genes and single-nucleotide polymorphisms

Has a variety of visualization options 

Includes databases outside of GO, such as KEGG, Human Protein Atlas, and Human Phenotype Ontology

Web-serve is user-friendly, especially for those without a programming background. Also available as an R package

A 2019 update addressed previous limitations by expanding the included species and databases

Web-based tool for ORA/enrichment analysis 

Similar benefits to gProfiler, but its user-interface makes it better for complete beginners

Enrichr has undergone improvements over the years, notably by enabling the inclusion of background gene lists. Background genes establish a standardized baseline against which differentially expressed genes can be consistently compared

Web-based tool 

Provides functional annotation and pathway information for input gene lists

Released in 2003, DAVID is one of the older GO analysis tools. While still a valuable resource, it may not be as up-to-date as other platforms

Web-based tool for ORA/enrichment analysis 

PANTHER is run by Paul Thomas, one of the principal investigators with the GOC

Developers have frequently updated PANTHER over the past two decades. One future update that developers have identified is better handling of gene fusion events 

Drawbacks and GO Research Pitfalls

One of the drawbacks of GO is that it is incomplete. “If you ask most experts to look at the Gene Ontology and the relationship with the terms and see if their domain is covered [they will say] it is not covered that well,” said Sternberg. Despite the considerable effort put into integrating new literature into the GO, there is always more ground to cover. Consequently, the biological snapshot provided by GO annotations and GO-CAMs likely only represents part of the story. Researchers still need to conduct their investigations and utilize their knowledge base when interpreting the results of a GO analysis. Additionally, researchers must also consider what a GO analysis might be leaving out. A single gene that is differentially expressed and responsible for a vital function may be overlooked in a GO analysis if the larger gene set it belongs to is not enriched. Conversely, a pathway may be enriched in a GO analysis, but that does not guarantee that it is contributing to a significant or interesting function. 

While GO serves as a valuable tool, it should not stop researchers from applying their expertise to interpret their data and results. Additionally, encouraging investigators to contribute to GO annotations themselves enhances the quality and comprehensiveness of the GO database. Sharing knowledge and contributing to a community resource are critical aspects of promoting the utility of GO.

1. Ashburner M, et al. Gene Ontology: tool for the unification of biology . Nat Genet . 2000;25(1):25-29.

2. Aleksander SA, et al. The Gene Ontology knowledgebase in 2023 . Genetics . 2023;224(1):iyad031.

3. About the GO . Gene Ontology. Published May 30, 2024. Accessed June 22, 2024. http://geneontology.org/docs/introduction-to-go

4. GO term elements . Gene Ontology Resource. Published May 30, 2024. Accessed June 12, 2024. http://geneontology.org/docs/GO-term-elements

5. Gene Ontology Consortium. The Gene Ontology (GO) database and informatics resource . Nucleic Acids Res . 2004;32(suppl_1):D258-D261. 

6. Hill DP, et al. Gene Ontology annotations: what they mean and where they come from . BMC Bioinformatics . 2008;9(5):S2.

7. Thomas PD, et al. Gene Ontology Causal Activity Modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems . Nat Genet . 2019;51(10):1429-1433.

8. Gene Ontology Resource . Gene Ontology Resource. Accessed June 12, 2024. http://geneontology.org/

9. Reimand J, et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap . Nat Protoc . 2019;14(2):482-517.

10. Pathways and gene sets: What is functional enrichment analysis? NIH Center for Cancer Research. Accessed May 28, 2024. https://bioinformatics.ccr.cancer.gov/btep/pathways-and-gene-sets-what-is-functional-enrichment-analysis/

11. Yu G, et al. clusterProfiler: an R package for comparing biological themes among gene clusters . OMICS J Integr Biol . 2012;16(5):284-287.

12. Reimand J, et al. g:Profiler—a web-based toolset for functional profiling of gene lists from large-scale experiments . Nucleic Acids Res . 2007;35(suppl_2):W193-W200.

13. Chen EY, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics . 2013;14:128.

14. Huang DW, et al. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources . Nat Protoc . 2009;4(1):44-57.

15. Thomas PD, et al. PANTHER: A library of protein families and subfamilies indexed by function . Genome Res . 2003;13(9):2129-2141.

Brush Up Summaries

Warning: The NCBI web site requires JavaScript to function. more...

U.S. flag

An official website of the United States government

The .gov means it's official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • Browse Titles

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

National Academies of Sciences, Engineering, and Medicine; National Academy of Medicine; National Academy of Sciences; Committee on Human Gene Editing: Scientific, Medical, and Ethical Considerations. Human Genome Editing: Science, Ethics, and Governance. Washington (DC): National Academies Press (US); 2017 Feb 14.

Cover of Human Genome Editing

Human Genome Editing: Science, Ethics, and Governance.

  • Hardcopy Version at National Academies Press

1 Introduction

Genome editing 1 is a powerful new tool for making precise additions, deletions, and alterations to the genome—an organism's complete set of genetic material. The development of new approaches—involving the use of meganucleases; zinc finger nucleases (ZFNs); transcription activator-like effector nucleases (TALENs); and, most recently, the CRISPR/Cas9 system—has made editing of the genome much more precise, efficient, flexible, and less expensive relative to previous strategies. With these advances has come an explosion of interest in the possible applications of genome editing, both in conducting fundamental research and potentially in promoting human health through the treatment or prevention of disease and disability. The latter possibilities range from editing somatic cells to restore normal function in diseased organs to editing the human germline to prevent genetic diseases in future children and their descendants.

As with other medical advances, each application comes with its own set of benefits, risks, regulatory questions, ethical issues, and societal implications. Important questions raised with respect to genome editing include how to balance potential benefits against the risk of unintended harms; how to govern the use of these technologies; how to incorporate societal values into salient clinical and policy considerations; and how to respect the inevitable differences, rooted in national cultures, that will shape perspectives on whether and how to use these technologies.

Recognizing both the promise and concerns related to human genome editing, the National Academy of Sciences (NAS) and the National Academy of Medicine (NAM) 2 convened the Committee on Human Gene Editing: Scientific, Medical, and Ethical Considerations to carry out the study that is documented in this report. While genome editing has potential applications for use in agriculture and nonhuman animals, 3 this committee's task (see Box 1-1 ) was focused on human applications. 4 The charge to the committee included elements pertaining to the state of the science in genome editing, possible clinical applications of these technologies, potential risks and benefits, whether standards can be established for quantifying unintended effects, whether current regulatory frameworks provide adequate oversight, and what overarching principles should guide the regulation of genome editing in humans.

Statement of Task.

  • STUDY CONTEXT

The NAS and the NAM Human Gene-Editing Initiative

In light of the promise of genome editing and the associated regulatory and ethical issues, the NAS and the NAM established an initiative to explore these issues in greater depth and facilitate U.S. and international dialogue on how to address them. The first activity of this Human Gene-Editing Initiative was the convening of the International Summit on Human Gene Editing: A Global Discussion jointly with the Chinese Academy of Sciences and The Royal Society of the United Kingdom. This 3-day event addressed a number of scientific advances in the development of modern genome-editing tools, potential medical uses of these tools in human patients, and ethical and social issues their uses might pose. The organizing committee released a statement that summarized its conclusions from the meeting ( NASEM, 2016d ). Panel chair David Baltimore also noted “we hope that our discussion here will serve as a foundation for a meaningful and ongoing global dialogue” ( NASEM, 2016d , p. 6). All three nations embraced the statement's call for continued research on gene editing, further deliberation with regard to heritable changes, and a continued public discourse on the topic. 5 The summit provided important input to the present study, as did other studies by the NAS and the NAM on related topics (see Box 1-2 ).

Related Studies of the NAS and the NAM.

This committee was convened to continue the dialogue initiated by the International Summit and to undertake a year-long, in-depth consensus study. As specified in its statement of task (see Box 1-1 ), the committee examined the state of the science in human genome editing, its potential applications, and the ethical issues that need to be considered in deciding how to govern the use of these powerful new tools. This report is the product of that study and, as with all other National Academies consensus studies, underwent peer review by an independent panel of experts. Additional activities of the Chinese Academy of Sciences and The Royal Society of the United Kingdom are anticipated, including another international summit to take place in China in 2018.

U.S. and International Policy Discussions

Among the earliest calls for a detailed examination of the implications of genome-editing technologies were those made by members of the scientific community engaged in developing these tools and advancing their clinical applications. In 2015 a group of investigators and ethicists, including CRISPR/Cas9 developers, met in Napa, California, and subsequently published a request for the community to explore the nature of human genome editing and provide guidance on its acceptable uses ( Baltimore et al., 2015 ). That same year, a number of articles and commentaries appearing in scientific journals and the popular press called attention to scientific and ethical challenges that would be posed by CRISPR/Cas9 and similar genetic tools ( Bosley et al., 2015 ; Editing Humanity, 2015 ; Lanphier et al., 2015 ; Maxmen, 2015 ; Specter, 2015 ).

Professional bodies, international organizations, and national academies of sciences and medicine further raised the profile of genome editing by issuing statements on its appropriate uses, particularly in reference to the potential for creating heritable genetic modifications. Among others, they included the U.K. Academy of Medical Sciences and a number of collaborative partners; the European Group on Ethics in Science and New Technologies, an advisory body to the president of the European Commission; the Council of Europe; and the International Society for Stem Cell Research ( AMS et al., 2015 ; Council of Europe, 2015 ; EGE, 2016 ; Friedmann et al., 2015 ; Hinxton Group, 2015 ; ISSCR, 2015 ). The United Nations Educational, Scientific, and Cultural Organization (UNESCO) (2015) issued updated guidance to reflect genome-editing advances. Others launched activities to examine the implications of genome editing in greater detail, including the Académie Nationale de Médecine (France) ( ANM, 2016 ); Institut Nationale de la Santé et de la Récherche Médicale (France) (INSERM; Hirsch et al., 2017 ); Berlin-Brandenburg Academy of Sciences and Humanities ( BBAW, 2015 ); National Academy of Sciences Leopoldina, in partnership with the Deutsche Akademie der Technikwissenshaften (National Academy of Science and Engineering: “acatech”), Deutsche Forschungsgemeinschaft (German Research Foundation: DFG), and Union der deutschen Akademien der Wissenschaften (Union of German Academies of Sciences and Humanities: Academien Union) ( National Academy of Sciences Leopoldina et al., 2015 ); Federation of European Academies of Medicine ( FEAM, UKAMS, and ANM, 2017 ); Royal Netherlands Academy of Arts and Sciences ( KNAW, 2016 ); Nuffield Council on Bioethics ( Nuffield Council, 2016b ); and others (see Box 1-3 ).

Excerpts from Selected Calls Around the World for Continued Study and Public Discussion.

The Technologies

New or improved tools facilitate scientific progress by making it possible to investigate new kinds of questions and to generate new solutions. In the area of health and medicine, scientists and clinicians have long sought to apply the techniques of molecular biology to understand basic biology—including embryonic development, physiology, and the immune and nervous systems—and to treat or prevent disease. Much progress has been made in elucidating the role of genetics in diseases, ranging from sickle-cell anemia, muscular dystrophy, and cystic fibrosis, to such conditions as deafness, short stature, and blindness. The development of many such diseases and conditions has a genetic component. Some result from straightforward single-gene changes, but most involve a complex interplay of genetic, environmental, and other factors that remain only imperfectly understood. Furthermore, genetic sequences themselves paint only part of the biological picture. Regulation of how and when genes are turned on and off, including the role of the epigenome, 6 continues to be actively explored. Controlled gene expression and epigenetic alterations influence how tissues develop and differentiate and have clinical ramifications in such areas as cancer and embryonic development.

Tools that enable investigators to alter DNA sequences in order to understand or improve their function are not new. Recent years, however, have seen the development of a suite of genome-editing tools that allow for easier, better controlled, and more accurate changes to DNA inside cells. These tools are based on exogenous enzymes that cut DNA at specific locations, combined with endogenous processes that repair the broken DNA, thereby enabling letters of the genetic code to be added, modified, or deleted. The speed with which this technology has been adopted in research laboratories and further adapted to tackle additional scientific challenges is a reflection of how powerful a technique the editing of genes and genomes will be for the scientific and clinical communities.

The earliest applications of nuclease-based genome-editing methods employed targeted recognition of specific DNA sequences by proteins: homing nucleases (also known as meganucleases), ZFNs, and TALENs. However, the recent development of RNA-based targeting has greatly simplified the process of genome editing. The first publications on the subject, in 2012-2013, explained how the CRISPR/Cas9 system, derived from a natural bacterial defense mechanism against infecting viruses, can be harnessed to make controlled genetic changes in any DNA, including that of human cells ( Cho et al., 2013 ; Cong et al., 2013 ; Jinek et al., 2012 , 2013 ; Mali et al., 2013 ). This was a game-changing advance. These methods have rapidly been adopted by scientists worldwide and have greatly accelerated fundamental research that has included altering cells in the laboratory to study the functions of particular genes, developing models for studies of human diseases using stem cells or laboratory animals, creating modified plants and animals to improve food production, and developing therapeutic uses in humans. Genome editing has rapidly become an invaluable core technology in research laboratories and biotechnology companies, and is already moving into clinical trials (e.g., Cyranoski, 2016 ; Reardon, 2016 ; Urnov et al., 2010 ).

Individual-Level Concerns

As with other types of medical interventions, whether genome editing can be used in patients will depend largely on understanding the safety and efficacy of the treatment and evaluating whether the anticipated benefits are reasonable with respect to the risk of adverse effects. Treatments based on genome editing are intended to make controlled modifications to specific portions of the DNA that affect the functions of their target(s) while avoiding changes to other portions whose alteration is not desired. The latter alterations, referred to as off-target events, could have consequences, many unnoticeable but others damaging, depending on their location and their effects. In general, human genome editing raises questions common to the process of researching and developing new treatments: which conditions or diseases are most suitable to address with these technologies, how to identify and evaluate off-target events and other potential side effects, and which patients are most appropriate for studies. As described in this report, regulatory systems for addressing the individual-level concerns associated with genome editing already exist in the United States and many other countries, but can be improved.

Societal-Level Concerns

The use of genome editing also has significant social dimensions that vary depending on the proposed application. The use of a genome-editing treatment whose effects are nonheritable and are restricted to an individual patient may not differ greatly from the use of a traditional drug or medical device. By contrast, making changes that may be inherited by future generations raises questions about the extent to which the long-term effects of proposed edits can be predicted and whether it is appropriate for humans to purposely alter any aspect of their genetic future ( Frankel and Chapman, 2000 ; Juengst, 1991 ; Parens, 1995 ). In addition, identifying the increased range of applications made possible by genome editing may be yet another challenge to conventional conceptions of what constitutes a disease or disability. Societal-level concerns are particularly acute with respect to genome-editing interventions aimed at enhancing human capabilities. Such applications also raise questions about how to define and promote fairness and equity ( President's Council on Bioethics, 2003 ). Moreover, as with other genetic technologies, such genome-editing applications may raise concerns about coercive and abusive eugenics programs of the past, which were based on faulty science and served discriminatory political goals ( Wailoo et al., 2012 ).

Looking Beyond Safety and Efficacy

Although the nature of the debate surrounding genome editing is not new, the tools available in the past for making genetic modifications in human cells were time-consuming, difficult, and expensive, and were unlikely to be used outside of specialized medical applications. Recent genome-editing technologies, particularly the CRISPR/Cas9 system, have greatly expanded the landscape of potential applications and potential users. Their rapid development and adoption also have shortened the timeline for discussion of what appropriate governance structures need to be identified or developed. As the safety and efficacy of these technologies continue to improve, the critical question will become not whether scientists and clinicians can use genome editing to make a certain change, but whether they should. There is already discussion of do-it-yourself (DIY) editing and the use of genome-editing tools by the biohacker and DIY biology communities, albeit in nonhuman organisms ( Brown, 2016 ; Ledford, 2015 ). Thorny issues around acceptable uses of the technology in humans will depend on more than scientific considerations, and may increasingly involve weighing factors beyond individual-level risks and benefits ( NRC, 1996 ).

Layered on the scientific and ethical issues associated with human genome editing is the question of how to govern its application so as to facilitate its appropriate use and avoid its misuse. Determining the limits of the technologies' uses and the regulatory mechanisms needed to enforce these limits will vary according to each nation's cultural, political, and legal context. But whether and how best to move human genome editing forward has implications for transnational scientific cooperation that require ongoing public discussion and input into policy making. There is ample precedent for scientists and other stakeholders to engage in just such activities, and this report is intended to build on points raised by a number of international conventions and declarations, such as the Oviedo Convention (1997), the International Declaration on Human Genetic Data (2003), and the Universal Declaration on Bioethics and Human Rights (2005) ( Andorno, 2005a , b ; UNESCO, 2004a , 2005 ).

  • STUDY APPROACH

To address its complex task (see Box 1-1 ), the committee included members with expertise in basic and clinical research, in the development of human genetic therapies, and in U.S. and international legal and regulatory frameworks. It included biologists, bioethicists, and social scientists, and incorporated perspectives from potentially affected patient and stakeholder communities. Because the ethical and social issues posed by human genome editing transcend national boundaries, the committee included not only U.S. members but also those who are citizens of or are currently working in Canada, China, Egypt, France, Germany, Israel, Italy, Spain, and the United Kingdom. Brief biographies of the committee members are found in Appendix D .

This study was informed not only by the International Summit described earlier, which immediately preceded the committee's first meeting, but also by review of the salient literature, additional meetings, and speakers who generously shared their knowledge with the committee. Further information on the process by which the committee conducted this study is provided in Appendix C .

In evaluating the implications of new genome-editing tools, the committee also reviewed scientific progress, ethical debates, and regulatory structures related to the use in humans of medical developments such as assisted reproductive technologies, stem cell therapies, gene transfer, and mitochondrial replacement techniques. These developments interface with those of genome editing because editing of stem cells has potential clinical applications for treating or preventing disease, and reproductive technologies would have to be used in combination with genome editing for any heritable application of the latter technologies. As these other technologies have advanced, legal and regulatory frameworks and ethical norms of conduct have been developed to provide guidance on their appropriate human uses and oversight ( Health Canada, 2016 ; HFEA, 2014 ; IOM, 2005 ; NASEM, 2016e ; NRC and IOM, 2007 , 2008 ; Nuffield Council, 2016a ; Präg and Mills, 2015 ; Qiao and Feng, 2014 ). The reports cited here helped provide a basis for the committee's assessment of the use of genome-editing tools in humans and are referenced in subsequent chapters where relevant.

  • ORGANIZATION OF THE REPORT

The report begins by reviewing international norms that are embodied in the set of overarching principles adopted by the committee for governance of human genome editing ( Chapter 2 ). The chapter continues with an overview of the U.S. regulation of research and clinical application of genome editing, drawing comparisons where appropriate to other national systems of oversight.

With this grounding in principles and regulation, Chapters 3 - 6 delve into human genome-editing technology and the scientific issues, regulatory context, and ethical implications of four specific applications. Laboratory research conducted in somatic cells and nonheritable laboratory research in human germ cells, gametes, or early-stage embryos is covered in Chapter 3 . Chapter 4 examines the uses of genome editing for somatic interventions focused on therapy, including fetal therapy. Chapter 5 addresses the use of genome-editing technology in germline cells for potential research and clinical therapeutic applications in human patients. Chapter 6 considers the potential use of human genome editing to enhance human functions rather than to treat or prevent disease or disability.

The subsequent chapter ( Chapter 7 ) turns from analysis of these categories of application to the role of public input in determining how genome-editing technology should be governed in the future, both in the United States and in other countries. The chapter considers public engagement for different categories of genome-editing applications and explores strengths and limitations of potential models for undertaking such public engagement.

Finally, Chapter 8 returns to the set of overarching principles and the responsibilities that flow from them in the context of human genome editing. The chapter pulls together the report's conclusions and recommendations in light of these fundamental concepts.

The term “genome editing” is used throughout this report to refer to the processes by which the genome sequence is changed by adding, replacing, or removing DNA base pairs. This term is used in lieu of “gene editing” because it is more accurate, as the editing could be targeted to sequences that are not part of genes themselves, such as areas that regulate gene expression.

The NAS and the NAM are referred to throughout this report simply as the National Academies, or the U.S. National Academies when discussed in relation to the academies of other nations. Until 2016, the NAM was known as the Institute of Medicine (IOM).

In January 2017, the U.S. Food and Drug Administration (FDA) issued revised draft guidance addressing the regulatory pathway for intentionally altered genomic DNA in plants and nonhuman animals. This would include DNA intentionally altered through genome editing. The guidance does not affect the regulatory pathway for human applications that are regulated as human drugs, devices, and biologics. See FDA “Regulation of Intentionally Altered Genomic DNA in Animals—Draft Guidance” (January 2017) at http://www ​.fda.gov/downloads ​/AnimalVeterinary ​/GuidanceComplianceEnforcement ​/GuidanceforIndustry ​/ucm113903.pdf (accessed January 30, 2017) and “Genome Editing in New Plant Varieties Used for Foods; Request for Comments” at https://www ​.regulations ​.gov/document?D=FDA-2016-N-4389-0001 (accessed January 30, 2017).

The regulatory roles of the federal departments and agencies and the overall framework for regulation of applications of biotechnology are outlined in “Modernizing the Regulatory System for Biotechnology Products: Final Version of the 2017 Update to the Coordinated Framework for the Regulation of Biotechnology” (January 4, 2017) and “National Strategy for Modernizing the Regulatory System for Biotechnology Products” (September 2016) ( https: ​//obamawhitehouse ​.archives.gov/blog ​/2017/01/04/increasing-transparency-coordination-and-predictability-biotechnology-regulatory [accessed January 30, 2017]).

Statement by Ralph J. Cicerone, President, U.S. National Academy of Sciences; Victor J. Dzau, President, U.S. National Academy of Medicine; Chunli Bai, President, Chinese Academy of Sciences; and Venki Ramakrishnan, President, The Royal Society ( http://www8 ​.nationalacademies ​.org/onpinews/newsitem ​.aspx?RecordID=12032015b [accessed January 24, 2017]).

The term “epigenome” refers to a set of chemical modifications to the DNA of the genome and to proteins and RNAs that bind to DNA in the chromosomes to affect whether and how genes are expressed.

  • Cite this Page National Academies of Sciences, Engineering, and Medicine; National Academy of Medicine; National Academy of Sciences; Committee on Human Gene Editing: Scientific, Medical, and Ethical Considerations. Human Genome Editing: Science, Ethics, and Governance. Washington (DC): National Academies Press (US); 2017 Feb 14. 1, Introduction.
  • PDF version of this title (3.9M)

In this Page

Recent activity.

  • Introduction - Human Genome Editing Introduction - Human Genome Editing

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

Connect with NLM

National Library of Medicine 8600 Rockville Pike Bethesda, MD 20894

Web Policies FOIA HHS Vulnerability Disclosure

Help Accessibility Careers

statistics

Loading metrics

Open Access

Peer-reviewed

Research Article

Pervasive tissue-, genetic background-, and allele-specific gene expression effects in Drosophila melanogaster

Roles Conceptualization, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

* E-mail: [email protected] (AGS)

Affiliation Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Munich, Germany

ORCID logo

Roles Formal analysis, Investigation, Visualization, Writing – review & editing

Affiliation Department of Insect Symbiosis, Max-Planck-Institute for Chemical Ecology, Jena, Germany

Roles Funding acquisition, Methodology, Writing – review & editing

Roles Conceptualization, Funding acquisition, Methodology, Writing – review & editing

  • Amanda Glaser-Schmitt, 
  • Marion Lemoine, 
  • Martin Kaltenpoth, 
  • John Parsch

PLOS

  • Published: August 23, 2024
  • https://doi.org/10.1371/journal.pgen.1011257
  • Peer Review
  • Reader Comments

This is an uncorrected proof.

Fig 1

The pervasiveness of gene expression variation and its contribution to phenotypic variation and evolution is well known. This gene expression variation is context dependent, with differences in regulatory architecture often associated with intrinsic and environmental factors, and is modulated by regulatory elements that can act in cis (linked) or in trans (unlinked) relative to the genes they affect. So far, little is known about how this genetic variation affects the evolution of regulatory architecture among closely related tissues during population divergence. To address this question, we analyzed gene expression in the midgut, hindgut, and Malpighian tubule as well as microbiome composition in the two gut tissues in four Drosophila melanogaster strains and their F1 hybrids from two divergent populations: one from the derived, European range and one from the ancestral, African range. In both the transcriptome and microbiome data, we detected extensive tissue- and genetic background-specific effects, including effects of genetic background on overall tissue specificity. Tissue-specific effects were typically stronger than genetic background-specific effects, although the two gut tissues were not more similar to each other than to the Malpighian tubules. An examination of allele specific expression revealed that, while both cis and trans effects were more tissue-specific in genes expressed differentially between populations than genes with conserved expression, trans effects were more tissue-specific than cis effects. Despite there being highly variable regulatory architecture, this observation was robust across tissues and genetic backgrounds, suggesting that the expression of trans variation can be spatially fine-tuned as well as or better than cis variation during population divergence and yielding new insights into cis and trans regulatory evolution.

Author summary

Genetic variants regulating gene expression can act in cis (linked) or in trans (unlinked) relative to the genes they affect and are thought to be important during adaptation because they can spatially and temporally fine-tune gene expression. In this study, we used the fruit fly Drosophila melanogaster to compare gene expression between inbred parental strains and their offspring in order to characterize the basis of gene expression regulation and inheritance. We examined gene expression in three tissues (midgut, hindgut, and Malpighian tubule) and four genetic backgrounds stemming from Europe and the ancestral range in Africa. Additionally, we characterized the bacterial community composition in the two gut tissues. We detected extensive tissue- and genetic background-specific effects on gene expression and bacterial community composition, although tissue-specific effects were typically stronger than genetic background effects. Genes with cis and trans regulatory effects were more tissue-specific than genes with conserved expression, while those with trans effects were more tissue-specific than those with cis effects. These results suggest that the expression of trans variation can be spatially fine-tuned as well as (or better than) cis variation as populations diverge from one another. Our study yields novel insight into the genetic basis of gene regulatory evolution.

Citation: Glaser-Schmitt A, Lemoine M, Kaltenpoth M, Parsch J (2024) Pervasive tissue-, genetic background-, and allele-specific gene expression effects in Drosophila melanogaster . PLoS Genet 20(8): e1011257. https://doi.org/10.1371/journal.pgen.1011257

Editor: Trudy F. C. Mackay, Clemson University, UNITED STATES OF AMERICA

Received: April 14, 2024; Accepted: July 30, 2024; Published: August 23, 2024

Copyright: © 2024 Glaser-Schmitt et al. This is an open access article distributed under the terms of the Creative Commons Attribution License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The gut RNA-seq and microbiome data that support the findings of this study are publicly available from National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA1094401. Gut non-ASE gene count and expression data are available from NCBI Gene Expression Omnibus (GEO) under the series accession number GSE263264. Malpighian tubule RNA-seq data are available under the NCBI GEO accession number GSE103645. All other relevant data are within the manuscript and its Supporting Information files.

Funding: This work was supported by a Deutsche Forschungsgemeinschaft (DFG, www.dfg.de ) grant (number 274388701) to JP as part of the priority program “SPP 1819: Rapid evolutionary adaptation”, and a DFG grant (KA2846/5-1, project number 347368302) to MK as part of FOR2682 “Seasonal temperature acclimation in Drosophila”. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Gene expression variation is extensive at all organismal levels, including among tissues [ 1 – 2 ], cells [ 3 – 4 ], or alleles [ 5 – 6 ] of the same individual, and underlies much of the phenotypic variation that we see among individuals, populations, and species [ 7 – 9 ]. A long-standing challenge in evolutionary genetics has been to identify and characterize this variation. Indeed, elucidating the scope and architecture of gene expression variation as well as the mechanisms that shape it is an integral part of better understanding complex phenotypic traits [ 10 – 12 ], such as body size or disease susceptibility, and their evolution.

At the DNA sequence level, genetically heritable variants can modulate expression in two general ways: cis -regulatory variants, such as those within enhancers or promoters, affect the expression of linked, nearby genes, while trans -regulatory variants, such as those affecting transcription factors or regulatory RNAs, affect the expression of unlinked genes that can be located anywhere in the genome (reviewed in [ 13 – 14 ]). One way to interrogate the relative contribution of these types of regulatory variants to gene expression variation in species such as Drosophila , where inbred, relatively isogenic strains are available, is to compare gene expression of two parental strains or species as well as expression of their alleles in F1 hybrids [ 15 ]. Due to linkage with the allele they regulate, cis -regulatory variants affect only one of the two F1 hybrid alleles, leading to allele-specific expression (ASE), while trans -regulatory variants equally affect both alleles in the hybrid and do not lead to ASE. While cis -regulatory variation is thought to accumulate and become more predominant over larger evolutionary distances, i.e. between species [ 16 – 18 ], trans -regulatory variation tends to be more common among individuals within a species [ 5 – 6 , 19 ]. However, deviations from this pattern of regulatory variation have been documented in Drosophila [ 20 – 23 ] as well as other species [ 24 – 25 ], which underscores that there remains much to learn about the evolution of gene expression regulation, especially over short evolutionary distances.

An advantage of utilizing ASE to investigate the regulation of gene expression is that both the genetic basis of expression variation (e.g. cis versus trans ) and the mode of expression inheritance (e.g., dominance versus additivity) can be assessed. Indeed, previous studies of ASE in Drosophila utilizing expression in F1 hybrids have found that environment [ 6 , 26 ], sex [ 27 – 28 ], genetic background [ 19 , 21 , 28 ], and body part or tissue [ 21 – 22 , 28 ] can affect regulatory architecture. However, previous studies have largely focused on single populations, long term lab strains, or comparatively closely related populations [ 5 – 6 , 20 – 22 , 26 – 28 ] (for an exception see [ 19 ]). Moreover, previous studies measured expression in whole animals, body parts (e.g. heads), single tissues, and/or highly functionally diverged tissues (i.e. testes versus ovaries or heads); thus, little is known about how regulatory architecture and inheritance vary among individual tissues that are spatially and/or functionally proximate. To investigate the effect of natural genetic variation from divergent populations on regulatory architecture in multiple functionally related, interconnected tissues, we analyzed messenger RNA-sequencing (mRNA-seq) data of midgut, hindgut, and Malpighian tubule tissues in four D . melanogaster strains and their F1 hybrids. Two of the strains were from a population in Umeå, Sweden [ 29 ], representing the northern edge of the species’ derived distribution, while the other two strains were from a population in Siavonga, Zambia, representing the species’ inferred ancestral range [ 30 ]. Since their divergence from ancestral populations ~12,000 years ago [ 31 ], derived D . melanogaster populations have had to adapt to new habitats, and previous studies have found evidence that at least some of the expression divergence detected between derived and ancestral African populations is adaptive [ 32 – 36 ].

The midgut, hindgut, and Malpighian tubules, which are analogous to the mammalian small and large intestines and kidneys, respectively, physically connect to and interact with one another at the midgut-hindgut junction and are part of the D . melanogaster digestive tract (midgut and hindgut, together with the foregut) and excretory system (hindgut and Malpighian tubules). Both systems play important roles in the regulation of homeostasis as well as the immune response [ 37 – 38 ] and the investigated tissues are known to engage in interorgan communication with each other, as well as with other tissues [ 37 – 39 ]. The excretory system is involved in waste excretion as well as ionic- and osmoregulation [ 38 ], while the digestive tract is an important modulator of food intake, nutrient absorption, energy homeostasis, and insulin secretion that can shape physiology and behavior through its interaction with the microbiome [ 37 , 40 ]. To investigate the effect of natural genetic variation from divergent populations on digestive tract microbiome composition, we further performed microbiome sequencing on the same gut samples for which we performed mRNA-seq. In both the mRNA-seq and microbiome data, we found extensive tissue- and genetic background-specific effects. From the ASE data, we found that genes with both cis and trans effects were more tissue-specific than genes with no differential expression regulation, although trans effects were more tissue-specific than cis effects. Despite the context specificity that we detected for regulatory architecture across tissues and genetic backgrounds, the increased specificity of trans effects was consistent, suggesting that trans- regulatory variation can be spatially fine-tuned as well as or, potentially, better than cis- regulatory variation.

We performed mRNA-seq in the midgut and hindgut of two isofemale D . melanogaster strains from the northern limit of the derived species range in Sweden (SU26 and SU58) and two strains from the ancestral species range in Zambia (ZI418 and ZI197) as well as F1 hybrids between the Swedish and Zambian strains (SU26xZI418, SU26xZI197, SU58xZI418, and SU58xZI197). We additionally reanalyzed previously published mRNA-seq data from the Malpighian tubule [ 19 ] in a subset of these genotypes (SU26, SU58, ZI418, SU26xZI418, and SU58xZI418). We detected 7,675–8,209 genes as expressed in the individual tissues, with 6,894 genes that could be analyzed in all genotypes in all tissues. We focus on the genes that could be analyzed in all examined genotypes and tissues unless otherwise indicated. When considering gene expression variation across all samples, biological replicates clustered strongly by tissue type ( Fig 1A ). Within tissues, replicates mostly clustered by genotype, although in the hindgut there was some overlap between SU58, ZI418 and their F1 hybrid, as well as SU26 and one of its F1 hybrids ( Fig 1 ).

thumbnail

  • PPT PowerPoint slide
  • PNG larger image
  • TIFF original image

The legend on the right indicates that replicates of each genotype share the same color, while shape indicates tissue.

https://doi.org/10.1371/journal.pgen.1011257.g001

Differential expression among tissues and genotypes

We detected 116–2,589 (mean 961–1,398) genes as differentially expressed between genotypes within each tissue ( Fig 2A ). However, gene expression divergence (i.e. the cumulative differences in expression across all analyzed genes, as measured by 1 –Spearman’s rho, ρ) between genotypes within each tissue was not significantly different among tissues ( t -test; Bonferroni-corrected P > 0.8 for all; S1A Fig ). Expression divergence tended to be lower between strains derived from the same population (i.e. Swedish strains were more similar to each other than to the Zambian strains and vice versa), although in the hindgut and Malpighian tubule, SU58 was equally or more similar to one or both Zambian strains than to the SU26 strain ( Fig 2A ). This pattern was not evident in the Malpighian tubule when all genes that could be analyzed in this tissue were included in the analysis ( S2 Fig ). When we compared expression within the same genotype among tissues, we detected 4,524–5,139 (mean 4,844) genes as differentially expressed between any two tissues ( Fig 2B ), 50–58% of which were differentially expressed in all pairwise tissue comparisons within a genotype and 1,619–1,880 of which were shared among at least two genotypes, with 1,045 genes differentially expressed among all tissues within all genotypes ( S1 Table ). Of these shared differentially expressed genes, 1,243–1,594 were consistently upregulated in the same tissue within the same genotype, with 700–924 genes consistently upregulated in the same tissue in all genotypes ( S1 Table ). Interestingly, overall gene expression divergence within the same genotype between the midgut and Malpighian tubule was significantly lower than gene expression divergence between either of these two tissues and the hindgut ( t -test; Bonferroni-corrected P < 5 x 10 −5 for both; S1C Fig ), suggesting that among these three tissues, expression within the same genetic background is most similar between the Malpighian tubule and the midgut. When we compared gene expression divergence among genotypes within tissues to gene expression divergence within the same genotype among tissues, gene expression divergence was higher among than within tissues (Bonferroni-corrected P = 8.58 x 10 −15 ; Figs 2C and S1A ). Thus, expression diverges more within a genotype among tissues than among genotypes within a tissue, suggesting that tissue is more predictive of gene expression than genotype.

thumbnail

A) The numbers of differentially expressed (DE) genes between genotypes within the midgut (triangles), hindgut (circles), and Malpighian tubule (squares) are shown above the diagonal, while expression divergence (as measured by 1 –ρ) between genotypes is shown below the diagonal. B) The numbers of differentially expressed genes between the same genotype among midgut (MG), hindgut (HG), and Malpighian tubule (MT) tissues are shown. Dashes indicate missing data. C) Expression divergence among genotypes within the same tissue (across genotype) versus expression divergence between the same genotype among tissues (across tissue). Significance was assessed with a t -test. *** Bonferroni-corrected P < 10 −14 .

https://doi.org/10.1371/journal.pgen.1011257.g002

Mode of expression inheritance is highly tissue- and genetic background-specific

In order to understand how the mode of expression inheritance varies among genotypes and tissues, we categorized genes according to their expression in the two parental strains and the respective F1 hybrid into the following categories (see Methods for more details): similar, P1 dominant, P2 dominant, additive, overdominant, and underdominant, with the Swedish strains being P1 and the Zambian strains P2 ( Fig 3 ; S2 and S3 Tables). For all backgrounds and tissues, the similar category (i.e. genes with similar expression in parents and hybrids) was the largest ( Fig 3A ) and showed the greatest overlap among tissues ( S3 and S4 Figs). The basic expression inheritance categories (those with genes showing additive or P1 or P2 dominant expression in hybrids in comparison to parents) were the next largest categories (Figs 3A , S3 , and S4 ), and typically similar numbers of genes were classified into these categories. However, there were some exceptions depending on category, background, and tissue ( Fig 3A ). For example, 1.5-fold more genes were categorized as dominant in either Swedish strain in comparison to the ZI418 strain in the midgut in comparison to the other tissues, while 3.3–4.6-fold more genes were categorized as dominant in the ZI418 strain in comparison to the SU26 strain in the midgut and Malpighian tubule in comparison to the hindgut. Similarly, 1.8–2.5-fold more genes were categorized as dominant in the ZI197 strain in comparison to either Swedish strain in the midgut than in the hindgut. Genes in these basic inheritance categories were often unique to both the tissue and category (Figs 3B , 3C , S3 and S4 ), with little overlap within each category across all three examined tissues (Figs 3 and S3 ). Unsurprisingly, in background combinations for which we only had data for two tissues, the overlap we detected within categories across tissues was higher ( S4 Fig ). The smallest number of genes were categorized into misexpression categories, i.e. genes showing either over- or underdominance in the hybrid in comparison to the parents (Figs 3A , 3C , S3 and S4 ). Genes in misexpression categories tended to be tissue-specific with little or no overlap among the examined tissues (Figs 3A , 3C , S3 and S4 ). Similar to what we observed for basic inheritance categories, certain combinations of genetic backgrounds and tissues showed larger numbers of misexpressed genes than others (Figs 3A , S3 , and S4 ). For example, we detected relatively high levels of misexpression in the SU26xZI418 background in the midgut and Malpighian tubule, but not the hindgut ( Fig 3A ). Taken together, our results suggest that the mode of expression inheritance is both tissue- and genetic background-specific.

thumbnail

A) The number of genes in each mode of expression inheritance category within the midgut (triangles), hindgut (circles), and Malpighian tubule (squares) at a 1.25-fold change cut-off. Results using alternative cut-offs or for individual tissue analyses can be found in S2 and S3 Tables (see Methods ). Dashes indicate missing data. B, C) Upset plots showing unique and overlapping genes within each tissue in the SU26xZI418 background as an example. Upset plots for the other genotypes can be found in S3 and S4 Figs. Horizontal bars represent the total number (num.) of genes in a tissue and inheritance category combination. Vertical bars represent the number of genes in an intersection (intersect.) class. A filled circle underneath a vertical bar indicates that a tissue and inheritance category combination is included in an intersection class. A single filled circle represents an intersection class containing only genes unique to a single tissue and inheritance category combination, while filled circles connected by a line indicate that multiple tissue and inheritance category combinations are included in an intersection class. Genes categorized into B) basic expression inheritance (inherit.), i.e. P1 dominant (P1 dom.), P2 dominant (P2 dom.), and additive (add.) and C) misexpression (misexpress.) categories are shown. Only intersection classes comprised of either a single tissue and inheritance category combination or an inheritance category in all examined tissues are shown. Additional intersection classes and upset plots for genes categorized into the similar category are shown in S3 Fig D) Phenotypic dominance ( h ) divergence (as measured by 1 –ρ) among backgrounds within the same tissue (across genotype) versus dominance divergence between the same background among tissues (across tissue). Significance was assessed with a t -test. The Bonferroni-corrected P value is shown.

https://doi.org/10.1371/journal.pgen.1011257.g003

Phenotypic dominance and the mode of expression inheritance.

In order to better understand potential variation in the magnitude of phenotypic dominance during expression inheritance, we calculated the degree of dominance, h . In order to compare the magnitude of dominance regardless of which allele was dominant, we calculated h such that values between 0 and 1 or 0 and -1 represent varying degrees of additivity and dominance, with values closer to -1 representing complete dominance of the Swedish background and 1 representing complete dominance of the Zambian background, while values outside this range represent cases of overdominance of the respective background (see Methods for more details). For all genetic backgrounds and tissues, we did not detect any significant difference in the overall magnitude of phenotypic dominance between the two parental backgrounds ( t -test, Bonferroni-corrected P > 0.6 for all). For the majority of tissues and genetic backgrounds, we did not detect differences in the magnitude of dominance within the same genetic background between tissues (Bonferroni-corrected P > 0.26 for all comparisons). We only detected a significant difference in the overall magnitude of dominance within the SU26xZI418 background between the midgut and the Malpighian tubule (Bonferroni-corrected P = 0.015), which may be driven by the large amount of misexpression that we detected in this background, particularly in the Malpighian tubule ( Fig 3A ). Overall, these results suggest that the differences we detected in the mode of expression inheritance among genetic backgrounds and tissues occur on the individual gene level rather than being driven by general, genome-wide changes in dominance. Overall dominance divergence among genetic backgrounds (i.e. the cumulative differences in dominance across all analyzed genes, as measured by 1 –ρ) was not significantly different between the midgut and hindgut (Bonferroni-corrected P = 0.264, S1B Fig ), but could not be compared to the Malpighian tubule for which only 2 background combinations were available. When we compared overall dominance divergence among genetic backgrounds within tissues to dominance divergence within the same genetic background among tissues, dominance divergence was significantly higher among than within tissues (Bonferroni-corrected P = 0.012, Fig 3D ). We observed a similar pattern when we examined gene expression divergence ( Fig 2C ), suggesting that in general divergence is higher among tissues than among different genetic backgrounds within a tissue. However, divergence was higher for dominance than for gene expression ( t -test, Bonferroni-corrected P < 10 −14 ), suggesting that phenotypic dominance of expression is much less conserved among tissues and genotypes than expression itself, although it is possible that this difference can be explained in part by differences in how each trait was measured as expression was measured in a single genotype but dominance was calculated based on three genotypes.

Genetic basis of expression variation is highly tissue- and genetic background-specific

In order to identify genes in our dataset with any level of cis -regulatory divergence between the parental alleles in any genetic background and tissue, we tested for ASE in genes for which we could distinguish between the parental alleles in the hybrid (see Methods ). Of the 4,305–4,592 genes we were able to analyze in all tissues of a genetic background, we detected 80–370 genes showing significant ASE (FDR <5%) depending on genetic background and tissue ( Table 1 ), with a total of 356, 408, 460, and 256 non-redundant genes detected as having ASE in any tissue in the SU26xZI418, SU58xZI418, SU26xZI197, and SU58xZI197 backgrounds, respectively, and a total of 958 genes in all tissues and backgrounds. Within each genetic background 55–86% of genes showing ASE in a particular tissue were unique to that tissue, while, within each tissue, 55–76% of ASE genes were unique to a single genetic background. Indeed, within each genetic background, only 9–73 genes were detected as having ASE in all examined tissues, with backgrounds in which only 2 tissues were examined sharing more ASE genes ( Table 1 ). Thus, allele-specific expression is largely tissue- and genetic background-specific.

thumbnail

https://doi.org/10.1371/journal.pgen.1011257.t001

In order to further understand how the genetic basis of expression varies among genetic backgrounds and tissues, we classified genes in each genetic background and tissue combination into six regulatory categories: “conserved”, “ cis -only”, “ trans -only”, “ cis + trans “, “ cis x trans “, “compensatory“, and “ambiguous”[ 5 ] (see Methods for more details). The proportion of genes falling into each regulatory category was dependent upon tissue and genetic background, although, in general, when considering genes with non-ambiguous regulatory divergence in all tissues and genetic backgrounds, the largest proportion fell into the trans -only category which contained 2.9–30.6-fold more genes than the cis -only category ( Fig 4A ). The midgut had a higher proportion of ambiguous genes and a smaller proportion of conserved genes than the other examined tissues ( Fig 4A ). We detected the most cis -only genes in the hindgut, with 2.3–4.2 fold more genes categorized as cis -only in comparison to the other examined tissues ( Fig 4A ). In comparison to other genetic backgrounds, the SU58xZI197 background had a higher proportion of ambiguous genes and a smaller proportion of conserved genes as well as 2.2–4.7- and 2.4–10.4-fold fewer genes categorized as cis -only and compensatory, respectively ( Fig 4A ). Within each genotype, genes with non-ambiguous regulatory divergence were often unique to both the tissue and regulatory category (Figs 4 , S5 , and S6 ), with little overlap within each category across all three examined tissues (Figs 4 and S5 ). Similarly, within each tissue, genes with non-ambiguous regulatory divergence were often unique to a genetic background, with 35–91% of genes unique to a single genetic background within a regulatory class and tissue, while 31–87% of genes were unique to the genetic background and tissue within a regulatory class ( S7 Fig ). Overlap among all genotypes within a tissue was highest for genes in the trans -only category with 2.8–30.7-fold more shared genes categorized as trans -only in comparison to other non-ambiguous regulatory divergence categories ( S7 Fig ). Overall, our results suggest that the genetic basis of expression inheritance is both tissue- and genetic background-specific.

thumbnail

A) The number of genes in each regulatory class within the midgut (triangles), hindgut (circles), and Malpighian tubule (squares). Results for individual tissue analyses can be found in S5 Table (see Methods ). Dashes indicate missing data. B) Upset plot showing unique and overlapping genes with non-ambiguous regulatory divergence within each tissue in the SU26xZI418 background. Upset plots for other genotypes can be found in S5 and S6 Figs. Only intersection classes comprised of either a single tissue and regulatory category combination or a regulatory category in all examined tissues are shown. Additional intersection classes are shown in S5 Fig . C) Dominance and D) magnitude of dominance h for genes categorized as cis -only ( c , light) and trans -only ( t , dark) in each background and tissue. Magnitude of dominance was calculated as the absolute value of dominance h . Only h values with magnitudes of 5 or below are shown. Boxplots including more extreme h values can be found in S8 Fig . Significance was assessed with a t -test. Bonferroni-corrected P values are shown in grey. *** P < 0.005, ** P < 0.01, * P < 0.05.

https://doi.org/10.1371/journal.pgen.1011257.g004

Phenotypic dominance and the genetic basis of expression variation.

Previous studies have found that cis -regulatory variation tends to be more additive [ 20 , 27 – 28 ], while trans variation tends to be more dominant [ 27 ]. In order to better understand the relationship between the genetic underpinnings of expression variation and dominance, we examined phenotypic dominance ( h ; see Methods ) in genes categorized as cis -only or trans -only. Overall dominance in genes categorized as trans -only was often biased towards one parent, with 5 out of 10 tissue and genetic background combinations significantly more dominant in one parental background than the other (Figs 4C and S8 ; t -test), and the more dominant parent dependent on the tissue and genetic background ( Fig 4C ). On the other hand, overall phenotypic dominance in genes categorized as cis -only was not significantly biased towards one parental background for any of the examined tissue and genetic background combinations ( Fig 4C ; t -test, P > 0.5 for all). When we compared the overall magnitude of dominance (as measured by the absolute value of dominance h ), trans -only genes were only sometimes more dominant than cis -only genes and this was significant after multiple test correction only in the midgut of the SU26xZI418 background ( Fig 4D ; t -test, P < 0.05). However, this lack of significance might be due to lack of power, particularly for cis -only genes, of which we detected fewer ( Fig 3A ). When we included all genes that could be analyzed in each individual tissue and genotype in the analysis, we were able to examine phenotypic dominance in 1.9–3.7-fold more cis -only and 1.6–2.5-fold more trans -only genes ( S6 Table ). The results, however, remained similar, with no increased detection of differences in dominance ( S6 Table ), which suggests that although we cannot completely rule it out, these results are unlikely to be due to a lack of statistical power. Thus, although we detected trans -regulatory variants as more dominant and cis -regulatory variants as more additive, which has been reported by previous studies [ 20 , 27 – 28 ], we only detected this trend in a single background and tissue. Moreover, the phenotypic dominance of trans - but not cis -regulatory variants tended to be biased toward one parental background.

Functional classification of genes displaying ASE.

In order to understand the types of genes showing ASE in our dataset, we tested for an enrichment of gene ontology (GO) biological process and molecular function terms for genes with ASE in each background and tissue. The most commonly enriched GO terms across all backgrounds and tissues were related to oxidoreductase activity ( S7 Table ). Indeed, we detected at least one oxidoreductase activity term for every genetic background and tissue combination in which we detected enriched GO terms. For genes displaying ASE in all examined tissues within a genotype, we could only detect two enriched GO terms, oxidoreductase activity and response to toxic substance, in the SU26xZI197 background ( S7 Table ). Thus, despite ASE genes tending to be tissue- and genetic background-specific, in general ASE genes tended to be enriched for genes predicted to be involved in oxidoreductase activity.

ASE genes are enriched for sex-biased gene expression in a context-dependent manner.

A previous study on ASE in D . melanogaster found differences in the relative contribution of cis -regulatory effects among genes with different levels of sex bias and among two tissues/body parts [ 28 ], while another using hybrids between D . simulans and D . mauritiana found that sexually dimorphic regulatory effects are often in cis [ 27 ]. In order to better understand the relationship between sex-biased expression and cis -regulatory variation in the midgut, hindgut, and Malpighian tubule, we categorized genes according to their level of sex bias using data from FlyAtlas2 [ 41 ] (see Methods ). When considering genes displaying ASE ( Table 1 ), in the Malpighian tubule we detected a significant enrichment of sex-biased genes for both genetic backgrounds (χ 2 test, P = 0.001 for both; S8 Table ), and in the hindgut we detected an enrichment of sex-biased genes in all genetic backgrounds ( P < 0.02 for all; S8 Table ) except SU26xZI418 ( P = 0.2929; S8 Table ). In the midgut, we detected a significant enrichment of sex-biased genes only in the SU26xZI197 background ( P = 0.0116; S8 Table ), despite the midgut having more sex-biased genes, particularly male-biased genes, than the other tissues ( S8 Table ). In the tissues and genetic backgrounds where we detected an enrichment of sex-biased ASE genes, the enrichment did not appear to be driven by bias towards one particular sex (<1.63-fold difference in the prevalence of male- and female-biased genes for all), with the exception of the SU26xZI197 background in the midgut, where male-biased genes were 6-fold more prevalent than female-biased genes ( S8 Table ). It should be noted, however, that only females were used in our experiments to measure ASE. Overall, genes displaying ASE were enriched for sex-biased gene expression, however this enrichment was dependent upon the tissue and genetic background in which they were detected.

The effects of inversions on regulatory variation.

Chromosomal inversions have been shown to affect expression in Drosophila [ 42 ]. Two inversions that are at high frequency worldwide and in sub-Saharan Africa, respectively [ 43 ], were present in our study: In(2L)t in SU26 [ 19 ] and In(3R)K in ZI197 [ 44 ]. A previous study utilizing the current Malpighian tubule data found that while In(2L)t made a minor contribution to gene expression variation, its presence could not explain the patterns of gene expression detected in F1 hybrids with SU26 as a parent [ 19 ]. To better understand how the presence of these inversions potentially affects expression and regulatory variation, we tested for a significant over- or underrepresentation of genes differentially expressed or displaying ASE between parental strains among genes located within these inversions. Genes differentially expressed between parental strains were significantly enriched for genes located within both inversions for all comparisons, including comparisons between strains that did not contain the focal or any inversion (χ 2 test, P < 10 −15 for all; S9 Table ). This finding suggests that the 2L and 3R chromosome arms are enriched for differentially expressed genes, rather than the inversions themselves, which is in line with a previous study that found that linked genetic variation within inversions is what drives differential expression rather than the structural variation itself [ 45 ]. For genes displaying ASE ( Table 1 ), we did not find any dearth or enrichment for genes located within either inversion ( P > 0.29; S9 Table ), with the exception of the SU58xZI418 genetic background in the Malpighian tubule, which was significantly enriched for genes located on the In(2L)t ( P = 0.003; S9 Table ), despite having the standard chromosomal arrangement. Thus, the presence of the In(2L)t and In(3R)K inversions does not appear to explain the regulatory patterns that we detected.

Tissue specificity varies depending on regulatory type and genetic background

For genetic background combinations for which we had transcriptome data in all three tissues (SU26xZI418 and SU58xZI418), we were able to examine the relationship between regulatory variation and tissue specificity. To do so, for every gene in each strain we calculated the tissue specificity index τ, which ranges in value from 0 to 1, with higher numbers indicating higher tissue specificity. When we compared overall τ among all genetic backgrounds, tissue specificity in ZI418 was higher than in either Swedish background as well as both F1 hybrids, although this difference was not statistically significant for SU26 ( Fig 5A ). On the other hand, tissue specificity in the Swedish strains was not significantly different from each other or their respective hybrid ( Fig 5A ). Thus, tissue specificity in F1 hybrids was more similar to the Swedish than the Zambian parent. In order to better understand how tissue specificity varies based on regulatory variation type, we performed pairwise comparisons of τ between genes with trans -only variation, cis -only variation, and conserved gene regulation. Both cis - and trans -regulated genes were significantly more tissue-specific than conserved genes for all strains in all backgrounds ( Fig 5B ). Interestingly, genes with trans -only regulatory variation were more tissue-specific than genes with cis -only regulatory variation, although this difference was not significant for SU58 and the F1 hybrid in the SU58xZI418 background ( P = 0.099 and 0.076, respectively; Fig 5B , S10 Table ). Thus, trans effects were more tissue-specific than cis effects in our dataset. Unlike for the genetic basis of expression variation itself ( Fig 4 ), we detected very few tissue-specific or tissue-by-regulatory type interaction effects on tissue specificity ( S10 Table ). Thus, the genetic basis of regulation (i.e. cis versus trans ) and, to a lesser degree, the genetic background are predictive of tissue specificity, while the tissue in which the regulatory variation was detected tends not to be. Indeed, the type of regulatory variation appears to have the largest influence on tissue specificity, as we were able to detect consistent patterns across tissues and genetic backgrounds ( Fig 5B ).

thumbnail

A) Overall tissue specificity as measured by τ in the examined strains. Significance was assessed with a t -test and Bonferroni-corrected P values are shown. B) Tissue specificity τ in each strain for genes categorized into cis -only (light), trans -only ( tran , dark), and conserved (cons, grey) regulatory (reg.) classes for each genetic background in the midgut (triangles), hindgut (circles), and Malpighian tubule (squares). Significance was assessed with an ANOVA ( S10 Table ). *** P < 0.005, ** P < 0.01, * P < 0.05, ns not significant ( P > 0.05).

https://doi.org/10.1371/journal.pgen.1011257.g005

Microbiome composition varies depending on tissue and genetic background

Bacterial community composition has been shown to affect host gene expression in the digestive tract depending upon host genotype [ 46 ]. In order to better understand the relationship between genetic background and microbiome composition, we performed microbiome sequencing in the midgut and hindgut for the same RNA samples for we which we performed mRNA-seq (see Methods ). For all samples, Wolbachia was highly predominant in the bacterial community (10.36–99.37%; S9 Fig ). In order to ensure its presence did not mask more subtle differences in diversity, we focus on analyses with Wolbachia removed (Figs 6 and 7 ), but results including Wolbachia were qualitatively similar ( S9 and S10 Figs, S11 – S13 Tables), and we did not detect any pattern of relative Wolbachia abundance among genotypes (lmer, P = 0.246; S14 Table ). After removal of Wolbachia , Acetobacter , one of the most common D . melanogaster gut microbial taxa [ 47 – 48 ], remained predominant in the midgut (54.7–99.8%; Fig 6 ). In the hindgut, where the microbiome composition was more diverse, Acetobacter was only dominant in a subset of individuals (1.44–78.52%; Fig 6 ). Interestingly, we did not detect Lactobacillus , another of the most common gut microbial taxa [ 47 – 48 ]. However, because we performed amplicon sequencing using RNA rather than DNA as the starting material (see Methods ), the microbiome composition we detected is representative of metabolic activity rather than presence. Thus, it may be that Lactobacillus was present but its metabolic activity was not high enough for us to detect.

thumbnail

Colored sections of each bar show bacterial genera (excluding Wolbachia ) with a relative abundance above 5% in each sample. The remaining genera are compiled in the “Others” category. Bacterial community composition including Wolbachia can be found in S9 Fig .

https://doi.org/10.1371/journal.pgen.1011257.g006

In order to detect differences in gut bacterial community composition, we computed the Bray-Curtis index (see Methods ). We detected significant differences in gut bacterial communities between the midgut and hindgut (Figs 6 and 7 , S11 Table ; PERMANOVA, P = 0.001), which is unsurprising given that these gut regions differ in their pH and associated digestive functions [ 37 ]. We also detected a significant effect of the genetic background (i.e. strain) as well as a significant interaction effect between the examined strain and gut region on the bacterial community ( Fig 7 , S11 Table ; PERMANOVA, P ≤ 0.015 for both), suggesting that genetic background affects microbiome composition, and this effect is at least partially tissue-dependent. Indeed, when we examined community composition within each tissue individually, the genetic background significantly influenced the gut bacterial community in the hindgut while it had no significant effect on the structure of the community in the midgut ( Fig 7 , S11 Table ; PERMANOVA, P = 0.002 and P = 0.89 respectively). We also detected significant tissue and genetic background effects on bacterial alpha-diversity ( S11 Fig , S12 and S13 Tables; LMM, P = 0.017 and P < 0.001), with the hindgut being more diverse and displaying stronger differentiation between parental and F1 strains ( S11 Fig ). Thus, the diverse bacterial community of the hindgut offered more possibility for differentiation while the midgut community was dominated by Acetobacter among all samples. In contrast, gene expression in the hindgut was less differentiated among genetic backgrounds but more differentiated from the other tissues while the midgut showed the opposite pattern (Figs 1 and S1C ), suggesting that the expression and regulatory variation we detected in these tissues is unlikely to be driven by bacterial community composition. Thus overall, tissue type (i.e. gut region) had the largest impact on microbial community composition and diversity, with genetic background also affecting microbiome variation to a lesser degree, especially within the hindgut; however, these genetic background effects do not appear to be related to host gene expression variation.

thumbnail

Principal coordinate analysis of bacterial communities in A) both midgut and hindgut samples, B) hindgut, and C) midgut. The legend indicates that replicates of each genotype share the same color, while shape indicates tissue. Wolbachia was excluded from the analysis. Results including Wolbachia can be found in S10 Fig .

https://doi.org/10.1371/journal.pgen.1011257.g007

Using transcriptome data from parental and F1 hybrid D . melanogaster strains from an ancestral and a derived population in the midgut, hindgut and Malpighian tubule, we found that both the genetic basis of expression variation (i.e. cis versus trans ) and the mode of expression inheritance (i.e. dominant versus additive) were highly tissue- and genetic background-specific (Figs 3 and 4 , Table 1 ). Previous studies using F1 hybrids in Drosophila have found that genetic background [ 19 , 21 , 28 ] and body part or tissue [ 21 – 22 , 28 ] can have large effects on regulatory architecture; however, to our knowledge, this is the first study examining highly spatially and functionally proximate tissues that not only communicate with each other but also functionally and physically interact. Thus, our results demonstrate that even functionally related, interconnected tissues can show highly divergent regulatory architecture among tissues and genetic backgrounds. Indeed, overall gene expression was most similar between the Malpighian tubule and the midgut ( Fig 1 ), despite these tissues being part of the excretory and digestive system, respectively, while the two gut tissues are part of the same alimentary canal. Thus, our results suggest that the level of functional and physical interconnectivity between tissues may not necessarily be predictive of similarity in gene expression or regulatory architecture. Consistent with this interpretation, we detected similar fold-size differences in the proportion of cis -regulated genes in the hindgut versus the Malpighian tubule or midgut ( Fig 4A ) as has previously been reported in the testes versus the head or ovaries [ 28 ]. However, we should note that the tissues we examined in this study, and the midgut in particular, are known to be regionalized [ 37 – 38 , 40 ] with an estimated 22, 4, and 5 distinct cell types currently described in the midgut, hindgut, and Malpighian tubules, respectively [ 49 – 50 ]; therefore, it is possible that we may have missed some of the more subtle differences in gene regulation that occur among individual regions or cell types.

It has long been thought that regulatory changes and particularly cis -regulatory changes are important during adaptation as they can fine-tune gene expression both temporally and spatially [ 51 – 52 ]. Indeed, we found that genes with trans and cis effects were more tissue-specific than genes with no differential expression regulation ( Fig 5 ), suggesting that regulatory changes between diverged populations are often tissue-specific, which is likely driven by spatial fine-tuning of gene expression. Interestingly and somewhat surprisingly, we found that trans effects were more tissue-specific than cis -effects and this finding was consistent across tissues and genetic backgrounds ( Fig 5B ). Thus, our results reveal that trans -regulatory changes can be as or, potentially, even more tissue-specific than cis -regulatory changes that occur as populations diverge. In contrast to our findings, a recent study on ASE in two mouse tissues found that tissue-specific genes were largely cis -regulated during population divergence [ 25 ]. Indeed, cis -regulatory changes have long been thought to be more common during adaptation due to lower pleiotropy [ 53 ]. One of the disadvantages of our methods is that we were unable to assign regulatory effects to their underlying genetic variants and do not know the location or identity of the genetic variants underlying the detected tissue-specific trans effects. Thus, it is possible that the tissue-specific trans effects we detected are driven by cis -regulatory changes in the transcription factors or other regulators driving these trans effects.

Previous studies found that cis -regulatory variation tends to be more additive [ 20 , 27 , 28 ] than trans variation, which tends to be more dominant [ 27 ]. However, we found little evidence for this pattern in our dataset ( Fig 4D ). The discrepancy between the current study and previous ones may be due to differences in methods or the examined genetic background and/or body parts/tissues, suggesting that differences in additivity and dominance between cis and trans variation may be context-specific. Interestingly, we found that the phenotypic dominance of trans - but not cis -regulatory variation tended to be biased toward one parental background, with the direction of the bias variable among tissues and genetic backgrounds ( Fig 4C ). The context-dependent nature of this finding suggests that this bias may be driven by one or several trans factors affecting the expression of multiple genes in individual tissues and genetic backgrounds, which underscores the importance of taking genetic background and tissue into account when attempting to identify general patterns and trends in gene expression and its regulation. When we examined divergence in gene expression and phenotypic dominance (i.e. the cumulative differences in each trait across all analyzed genes), we found that divergence was higher among than within tissues (Figs 2C and 3D ), suggesting that although both are pervasive, tissue-specific effects outnumber or are larger than genetic background-specific effects, and these effects may be magnified when considering the phenotypic dominance of gene expression, as our findings suggest that it is much less conserved than expression itself.

A previous study in D . melanogaster larvae found that overall developmental (i.e. temporal) gene expression specificity increased during adaptation in a derived population [ 36 ]. In contrast, in our dataset the ancestral ZI418 genetic background showed the highest tissue (i.e. spatial) gene expression specificity ( Fig 5A ); however, it is possible that overall changes in gene expression specificity driven by adaptation are only detectable at the population rather than the individual level. Because we identified regulatory variation between an ancestral and a derived D . melanogaster population that had to adapt to new habitats, some, although not necessarily all, of the regulatory variation we identified may be adaptive. Indeed, one recent study examining ASE between warm- and cold-adapted mouse strains found signs of selection on ASE genes in the cold-adapted mice [ 25 ]. Genes we identified as showing ASE included several for which adaptive cis -regulatory divergence has previously been documented, such as MtnA [ 54 ], Cyp6g1 [ 55 ], Cyp6a20 [ 19 ], and Cyp12a4 [ 19 ]. We also detected an enrichment of oxidoreductase activity and response to toxic substance among ASE genes ( S7 Table ), suggesting any genes with adaptive cis -regulatory variation that we detected may be related to these processes. Indeed, the detected selection on Cyp6g1 expression is thought to have been driven by resistance to the pesticide DDT [ 55 ], while selection on MtnA is thought to be driven by increased oxidative stress resistance [ 34 , 54 ]. Indeed, the digestive system’s direct interaction with the external environment [ 37 ] and the excretory system’s role in detoxification and waste excretion [ 38 ] suggest that many of the ASE genes we identified may be candidates for adaptation.

Similar to our findings for the genetic basis of expression variation, the mode of expression inheritance, and phenotypic dominance (Figs 3 – 5 ), we detected significant effects of tissue and genetic background on bacterial community composition in our microbiome analysis, although the detected genetic background effects did not appear to explain host gene expression variation (Figs 6 , 7 , and S11 ). The endosymbiont Wolbachia , which is known to affect microbiome composition but is not present in the gut lumen [ 56 ], was predominant in all of our samples ( S9 Fig ) but we did not detect the very common Lactobacillus ( Fig 6 ), which suggests that physical abundance within the gut may not be predictive of metabolic activity levels and some bacterial community members may be more or less active than predicted by their physical abundance. Bacterial community composition was highly divergent between the two gut tissues, and the effect of genetic background appeared to be driven by higher diversity in the hindgut, which also showed more differentiation among strains (Figs 6 , 7 and S11 ). Because all flies were reared in the same lab environment, a large portion of the detected bacterial community was likely acquired in the lab. Rearing environment greatly influences bacterial community composition, with communities of lab-reared strains less diverse than their natural-reared counterparts [ 57 ]. Thus, it is difficult to draw conclusions about how the genetic background effects we detected might influence bacterial community composition in nature, although genetic differentiation among natural D . melanogaster populations is known to shape bacterial community structure [ 58 ].

Overall, our findings yield insight into the evolution of regulatory architecture, the effects of regulatory variation on tissue specificity, the effects of genetic background on expression and microbiome variation, as well as the importance of accounting for context-specificity in evolutionary studies.

Materials and methods

D . melanogaster samples and sequencing.

All D . melanogaster strains were reared on cornmeal-molasses medium under standard lab conditions (21°C, 14 hour light: 10 hour dark cycle). mRNA-seq and microbiome sequencing were performed for four isofemale strains, two from Umeå, Sweden (SU26 and SU58) [ 29 ] and two from Siavonga, Zambia (ZI418 and ZI197) [ 30 ] as well as F1 hybrids between the Swedish and Zambian parental lines (SU58xZI418, SU58xZI197, SU26xZI418, SU26xZI197). The SU58 and ZI418 strains have the standard arrangement for all known chromosomal inversion polymorphisms [ 19 , 44 ], while SU26 and ZI197 have the standard arrangement with the exception of In(2L)t , which was present in SU26 [ 19 ] and In(3R)K , which was present in ZI197 [ 44 ]. To determine if inversion status affected our findings, we tested for a significant over- or underrepresentation of genes differentially expressed or displaying ASE between parental strains among genes located within the In(2L)t or In(3R)K inversion using a χ 2 test. Reciprocal F1 hybrids were generated in both directions (i.e. parental genotypes were switched) by crossing 2–3 virgin females of one line with 4–5 males of the other line. Crosses were carried out in 8–13 replicate vials and parental strains were similarly reared (2–3 females and 3–5 males per vial with 8–12 replicate vials) in order to control for rearing density among genotypes.

Midguts (from below the cardia to the midgut/hindgut junction, 20 per biological replicate) and hindguts (from the midgut/hindgut junction to the anus, 60 per biological replicate) were dissected from 6-day-old females in cold 1X PBS and stored in RNA/DNA shield (Zymo Research Europe; Freiburg, Germany) at -80°C until RNA extraction. For F1 hybrids, half of the tissues were dissected from each of two reciprocal crosses in order to avoid potential parent-of-origin effects, although such effects are expected to be absent or very rare in D . melanogaster [ 6 , 59 ]. RNA was extracted from three biological replicates per genotype and tissue type (48 samples in total) with the RNeasy Mini kit (Qiagen; Hilden, Germany) as directed by the manufacturer. mRNA-seq and microbiome sequencing were performed using the same RNA extractions. Poly-A selection, fragmentation, reverse transcription, library construction, and high- throughput sequencing was performed by Novogene (Hong Kong) using the Illumina HiSeq 2500 platform (Illumina; San Diego, CA) with 150-bp paired reads. Malpighian tubule 125-bp paired read data for SU58, SU26, ZI418 and F1 hybrids (SU58xZI418, SU26xZI418), which was composed of 2 biological replicates per genotype (10 in total; 58 libraries in total across all tissues) was downloaded from Gene Expression Omnibus (accession number GSE103645).

Microbiome sequencing and analysis

Reverse transcription was carried out to generate complementary DNA (cDNA) which was used for amplicon sequencing targeting the V4 region of the 16S rRNA bacterial gene. First, template RNA was cleaned of potential residual genomic DNA with the PerfeCta DNase I (Quantabio; Beverly, MA) following the manufacturer’s instructions. Reverse transcription was performed using the FIREScript RT cDNA Synthesis (Solis BioDyne; Tartu, Estonia) with specific bacterial primers, 515F (5’-GTGYCAGCMGCCGCGGTAA-3’) and 806R (5’- GGACTACNVGGGTWTCTAAT-3’), also following the manufacturer’s instructions. The V4 region of the 16S rRNA gene was sequenced from the resulting cDNA on an Illumina Miseq platform using the 515F and 806R primer pair. Using the R package DADA2 (version 1.26.0) [ 60 ], Amplicon Sequence Variants (ASVs) were inferred after trimming (length of 240nt for forward reads and 180nt for reverse reads). Dereplication and chimera removal were performed using default parameters of DADA2. Each ASV was assigned taxonomically using the Silva classifier (version 138.1) [ 61 ]. ASVs assigned to the Eukaryotic and Archeal kingdoms were removed. Given that the gut bacterial community was highly dominated by one ASV assigned to the genus Wolbachia ( S9 Fig ), a known intracellular symbiont of Drosophila melanogaster , we chose to remove it for further statistical analysis, revealing the underlying diversity in the gut bacterial community.

All statistical analyses were performed in R-4.2.2 and each graph was generated with the ggplot2 package [ 62 ]. The composition of the bacterial gut community was analyzed using the Phyloseq package [ 63 ]. ASVs not present in more than 6.25% (3 replicates/48 samples = 0.00625) of the samples were removed for visualization purposes but kept in the data for the remaining analyses. Differences in beta-diversity were tested with permutational multivariate analysis of variance ( vegan package version 2.6–4) [ 64 ] on a Bray-Curtis dissimilarities matrix and Principal coordinate analyses (PCoA) was performed for visualization using the vegan package. Differences in bacterial alpha diversity (species richness, Shannon index, Simpson index and inverse Simpson index generated with vegan package) were tested with linear mixed models (lmer, lme4 package) [ 65 ] and pairwise comparisons were tested following the Tukey method ( emmeans package) [ 66 ]. The RNA extraction batch had no significant effect on differences in alpha and beta-diversity of the bacterial community.

mRNA-seq analyses

Reference genomes for each parental strain were constructed using published genome sequence assemblies of SU26 and SU58 [ 29 ], and ZI197 and ZI418 [ 44 , 67 ] as described in [ 19 ]. Briefly, if a nucleotide sequence difference on the major chromosome arms (X, 2R, 2L, 3R, 3L) occurred between a parental strain and the D . melanogaster reference genome (release 6) [ 68 ], the parental nucleotide variant was included in the new reference transcriptome. If the parental sequence contained an uncalled base (“N”), the reference sequence was used. All transcribed regions (including rRNAs, non-coding RNAs, and mRNAs) were then extracted from each parental reference genome using FlyBase annotation version 6.29 [ 68 ]. For each parental strain library, mRNA-seq reads were mapped to the corresponding parental reference genome. In order to prevent mapping bias for genes with greater sequence similarity to one of the parental reference genomes, reads for F1 hybrids were mapped to the combined parental reference genomes.

Reads were mapped to the reference transcriptomes using NextGenMap [ 69 ] in paired-end mode. Read pairs matching more than one transcript of a gene were randomly assigned to one of the transcripts of that gene. For downstream analyses, we analyzed the sum of read counts across all of a gene’s transcripts (across all annotated exons), i.e. on the individual gene-level. To identify genes with poor mapping quality, for each parental transcriptome, we simulated mRNA-seq data with 200 reads per transcript and either 125 bp or 150 bp reads, then mapped the reads back to the corresponding transcriptome. Genes for which more than 5% of reads mapped incorrectly were removed from the analyses of the corresponding read length (125 bp for Malpighian tubule and 150 bp for midgut and hindgut). Library size ranged from 34.7 to 55.0 million paired end reads, 97.0–98.6% of which could be mapped ( S15 Table ).

ASE and mode of expression inheritance analyses were performed within individual tissues as well as for all tissues together. Analyses for individual tissues were qualitatively similar to our analyses including all tissues; therefore, we focus in the main text on analyses including all tissues ( S1 and S2 Data) and have included individual tissue analyses as Supplementary material ( S3 – S5 Data , S3 – S6 Tables). To standardize statistical power across all libraries included in the analysis, we held the total number of mapped reads constant by setting the maximum number of mapped reads per sample to that of the library with the fewest mapped reads and randomly subsampling reads (without replacement) until the total number of mapped reads for each sample equaled the maximum. The number of reads we subsampled for each dataset were as follows: 34,009,757 in midgut, 31,611,417 in hindgut, and 30,820,759 in Malpighian tubule as well as for analyses including all tissues. We identified differentially expressed genes using a negative binomial test as implemented in DESeq2 [ 70 ]. Gene expression divergence between two strains or tissues was calculated as 1 –Spearman’s ρ between the mean normalized gene counts of the two. Significant differences in divergence were assessed with a t -test. To be considered as expressed in our dataset, we required that a gene have a minimum of 15 reads in each sample, which resulted in 7,684, 8,209, 7,675, and 6,894 genes in the midgut, hindgut, Malpighian tubule, and all tissues, respectively, that could be used in analyses.

Calculation of tissue specificity and phenotypic dominance h

essay introduction about genes

Inference of the mode of expression inheritance

To infer the mode of expression inheritance in F1 hybrids, we compared F1 hybrid expression to parental expression and classified genes into six categories: “similar,” “P1 dominant”, “P2 dominant”, “additive,” “overdominant,” and “underdominant” [ 5 ]. To do so, we compared the fold-change difference in expression as calculated by DESeq2 [ 70 ] for each gene between genotypes to a fold-change cutoff threshold. Genes where all expression differences were below the cutoff were classified as “similar”, while genes for which the expression difference was greater than the cutoff between the hybrid and only one parent were classified as dominant for that parent. Genes were categorized as additive if the expression differences between the hybrid and both parents was above the cutoff and the hybrid expression was between the expression of the two parental strains, or if the difference in expression between the two parents was above the cutoff and hybrid expression was between the two parental strains. Genes were categorized as overdominant if the expression difference between the hybrid and both parents was above the cutoff and hybrid expression was greater than that of both parents. A gene was categorized as underdominant if the expression difference between the hybrid and both parents was above the cutoff and hybrid expression was lower than that of both parents. We employed three fold-change cutoffs (1.25, 1.5, and 2) as well as a negative binomial test [ 70 ] and a 5% FDR cutoff, for which we also included an ambiguous category for genes that did not fit into the other categories. In the main text, we focus on the 1.25-fold cutoff as i) the relative proportion of genes falling into each of the non-similar categories was qualitatively similar for all cutoffs ( S2 Table ), ii) a fold-change cutoff (rather than a statistical test) should avoid bias in detecting differential expression between alleles/genes with higher expression, as the power of statistical tests increases with increasing read counts, iii) the 1.25-fold cutoff has been employed in several previous studies with the justification that most of the significant expression differences detected between samples tend to be of this magnitude [ 5 , 20 , 74 ], and iv) previous work using the Malpighian tubule data we use here empirically determined it to be a reasonable cutoff for this analysis [ 19 ]. The results for the other cut-offs and individual tissues are provided in S2 and S3 Tables.

ASE analysis

In order to detect expression differences between the two alleles in each F1 hybrid, we compiled lists of diagnostic SNPs that could be used to distinguish between transcripts for each pairwise combination of parental alleles in each examined tissue ( S16 Table ). To do so, we first compared the two parental reference genome sequences over all transcribed regions annotated in the D . melanogaster reference genome (version 6.29) [ 68 ] to compile an initial list of diagnostic SNPs. In order to exclude sites with potential residual heterozygosity or sequencing errors, for each tissue, we required that all SNPs inferred from the genome sequences be confirmed in the parental mRNA-seq data with a coverage of ≥ 20 reads and the expected variant in ≥ 95% of the mapped reads of each parent. Next, we called new SNPs from the parental mRNA-seq data if a site was not polymorphic (or contained an N) in the parental genome sequences, but had ≥ 20 mapped reads in each parent with ≥ 95% having the same base in one parent but a different base the other parent. The total number of high-confidence diagnostic SNPs meeting these criteria was 66,030–89,497, 74,388–105,634, and 59,294–60,668 for midgut, hindgut, and Malpighian tubule, covering 6,937–7,474, 7,399–8,166, and 6,394–6,412 genes, respectively.

To assess ASE in the F1 hybrids, we used the mapping data described in the mRNA-seq analyses section above, but used only reads containing at least one diagnostic SNP (i.e. reads that could be assigned to a parental allele). As described above, counts were summed over all transcripts of a gene and the ASE analysis was carried out on a per gene basis. To standardize statistical power between genetic background combinations and/or tissues while maximizing the number of reads that could be included in our analysis, the maximum number of diagnostic reads per sample (i.e. the maximum number of reads for 2 alleles) was set to that of the F1 hybrid with the fewest diagnostic reads. For all other samples, reads were randomly subsampled (without replacement) until the total number of diagnostic reads equaled the maximum for F1 hybrids or half of the maximum for parents. The maximum number of diagnostic reads was set to 15,446,286, 11,058,789, 12,477,714, and 11,058,789 for midgut, hindgut, Malpighian tubule, and all tissues, respectively. We tested for differences in allelic expression using a negative binomial test as implemented in DESeq2 [ 70 ], using only genes with a minimum of 15 diagnostic reads for each allele replicate, resulting in a total of 5,060–5,590, 5,650–6,141, 5,097–5,133, and 4,035–4,592 genes depending on genetic background combination that could be analyzed in the midgut, hindgut, Malpighian tubule, and all tissues, respectively, of which 4,228, 4,800, 4,397, and 2,845 genes could be analyzed in all genetic background combinations. In the main text, we focus on the 2,845 genes that could be directly compared across all genetic background combinations and tissues, although results for individual tissues and genetic background combinations were qualitatively similar ( S5 Table ).

Inference of the genetic basis of expression variation

We determined the genetic basis of expression variation for each gene using the outcome of three statistical tests: a negative binomial test for differential expression between the two parental strains, a negative binomial test for ASE in the F 1 hybrid, and a Cochran–Mantel–Haenszel (CMH) test of the ratio of expression between the two parents and the ratio between the two alleles in the hybrid. For all tests, P -values were adjusted for multiple testing [ 75 ] and an FDR cutoff of 5% was used to define significant differences. We employed the same subsampling procedure as described in the ASE analysis section above in order to balance statistical power between parents and hybrids. We classified genes into regulatory classes [ 5 ] as follows: “conserved” genes showed no significant difference in any test; “all cis ” genes showed significant ASE in hybrids and significant DE between parents, but the CMH test was not significant; “all trans ” genes showed significant DE between the parents and a significant CMH test, but no ASE; “compensatory” genes had no DE between parents, but showed significant ASE in hybrids and a significant CMH test; “ cis + trans ” genes were significant result for all three tests with the expression difference between the parents greater than the difference between the two alleles in the hybrid; “ cis × trans ” genes also had three significant tests, but the expression difference between the parents was less than the difference between the two alleles in the hybrid; and “ambiguous” genes were significant for only one test.

Gene set enrichment analysis

We used InterMine [ 76 ] to search for an enrichment of gene ontology (GO) biological process and molecular function terms for genes displaying ASE in each genetic background and tissue as well as in all tissues.

Sex-biased gene analysis

In order to calculate sex bias for each gene in each examined tissue, we downloaded male (M) and female (F) FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values from FlyAtlas2 [ 41 ] and calculated sex bias as log2(FPKM M /FPKM F ). Based on this log fold-change (LFC), we categorized genes as either sex-biased (LFC ≥ 0.5 or LFC ≤ -0.5) or unbiased (LFC ≤ 0.5 or LFC ≥ -0.5). We tested for a significant over- or underrepresentation of sex-biased genes among genes displaying ASE using a χ 2 test with a Benjamini-Hochberg multiple test correction. To better understand how these sex-biased genes were distributed among different levels of sex bias, we further categorized genes as strongly female-biased (FS; LFC ≤ -1.5), moderately female-biased (FB; -0.5 ≤ LFC ≤ -1.5), unbiased (UB; -0.5 > LFC > 0.5), moderately male-biased (MB; 0.5 ≥ LFC ≥ 1.5), or strongly male-biased (MS; LFC ≥ 1.5; S8 Table ).

Supporting information

S1 fig. expression and dominance ( h ) divergence within and among tissues..

A) Expression and B) dominance ( h ) divergence among genotypes within the midgut (MG), hindgut (HG), and Malpighian tubule (MT) versus divergence between the same genotype among tissues (across). C) Expression and D) dominance ( h ) divergence within the same genotype among tissues. A–C) Significance was assessed with a t -test. D) Significance was not assessed due to the low number of comparisons. Bonferroni-corrected P values are shown. * P < 0.05, ** P < 5 x 10 −5 , *** P < 10 −14 , ns not significant, nt not tested.

https://doi.org/10.1371/journal.pgen.1011257.s001

S2 Fig. Differential expression and divergence within tissues.

The total number of differentially expressed (DE) genes between genotypes within the A) hindgut (HG), B) midgut (MG), and C) Malpighian tubule (MT) are shown above the diagonal, while expression divergence (as measured by ρ subtracted from one) between genotypes is shown below the diagonal. Analysis was performed in each tissue individually. The numbers of genes that could be included in the analysis for each tissue were 8,209 in the hindgut, 7,684 in the midgut, and 7,675 in the Malpighian tubule.

https://doi.org/10.1371/journal.pgen.1011257.s002

S3 Fig. Mode of expression Inheritance in SU26xZI418 and SU58xZI418 backgrounds.

Upset plots showing unique and overlapping genes within the hindgut (circles), midgut (triangles), and Malpighian tubule (squares) in the A,C,E) SU26xZI418 or B,D,F) SU58xZI418 backgrounds. Horizontal bars represent the total number (num.) of genes in a tissue and inheritance category combination. Vertical bars represent the number of genes in an intersection class. A filled circle underneath a vertical bar indicates that a tissue and inheritance category combination is included in an intersection class. A single filled circle represents an intersection class containing only genes unique to a single tissue and inheritance category combination. Filled circles connected by a line indicate that multiple tissue and inheritance category combinations are included in an intersection class. Genes categorized into A,B) basic expression inheritance (inherit.), i.e. P1 dominant (P1 dom.), P2 dominant (P2 dom.), and additive (add.), C,D) misexpression (misexpress.), and E,F) similar categories are shown.

https://doi.org/10.1371/journal.pgen.1011257.s003

S4 Fig. Mode of expression inheritance in SU26xZI197 and SU58xZI197 backgrounds.

Upset plots showing unique and overlapping genes within the hindgut (circles) and midgut (triangles) in the A,B,E) SU26xZI197 or C,D,F) SU58xZI197 backgrounds. Horizontal bars represent the total number (num.) of genes in a tissue and inheritance category combination. Vertical bars represent the number of genes in an intersection class. A filled circle underneath a vertical bar indicates that a tissue and inheritance category combination is included in an intersection class. A single filled circle represents an intersection class containing only genes unique to a single tissue and inheritance category combination. Filled circles connected by a line indicate that multiple tissue and inheritance category combinations are included in an intersection class. Genes categorized into A,C) similar, B,D) basic expression inheritance (inherit.), i.e. P1 dominant (P1 dom.), P2 dominant (P2 dom.), and additive (add.), and E,F) misexpression (mis-express.) categories are shown.

https://doi.org/10.1371/journal.pgen.1011257.s004

S5 Fig. Genetic basis of expression inheritance in SU26xZI418 and SU58xZI418 backgrounds.

Upset plots showing unique and overlapping genes with non-ambiguous regulatory divergence in the hindgut (circles), midgut (triangles), and Malpighian tubule (squares) in the A) SU26xZI418 or B) SU58xZI418 backgrounds. Horizontal bars represent the total number of genes in a tissue and regulatory category combination. Vertical bars represent the number of genes in an intersection class. A filled circle underneath a vertical bar indicates that a tissue and inheritance category combination is included in an intersection class. A single filled circle represents an intersection class containing only genes unique to a single tissue and regulatory category combination. Filled circles connected by a line indicate that multiple tissue and regulatory category combinations are included in an intersection class.

https://doi.org/10.1371/journal.pgen.1011257.s005

S6 Fig. Genetic basis of expression inheritance in SU26xZI197 and SU58xZI197 backgrounds.

Upset plots showing unique and overlapping genes with non-ambiguous regulatory divergence in the hindgut (circles) and midgut (triangles) in the A) SU26xZI197 or B) SU58xZI197 backgrounds. Horizontal bars represent the total number (num.) of genes in a tissue and regulatory category combination. Vertical bars represent the number of genes in an intersection class. A filled circle underneath a vertical bar indicates that a tissue and inheritance category combination is included in an intersection class. A single filled circle represents an intersection class containing only genes unique to a single tissue and regulatory category combination. Filled circles connected by a line indicate that multiple tissue and regulatory category combinations are included in an intersection class.

https://doi.org/10.1371/journal.pgen.1011257.s006

S7 Fig. Genetic basis of expression inheritance across examined tissues and backgrounds.

Shown are A,B) unique and C,D) overlapping genes in each regulatory category. Shown are A) the number of genes unique to each tissue within each regulatory category and genetic background, B) the number of genes unique to each genetic background and tissue within each regulatory category, C) the number of genes in each regulatory category detected in all examined tissues for each genetic background, and D) the number of genes in each regulatory category detected in all genetic backgrounds for each tissue. Asterisks (*) indicate comparisons using only C) two tissues or D) two genetic backgrounds.

https://doi.org/10.1371/journal.pgen.1011257.s007

S8 Fig. All dominance in cis -only versus trans -only genes.

A) Dominance and B) magnitude of dominance h for genes categorized as cis -only ( c , light) and trans -only ( t , dark) in each background and tissue. Significance was assessed with a t -test. Bonferroni-corrected P values are shown in grey. *** P < 0.005, ** P < 0.01, * P < 0.05, ms P marginally significant after multiple test correction ( P < 0.1), ns P not significant after multiple test correction.

https://doi.org/10.1371/journal.pgen.1011257.s008

S9 Fig. Composition of the bacterial communities in the midgut and hindgut of each genotype, including Wolbachia ASVs.

Colored sections of each bar show bacterial genera with a relative abundance superior to 5% in each sample. The remaining genera are compiled in the “Others” category.

https://doi.org/10.1371/journal.pgen.1011257.s009

S10 Fig. Principal coordinate analysis of bacterial communities in A) both midgut and hindgut samples, B) hindgut, and C) midgut, including Wolbachia ASVs.

The legend indicates that replicates of each genotype share the same color, while shape indicates tissue.

https://doi.org/10.1371/journal.pgen.1011257.s010

S11 Fig. Shannon diversity index of the bacterial community in the midgut and hindgut excluding (A) or including (B) Wolbachia ASVs.

* indicates significant differences of the Shannon index between groups (lmer, P < 0.05).

https://doi.org/10.1371/journal.pgen.1011257.s011

S1 Table. Shared differentially expressed genes among tissues and genotypes.

The numbers of overlapping differentially expressed (DE) genes within or among genotypes and/or tissues for all DE genes (All) or genes upregulated in the midgut (MG up ), hindgut (HG up ), and Malpighian tubule (MT up ) tissues are shown. Only genotypes for which data was available in all examined tissues are shown.

https://doi.org/10.1371/journal.pgen.1011257.s012

S2 Table. Mode of expression inheritance in combined tissue analysis.

Numbers of genes in each mode of expression inheritance category within the hindgut, midgut, and Malpighian tubule at a 1.25-, 1.5-, and 2-fold change or 5% FDR cut-off (see Methods ) are shown. 6,894 genes could be included in the analysis. The ambiguous category is only necessary for the 5% FDR cutoff and comprises genes which could not be assigned into other categories.

https://doi.org/10.1371/journal.pgen.1011257.s013

S3 Table. Mode of expression inheritance in individual tissue analysis.

Numbers of genes in each mode of expression inheritance category within the hindgut, midgut, and Malpighian tubule at a 1.25-, 1.5-, and 2-fold change or 5% FDR cut-off (see Methods ) are shown. 7,684, 8,209, and 7,675 genes could be included in the analysis in the midgut, hindgut and the Malpighian tubule, respectively. The ambiguous category is only necessary for the 5% FDR cutoff and comprises genes which could not be assigned into other categories.

https://doi.org/10.1371/journal.pgen.1011257.s014

S4 Table. ASE genes identified in individual tissue analyses.

Number of differentially expressed (DE) genes between the parental strains (P) and alleles within the F1 hybrid (H) as well as allele specific genes (ASE) are shown for hindgut (HG), midgut (MG), Malpighian tubule (MT), and shared across all tissues (All). Dashes indicate missing data.

https://doi.org/10.1371/journal.pgen.1011257.s015

S5 Table. The genetic basis of expression inheritance in individual tissue analyses.

Numbers of genes in each regulatory category within the hindgut, midgut, and Malpighian tubule are shown. 4,228, 4,800, and 4,397 genes could be included in the analysis in the midgut, hindgut and the Malpighian tubule, respectively.

https://doi.org/10.1371/journal.pgen.1011257.s016

S6 Table. Phenotypic dominance in all cis and all trans groups including all genes that could be analyzed in each individual tissue and genotype.

The mean and mean of the absolute value of phenotypic dominance ( h ) are shown. Significance was assessed using a t -test. Significant P -values are in bold and values non-significant after multiple test correction are shown in grey.

https://doi.org/10.1371/journal.pgen.1011257.s017

S7 Table. Enriched GO terms in genes with ASE.

Enriched molecular function and biological process GO terms for genes showing ASE in the Malpighian tubule (MT), midgut (MG), hindgut (HG), and all examined tissues. Number (num) of contributing terms and Holm-Bonferroni-adjusted P -values are shown. Genetic background and tissue combinations with no enriched GO terms are not shown.

https://doi.org/10.1371/journal.pgen.1011257.s018

S8 Table. Sex bias in genes displaying ASE.

The number of genes categorized as strongly female-biased (FS; LFC ≤ -1.5), moderately female-biased (FB; -0.5 ≤ LFC ≤ -1.5), unbiased (UB; -0.5 > LFC > 0.5), moderately male-biased (MB; 0.5 ≥ LFC ≥ 1.5), strongly male-biased (MS; LFC ≥ 1.5), and generally sex-biased (SB; LFC ≥ 0.5 or LFC ≤ -0.5) are shown. Significant deviations from the expected number of sex-biased genes (SB EXP) were assessed with a χ 2 test. P - and Benjamini-Hochberg-corrected P -values are shown.

https://doi.org/10.1371/journal.pgen.1011257.s019

S9 Table. The effect of In(2L)t and In(3R)K inversion status.

The number of genes differentially expressed (DE) or displaying allele specific expression (ASE) between parental strains along with the number of genes located within an inversion are shown. The strain containing an inversion is shown in parentheses. Significant deviations from the expected number of DE or ASE genes were assessed with a χ 2 test.

https://doi.org/10.1371/journal.pgen.1011257.s020

S10 Table. ANOVA results for pairwise comparisons between genes categorized as cis -only, trans -only, and conserved.

Results for ANOVAs with τ of the indicated strain as the response variable and regulatory (reg.) variant type, tissue, and the interaction between them as factors.

https://doi.org/10.1371/journal.pgen.1011257.s021

S11 Table. PERMANOVA results on comparison of Bray-Curtis distances of the bacterial community including or excluding Wolbachia ASVs.

https://doi.org/10.1371/journal.pgen.1011257.s022

S12 Table. Summary statistics of the LMER ANOVA comparing Shannon indices of the bacterial community including or excluding Wolbachia ASVs.

The mixed models included the genotype and tissue as fixed factors and the batch and sample ID as random factors. The interaction term in between the two fixed effect did not have a significant effect on the Shannon indices and was subsequently removed from the models. LMER: Shannon index ~ genotype + tissue + 1| batch + 1| sample ID.

https://doi.org/10.1371/journal.pgen.1011257.s023

S13 Table. Pairwise comparisons of the Shannon index for the bacterial communities including or excluding Wolbachia ASVs.

The comparisons were performed following the Tukey method and the P -values were adjusted via the Benjamini-Hochberg (BH) method.

https://doi.org/10.1371/journal.pgen.1011257.s024

S14 Table. Results of lmer comparing the relative abundance of Wolbachia in the bacterial communities.

https://doi.org/10.1371/journal.pgen.1011257.s025

S15 Table. Library size and mapping efficiency for all mRNA-seq libraries.

To prevent mapping bias, all libraries were simultaneously mapped to at least one pair of parental strains (P1 and P2). The number of mapped, paired mapped, unmapped, and discarded reads as well as total reads, read pairs, and proportion (prop) of mapped reads are shown.

https://doi.org/10.1371/journal.pgen.1011257.s026

S16 Table. Number of diagnostic SNPs in each tissue and comparison.

Shown are the number of diagnostic SNPs determined from the respective reference genome (Ref) and with the mRNA-seq data included as well as the number of genes covered by the diagnostic SNPs.

https://doi.org/10.1371/journal.pgen.1011257.s027

S1 Data. ASE gene counts and expression in analyses including all tissues.

https://doi.org/10.1371/journal.pgen.1011257.s028

S2 Data. Phenotypic dominance, sex-biased gene expression, and τ in analyses including all tissues.

https://doi.org/10.1371/journal.pgen.1011257.s029

S3 Data. ASE gene counts and expression, and phenotypic dominance in midgut analyses.

https://doi.org/10.1371/journal.pgen.1011257.s030

S4 Data. ASE gene counts and expression, and phenotypic dominance in hindgut analyses.

https://doi.org/10.1371/journal.pgen.1011257.s031

S5 Data. Gene counts, expression and phenotypic dominance in Malpighian tubule analyses.

https://doi.org/10.1371/journal.pgen.1011257.s032

S6 Data. ASV and taxonomy tables of the bacterial communities in the midgut and hindgut.

https://doi.org/10.1371/journal.pgen.1011257.s033

Acknowledgments

We thank Hilde Lainer for excellent technical assistance in the lab as well as Dr. Grit Kunert (Department of Biochemistry, Max-Planck-Institute for Chemical Ecology) for advice on statistical models. We also thank the LMU Evolutionary Biology department for helpful suggestions and discussions.

  • View Article
  • PubMed/NCBI
  • Google Scholar
  • 62. Wickham H. ggplot2: Elegant Graphics for Data Analysis: Springer-Verlag New York; 2016.
  • 72. Falconer DS, Mackay TF. Introduction to Quantitative Genetics. 4th ed. Harlow: Pearson Education; 1995.
  • Open access
  • Published: 24 August 2024

Spatial multi-omics: deciphering technological landscape of integration of multi-omics and its applications

  • Xiaojie Liu 1 , 3 , 4 ,
  • Ting Peng 1 , 3 , 4 ,
  • Miaochun Xu 1 , 3 , 4 ,
  • Shitong Lin 1 , 3 , 4 ,
  • Bai Hu 2 , 3 , 4 ,
  • Tian Chu 2 , 3 , 4 ,
  • Binghan Liu 1 , 3 , 4 ,
  • Yashi Xu 1 , 3 , 4 ,
  • Wencheng Ding 2 , 3 , 4 ,
  • Li Li 2 , 3 , 4 ,
  • Canhui Cao 2 , 3 , 4 &
  • Peng Wu 1 , 3 , 4  

Journal of Hematology & Oncology volume  17 , Article number:  72 ( 2024 ) Cite this article

379 Accesses

Metrics details

The emergence of spatial multi-omics has helped address the limitations of single-cell sequencing, which often leads to the loss of spatial context among cell populations. Integrated analysis of the genome, transcriptome, proteome, metabolome, and epigenome has enhanced our understanding of cell biology and the molecular basis of human diseases. Moreover, this approach offers profound insights into the interactions between intracellular and intercellular molecular mechanisms involved in the development, physiology, and pathogenesis of human diseases. In this comprehensive review, we examine current advancements in multi-omics technologies, focusing on their evolution and refinement over the past decade, including improvements in throughput and resolution, modality integration, and accuracy. We also discuss the pivotal contributions of spatial multi-omics in revealing spatial heterogeneity, constructing detailed spatial atlases, deciphering spatial crosstalk in tumor immunology, and advancing translational research and cancer therapy through precise spatial mapping.

Introduction

Single-cell sequencing has been instrumental in providing detailed insights into gene expression at the individual cell level for decades. This technique has revealed the complexity of cellular diversity, exacerbated by processes such as cell proliferation, differentiation, and death, particularly in relation to the local and distant environment of the cell [ 1 ]. Single-cell sequencing can detect cellular heterogeneity, enabling detailed analysis of individual cell behavior, mechanisms, and relationships. The high resolution of these methods has allowed for the extensive exploration and characterization of cell diversity on a large scale. However, despite these advantages, single-cell sequencing often fails to retain critical spatial information about cell populations, resulting in the loss of crucial spatial context [ 1 , 2 ].

To overcome this limitation, spatial multi-omics has emerged as a transformative technology, enabling the precise localization of cells within tissues and the quantitative measurement of gene expression in situ. This advancement marks an important technological breakthrough in life sciences and biomedicine, with wide-ranging applications in neuroscience, developmental biology, and cancer research [ 3 ]. Furthermore, spatial multi-omics allows researchers to investigate the development of multicellular organisms from single totipotent cells, as well as their function, aging, and disease progression. High-throughput multi-omics technologies, such as genomics, epigenomics, transcriptomics, proteomics, and metabolomics, have also facilitated the mapping of diverse molecular layers, significantly broadening the scope of biological analysis and our understanding of complex biological systems. In the current review, we trace the developmental timeline of spatial multi-omics technologies, highlighting their evolution and substantial contributions to modern science. Furthermore, we discuss the current state of these technologies, their integration into research, and their significant applicative value in enhancing our understanding of biological complexity.

Technologies for spatial omics

Spatial mono-omics, such as spatial transcriptomics, was recognized as the “Technology of the Year 2020” by Nature Methods magazine [ 4 ] (Fig.  1 ). Although single-cell sequencing technology has provided valuable insights into cellular heterogeneity, it lacks spatial context. Spatial multi-omics overcomes this limitation by enabling the precise localization and molecular characterization of individual cells within their tissue environments [ 5 ]. The innovation in spatial multi-omics builds upon foundational spatial mono-omics methods. In this section, we introduce key spatial mono-omics techniques (Table  1 ) and discuss their pivotal role in advancing the field of spatial multi-omics.

figure 1

Timeline of spatial multi-omics. Transcriptomics, genomics, proteomics, metabolomics, and epigenomics are included. In addition to the frequently used techniques, some emerging methods are mentioned

Spatial transcriptomics

Spatial transcriptomics has significantly enhanced our understanding of cellular organization and intra-tissue interactions based on the systematic measurement of gene expression levels across tissue space. Recent advancements in spatial transcriptomics sequencing have focused on increasing the number of detectable genes or proteins, enhancing sensitivity and resolution, simplifying operation, and expanding the size of the analyzed area. Spatial transcriptomics has been used in various fields, including cancer research [ 6 ], developmental biology [ 7 ], and disease studies [ 8 ]. Fundamentally, spatial transcriptomics technology has the ability to reveal the precise spatial localization of RNA molecules within tissues. In this section, we provide a comprehensive overview of mainstream spatial transcriptomics research strategies and summarize the strengths and limitations of these approaches.

(1) Image-based in situ transcriptomics . Image-based spatial transcriptomics primarily includes fluorescence in situ hybridization (FISH) and in situ sequencing (ISS). Recent advancements feature highly multiplexed single‐molecule FISH (smFISH), which uses reverse complementary oligo probes conjugated with fluorophores [ 9 ] for precise mRNA quantification and localization at the single-cell level [ 10 ]. The specificity of fluorescent probes to their RNA targets is critical for reliable smFISH results [ 11 ]. While smFISH can detect many transcripts due to high hybridization efficiency, signal overlap complicates barcode deconvolution. To address this issue, single-molecule imaging and multiplexed error-robust FISH (MERFISH) (Fig.  2 A) have been developed, allowing the identification of thousands of RNA species in single cells by reducing optical crowding, albeit at the cost of increased imaging rounds and time [ 12 ]. Sequential FISH (seqFISH) [ 13 , 14 ] (Fig.  2 B), an in situ three-dimensional (3D) multiplexed imaging method, also addresses optical crowding by decreasing the number of transcripts per image, requiring additional imaging rounds. Despite these advancements, smFISH is limited by the spectral overlap of fluorophores, restricting its multiplexing capabilities and its effectiveness in analyzing cell heterogeneity in complex tissues [ 15 ]. For example, Long et al. utilized seqFISH to analyze the hippocampus, identifying distinct transcriptional states by quantifying and clustering 249 genes in 16,958 cells [ 14 ], thereby demonstrating the effectiveness of this method for detailed transcriptional profiling in complex tissues.

figure 2

Technologies of spatial techniques. A The MERFISH technology, a binary barcode scheme that employs different fluorescent probes to sequentially detect each bit. B The seqFISH technology. Complete RNA in cells/tissues was imaged by multiple rounds of hybridization. Each round obtains a coded message, corresponding to a bit in the digital code, and then decodes it to correspond to each RNA. C FISSEQ incorporates amplification after reverse transcription of cellular RNA into cDNA. D STARmap is based on DNA tandem sequencing technology, using complementary pairing principle of DNA and fluorescent dye labeled nucleotide probe for sequence determination. E LCM-seq utilizes a laser beam to microdissect tissue regions under a microscope. F IGS combines in situ sequencing with high-throughput paired-end DNA sequencing. G Slide-DNA-seq is used to fragment genomic DNA in situ by tissue, and barcode connector with spatial information is added for subsequent second-generation sequencing. H CUT and Tag guides Protein A/G-Tn5 transposase to cut the target chromatin region through protein-specific antibodies such as transcription factors. At the same time, sequencing joints are added to both ends of the sequence to form a library for high-throughput sequencing by PCR amplification

Both ISH and ISS provide similar transcriptomic information, with the primary difference being that ISS-based methods directly read nucleotide sequences within tissues to identify a larger number of RNA‐targeting probes, while ISH-based methods image the sequences of barcoded FISH probes [ 9 ]. As a targeted spatial transcriptomics technology, ISS facilitates highly multiplexed in situ gene expression profiling through padlock probes, rolling circle amplification (RCA), and sequencing-by-ligation [ 16 , 17 ] chemistry combined with next-generation sequencing chemistry [ 18 ]. In ISS, reverse transcribed cDNA is hybridized with padlock probes containing gene-specific barcode sequences, which are ligated at the specific hybridization site and amplified by rolling circle amplification (RCA) with a circularized padlock primer probe [ 9 ]. Chatarina et al. developed a method that combines padlock probes with in situ target-primed rolling-circle amplification to detect and genotype individual transcripts, offering deeper insights into mRNA expression heterogeneity within single-cell populations [ 19 ]. Sequential imaging using sequencing-by-ligation allows for the identification of repeatedly amplified barcode sequences in situ, while fluorescent in situ sequencing (FISSEQ) (Fig.  2 C) employs an oligonucleotide ligation and detection substrate (SOLiD) for genome and transcriptome sequencing of DNA amplicons [ 9 , 17 ]. FISSEQ experiences fewer issues with optical crowding compared to ISH-based methods because it is less efficient at converting transcripts into cDNA in situ. However, methods that use padlock probes hybridized with target RNA species require enzyme ligations and have lower detection rates compared to multiplexed FISH methods [ 12 ]. Next-generation FISSEQ [ 20 ] was developed to complement spatially structured sequencing libraries and includes an imaging method capable of resolving amplicons, which is essential for conducting ISS of cellular RNA for gene expression profiling [ 17 ]. RNA is reverse transcribed in fixed cells with tagged random hexamers to generate cDNA amplicons within the cell, which can be repeatedly hybridized with minimal changes in signal-to-noise ratios or position [ 21 ]. RNA sequencing libraries can be visualized in different cell types, tissue sections, and whole-mount embryos, enabling 3D visualization spanning multiple resolution scales [ 17 ]. Spatially resolved transcript amplicon readout mapping (STARmap) [ 22 , 23 ] (Fig.  2 D) employs dynamic annealing and ligation (SEDAL) to reduce sequencing errors. This technology integrates hydrogel tissue chemistry, targeted signal amplification, and ISS [ 22 ], enabling high multiplexing and analysis of thicker tissue slices, although it may detect fewer transcripts in such slices [ 24 ]. BaristaSeq, an optimized padlock probe-based technique compatible with Illumina sequencing, significantly enhances amplification efficiency and sequencing accuracy, achieving at least 97% accuracy and a five-fold increase in amplification efficiency [ 25 ].

Both ISS and ISH-based methods require image processing to generate gene expression matrices. These images are segmented to create cell-level matrices, which can be done manually for small areas or systematically using computational approaches [ 3 , 26 ]. RNA hybridization-based spatial transcriptomics provides exceptional detection sensitivity [ 27 ]; however, the misassignment of mRNAs during cell segmentation is a significant source of error. To address this, the JSTA computational framework utilizes prior knowledge of cell type-specific gene expression to perform joint cell segmentation and cell type annotation, increasing the accuracy of RNA assignment by over 45% [ 28 ]. Spot-based spatial cell-type analysis by multidimensional mRNA density estimation (SSAM) is a robust cell segmentation-free computational framework that identifies cell types and tissue domains in both 2D and 3D [ 29 ].

(2) Oligonucleotide-based spatial barcoding followed by the next-generation sequencing (NGS) [ 20 ]. NGS represents a significant improvement over previous sequencing technologies, offering cost-effective, rapid sequencing with higher throughput, thereby greatly extending our genomic knowledge [ 30 ] and addressing the time and resource-intensive challenges faced by the Human Genome Project [ 31 ]. NGS technologies introduce three main improvements over first-generation sequencing. First, they rely on the preparation of NGS libraries in a cell-free system, eliminating the need for bacterial cloning of DNA fragments [ 32 ]. Second, numerous sequencing reactions are produced in parallel, enhancing efficiency [ 33 ]. Third, sequencing outputs are detected directly, with base interrogation performed cyclically and in parallel [ 34 ]. Several prominent NGS platforms have emerged, including 454 (pyrosequencing) [ 35 ], Illumina/Solexa, and Sequencing by Oligo Ligation Detection (SOLiD) [ 36 ]. The 454 approach involves the clonal amplification of DNA fragments on beads within emulsion droplets, which are then loaded into wells for sequencing using the pyrosequencing protocol [ 37 ]. This approach enables the sequencing of long reads, making it suitable for various applications, although its inherent problem in detecting homopolymers and nucleotide stretches can impact data quality as sequence volume increases [ 38 ]. Illumina/Solexa employs an array-based DNA sequencing-by-synthesis technology with reversible terminator chemistry [ 39 ]. Primers, DNA polymerase, and four differently labeled reversible terminator nucleotides are used, with each nucleotide identified by color, followed by terminator and fluorophore removal, and the cycle repeating [ 34 ]. This platform currently offers the highest throughput and lowest per-base cost, making it the leading NGS platform. In contrast, the SOLiD platform prepares sequencing libraries by emulsion polymerase chain reaction (PCR) and sequences through successive cycles of ligation [ 39 ], exhibiting the lowest error rate among the three platforms. However, NGS methods have several drawbacks, notably short reads that fail to cover full-length transcripts in eukaryotic genomes and challenges in detecting larger structural variations. Additionally, the reliance on PCR amplification can lead to difficulties in regions with extreme GC content [ 40 ]. The advent of single-molecule, third-generation sequencing technologies, such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), has resolved these issues. PacBio and ONT offer read lengths exceeding 15 kb and 30 kb, respectively, surpassing the length necessary to capture most RNA molecules in eukaryotes. Furthermore, ONT long-read sequencing does not require PCR amplification, thereby reducing potential bias [ 41 , 42 ].

(3) Laser capture microdissection (LCM) . A key challenge in transcriptomics is precise segmentation of tissues and accurate assignment of individual cells to specific locations, often resulting in the loss of spatial information [ 12 ]. LCM (Fig.  2 E), a powerful, microscope-guided cutting system that uses ultraviolet (UV) light as a contact- and contamination-free knife [ 43 ], enables accurate isolation of specific tissues or cells of interest from complex tissue structures. The combination of smart-3SEQ and LCM overcomes various experimental design challenges posed by conventional single-cell RNA-sequencing (scRNA-seq). For instance, formalin-fixed, paraffin-embedded archival clinical tissues, which are unsuitable for conventional RNA-seq due to their inability to be physically dissociated, and fresh or frozen non-archival tissue samples that lack sufficient material for clinical studies can be effectively analyzed using the LCM smart-3SEQ technique [ 11 ].

Spatial genomics

The proper functioning of tissues relies on the precise spatial organization of cell types, which is influenced by both intrinsic genetic factors and the external cellular environment. In cancer, tumor cells exhibit multiple DNA mutations and large chromosomal rearrangements, resulting in intratumor genetic heterogeneity [ 44 ]. Additionally, cells within the tumor microenvironment (TME) interact with each other, forming spatial neighborhoods with distinct biochemical and biomechanical properties. Quantifying these genetic aberrations and environmental cues within tumors is critical for understanding cancer progression and improving treatment [ 45 ]. In situ genome sequencing (IGS) (Fig.  2 F) and slide-DNA-seq (Fig.  2 G) are two exciting methods that promise to fuel the spatial genomics revolution [ 46 ]. IGS expands non-targeted genomic samples in a natural spatial environment, creating an in situ sequencing library in a fixed sample using in vitro transposon technology to fragment DNA. Hairpin DNA splices are then connected to DNA fragments to form circular DNA, which is amplified via rolling circle replication mediated by Phi29 DNA polymerase. Sequencing is performed at both in situ and ectopic sites on the circular DNA fragments [ 47 ]. IGS spatially locates paired-end sequences of the whole genome in an endogenous environment, combining sequencing and imaging to construct a genome map [ 47 ]. This technique specializes in high-resolution imaging of chromosome structure, allowing detailed analysis of tissue sections. Slide-DNA-seq enables spatially resolved sequencing of DNA from intact tissues. The process begins with generating a spatial index array composed of 3 mm beads, each containing unique DNA barcodes corresponding to specific spatial locations. This array is then read through chemical sequencing [ 48 ]. Next, a single 10 μm thick fresh-frozen tissue slice is transferred onto the sequencing bead array. Spatial barcoding is performed through photolysis, and the proximal genome fragments are attached and amplified via PCR to create a DNA sequencing library [ 49 ]. Following library construction, high-throughput paired-end sequencing is carried out, associating each genome fragment with its spatial location on the bead array using DNA barcoding [ 45 ]. Slide-DNA-seq enables detection of clonal heterogeneity, characterization of copy number variations in each clone, and analysis of their spatial distribution within tissue. This technique is particularly useful for large-scale mapping of tumor evolution, providing essential spatial context to the study of clonal heterogeneity [ 45 , 50 ]. Current methods for characterizing chromatin states or DNA within tissues on a large spatial scale are still in their infancy. The integration of spatial multi-omics technologies aims to achieve spatially resolved whole-exome or whole-genome sequencing. Ultimately, integrating various spatially resolved omics technologies will mark the beginning of the era of molecular anatomy, offering unprecedented insights into tissue organization and function [ 46 ].

Spatial proteomics

Proteomics involves the large-scale study of proteins, encompassing their expression levels, post-translational modifications, and protein–protein interactions, thereby providing a comprehensive understanding of processes such as disease occurrence and cell metabolism at the protein level. Proteins, whether in their native or modified forms, are functional units within the body, making the direct study of proteomics more valuable than relying on transcripts. Targeted localization of proteins within eukaryotic cells can redirect existing proteins to various transport pathways, including nuclear, mitochondrial, ciliary, peroxisomal, endomembrane, and vesicular transport [ 51 ], enabling rapid changes in local protein functions. Conversely, protein mislocalization is frequently associated with cellular dysfunction and diseases such as neurodegeneration, cancer [ 52 ], cystic fibrosis [ 53 ], and metabolic disorders. Therefore, researching protein localization at the subcellular level and capturing subcellular dynamics are crucial for a complete understanding of cell biology. Two primary approaches are used to acquire large-scale spatial proteomic data, including mass spectrometry (MS) and imaging-based methods.

(1) Mass spectrometry-based methods : These approaches offer accurate, proteome-wide identification and quantification of proteins. In subcellular proteomics, specific subcellular compartments are often isolated through biochemical fractionation or proximity labeling before MS analysis [ 54 ]. Key processes involve the enrichment and quantification of proteins through biochemical fractionation across different stages using MS [ 55 ]. Organelles are separated based on properties such as size, density, membrane solubility, or charge, with differential and density centrifugation being common strategies. These methods typically achieve high sensitivity and proteome coverage, although contamination from non-target proteins can occur. Ensuring adequate enrichment of the target organelle is crucial for accurate analysis. Once purified, the distribution profiles of proteins specific to different organelles can reveal the subcellular localization or complex binding of uncharacterized proteins [ 54 ]. MS analysis, combined with multivariate statistics and machine learning (ML), is widely used to handle the complex data generated in spatial proteomics [ 56 ]. These techniques compare the abundance distribution of proteins with known organelle markers to infer protein locations and trafficking pathways [ 57 ]. They can identify trends in organelle protein distribution, even in the presence of structural alterations. Proteins, which can have different morphologies and modified states, function as essential units within cells. The relationship between mRNA and corresponding protein expression is highly regulated and non-linear, making RNA expression an unreliable predictor of protein levels. Unlike the more random expression of transcripts, proteins exhibit a much lower coefficient of variation than their homologous mRNA counterparts. Therefore, directly studying proteins at the single-cell level is far more informative than using transcripts as proxies [ 58 ]. Deep visual proteomics (DVP) combines artificial intelligence-driven image analysis of cellular phenotypes with automated single-cell or single-nucleus laser microdissection and ultra-high-sensitivity MS. This technique associates protein abundance with complex cellular or subcellular phenotypes while preserving spatial context [ 59 ]. To achieve this, an ultra-sensitive liquid chromatography-mass spectrometry (LC/MS) workflow has been developed, enhancing sensitivity by up to two orders of magnitude to enable true single-cell state proteomic analysis. The data generated by DVP provide molecular insights into proteomic variation at the phenotypic level while retaining complete spatial information.

(2) Imaging-based methods : These approaches allow for the visualization of proteins in situ without requiring cell lysis or the physical separation of compartments or organelles. Unlike MS methods, which are faster and suitable for large‐scale quantitative analysis, imaging-based approaches visualize the interactions between proteins and affinity reagents. Modern microscopes can simultaneously analyze up to 50 proteins, but each protein of interest requires specific and validated antibodies, limiting high-throughput detection. To minimize the loss of soluble proteins during cellular permeability, it is best to use non-specific crosslinking in proteome-wide studies. It is increasingly evident that protein expression varies even among genetically identical cells. Imaging-based methods can capture this variation by targeting the spatial distribution of proteins at single-cell resolution [ 60 ]. However, the number of published global spatial proteomic studies remains small due to the high cost and time-consuming production of affinity reagents for entire proteomes.

MS and imaging methods each have unique advantages and disadvantages and can complement each other. MS offers high sensitivity, high resolution, and powerful quantitative analysis but involves complex and costly sample preparation. In contrast, imaging methods provide high spatial resolution and dynamic observation capabilities, enabling visualization of protein distribution but have limited quantitative abilities and cover only a small number of proteins. Combining these two techniques allows for comprehensive global protein analysis, enabling the observation of the spatial distribution and dynamic changes of key proteins for a more complete understanding of protein spatial organization.

Spatial epigenomics

Spatial epigenomics examines modifications to the DNA sequence and chromatin structure that regulate gene activity without altering the genetic code itself [ 61 ]. Nucleosomes, the fundamental units of chromosomes, are organized into higher-order chromatin structures. Epigenetic modifications, such as histone acetylation, methylation, phosphorylation, ubiquitination, and DNA methylation, play crucial roles in regulating chromatin structure and DNA accessibility [ 62 , 63 ]. These modifications impact key cellular processes, including gene transcription, DNA replication, recombination, and repair [ 61 ]. Unlike other omics fields, epigenomics relies heavily on bioinformatics to uncover the mechanisms by which the epigenome operates at the molecular level. Developing powerful, repeatable, and process-based techniques is essential for generating data that can be integrated into existing omics databases. The ultimate goal is to create a comprehensive picture of the epigenome by combining information on DNA methylation, chromatin dynamics, accessibility, and gene expression [ 64 ]. Epigenomic MERFISH combined with the recently developed Cleavage Under Targets and Tagmentation (CUT&Tag) approach (Fig.  2 H) enables the mapping of more than 100 epigenomic loci in tissues [ 65 ]. These maps can be used to study patterns of active and silent promoters and potential enhancers, providing deeper insights into the spatial organization and regulation of the epigenome [ 65 ].

Spatial metabolomics

Metabolites play a crucial role in various cellular activities, such as cell signaling, energy transfer, and intercellular communication [ 66 ]. Metabolomics is an emerging discipline that involves the qualitative and quantitative analysis of all low-molecular-weight metabolites within an organism or cell during specific physiological states [ 67 ]. Analyzing metabolites presents challenges due to their dynamic nature and susceptibility to environmental influences during cellular processes [ 68 ]. Spatial metabolomics involves the initial detection and quantification of metabolites present in biological material [ 69 ]. Depending on experimental objectives, researchers can employ either targeted approaches, focusing on quantifying specific analytes, or untargeted approaches, focusing on biomarker discovery and global metabolite profiling [ 70 ].

(1) Targeted metabolomics [ 71 ]: This approach analyzes specific subsets of compounds to address particular biochemical questions or hypotheses. The two primary methods include Fourier transfer mass spectrometry (FT-MS) [ 72 ] and nuclear magnetic resonance (NMR) [ 73 ], both of which offer significant advantages in data acquisition due to their specificity and quantitative reproducibility. FT-MS generates mass data for infused samples, allowing for the identification and matching of metabolites with entries in metabolomics databases. The major drawback of this method is that it does not establish a one-to-one correspondence relationship between entities, which means that a single data point can potentially match with multiple metabolites. NMR produces signals based on the chemical environment of protons present in each metabolite, enabling tentative identification [ 74 ]. The development of triple quadrupole (QqQ) MS provides a robust and sensitive method for high-throughput measurement of a substantial number of biologically significant metabolites. This technique is particularly effective for quantifying low-concentration metabolites that are difficult to detect using NMR [ 75 ].

(2) Untargeted metabolomics : Untargeted metabolomics aims to globally analyze biological compounds, permitting the simultaneous detection of as many metabolites as possible and the exploration of cellular biochemical pathways. LC/MS is the most commonly used platform for untargeted metabolomics [ 76 , 77 ], producing numerous signals during the detection of biological samples. The structural diversity of metabolites is vast, and the acquired data often include both known and unknown metabolites. When searching metabolomics databases for the mass–charge ratio of each detected feature, only a small percentage match the database entries, making the identification of unknown metabolites challenging [ 78 ]. The number of detected unknown metabolites is often overestimated due to several factors. A high concentration of 13 C can cause a mass shift, leading to the detection of multiple features for a single metabolite through naturally occurring isotopes. Additionally, a single metabolite can be ionized into various adducts, including isomers, increasing the demand for selective analytical techniques. Furthermore, metabolites can fragment or form non-covalent interactions with other metabolites upon entering the mass spectrometer. These factors collectively increase the complexity and diversity of detected metabolites [ 79 ].

LC/MS data analysis addresses the complexity of metabolite detection through two main approaches: (1) grouping metabolites with similar features and (2) annotating the type of ion species. These steps facilitate the identification of excimer ions, which are essential for further metabolite identification, such as determining elemental composition or conducting tandem MS based on accurate mass and isotope patterns. CAMERA (an integrated strategy for compound spectral extraction and annotation of LC/MS datasets) can effectively identify most features corresponding to isotopes, adducts, and fragments [ 80 ]. Isotopic labeling methods can also be used to identify and analyze isotope ratio outliers [ 81 ]. Despite these advancements, many metabolites remain uncharacterized. Variations in metabolites within the cellular environment are closely linked to health and disease development. Metabolomics enhances disease analysis at the genomic and protein levels by providing semi-quantitative and quantitative measurements of metabolite levels, which serve as chemical mediators defining specific phenotypes [ 70 ]. The rapid expansion of omics technologies has provided holistic molecular information, enabling the comprehensive study of biological systems. Small molecules and metabolites are essential for numerous cellular functions [ 82 ], offering unique insights into the phenotypic characteristics associated with genome sequences [ 83 ].

Integration of spatial multi-omics

While single-cell multi-omics yields valuable insights into gene regulation across various omics layers [ 84 , 85 ], it lacks the spatial information necessary for understanding cellular functions within tissues. Recently, spatial transcriptomics, proteomics, genomics, epigenomics and metabolomics have emerged, with extensive application in various fields [ 86 , 87 ]. These techniques typically capture only one layer of omics information, and computational methods for integrating data from different omics layers cannot fully overcome the lack of mechanistic links between them. Spatial multi-omics enables the simultaneous analysis of multiple data modalities, such as transcriptomics, proteomics, genomics, epigenomics, and metabolomics, with the same tissue section (Table 2 ).

Integration of spatial transcriptomics and (epi)genomics

Spatial ATAC&RNA-seq and spatial CUT&Tag RNA-seq have revolutionized genome-wide co-mapping of the epigenome and transcriptome by simultaneously profiling chromatin accessibility and mRNA expression, or histone modifications and mRNA expression, respectively. These technologies integrate the chemistry of spatial ATAC-seq or CUT&Tag with spatial transcriptomics on the same tissue section at the cellular level via deterministic co-barcoding [ 88 ], combining microfluidic deterministic barcoding in tissue (DBiT) strategies for spatial ATAC-seq [ 89 ] and CUT&Tag [ 90 ] with DBiT-seq poly(A) transcript profiling [ 91 ]. Spatial-ATAC-seq enables high-spatial-resolution genome-wide mapping of chromatin accessibility in tissue at the cellular level by applying a spatial barcoding scheme to DNA oligomers inserted into accessible genomic loci by Tn5 transposition [ 89 ]. This technology advances our understanding of cell identity, cell state, and cell fate decisions related to epigenetic bases in development and disease. Spatial-CUT&Tag analyzes spatial histone modification profiling at the pixel level on frozen tissue sections without requiring dissociation. This method addresses spatially distinct and cell type-specific chromatin modifications during mouse embryonic organogenesis and postnatal brain development, adding a new dimension to spatial biology by mapping epigenetic regulation related to development and disease [ 90 ]. DBiT-seq creates a 2D grid of spatially barcoded tissue pixels, each defined by a unique combination of barcodes A and B [ 88 ]. After reverse crosslinking, barcoded complementary DNA and genomic DNA fragments are released, and NGS constructs separate libraries for gDNA and cDNA. Sequencing reads are then combined with microscopy images of the tissue section based on spatial barcodes, allowing multi-omics sequence information to be spatially mapped [ 88 ]. These techniques have been applied to co-map embryonic and juvenile mouse brains, as well as the adult human brain. Spatially resolved, genome-wide co-sequencing of the epigenome and transcriptome at the cellular level provides an informative tool for a wide range of biological and biomedical research. Transcriptomics focuses on gene expression from the perspective of mRNA, presenting a global perspective on molecular dynamic changes induced by environmental factors or pathogenic agents [ 92 ]. Benefiting from mature in situ RNA hybridization strategies, targeted capture of DNA sequences or chromosomal loci facilitates spatial genomics detection. DNA-seq FISH + can be applied for studying the spatial structure of the genome based on multi-round probe hybridization imaging. Takei et al. [ 93 ] reported the imaging of 3660 chromosomal sites in a single mouse embryonic stem cell (ES) using DNA-seq FISH + and the imaging of 17 chromatin markers and subnuclear structures by sequential immunofluorescence and expression profiles of 70 RNAs. Genomic regions and chromosomes associated with nuclear bodies and chromatin marks in different cells were revealed by genomic regions. Some of these regions appear to be related to cell types, whereas others (mostly spot-related regions) are more conserved among different cell types [ 46 ].

Integration of spatial proteomics and transcriptomics

Single-cell multi-omics has been highly successful in capturing diverse biological processes at the level of individual cells and nuclei but lacks spatial information [ 94 ]. Gene expression is regulated at multiple levels, from transcription to protein degradation, with RNA and protein levels conveying distinct information about gene function and cell state. These processes occur in various contexts, such as tumors and single-cell suspensions [ 95 ]. Recent progress in spatial in situ profiling has enabled the simultaneous profiling of location and expression. Spatial transcriptomics provides a global spatial tissue profile and has been applied to the study of diverse diseases. Spatial proteomics acquires large-scale spatial proteomic data through MS- and imaging-based experimental approaches. However, few platforms have successfully integrated spatial proteomics and transcriptomics data. Vickovic et al. [ 96 ] developed Spatial Multi-Omics (SM-Omics), an end-to-end framework that leverages a liquid handling platform for high-throughput transcriptome and antibody-based spatial tissue profiling. Using DNA-barcoded antibodies, this automated system enables the simultaneous profiling of the epitopes and transcriptomes within single cells, offering detailed molecular characterization of tissues in situ by quantifying both spatial transcriptomics and multiplex protein detection [ 96 ]. Compared to Visium by 10X Genomics, SM-Omics provides an automated workflow that extends combined spatial transcriptomics and antibody-based protein measurements into a scalable all-sequencing-based technology.

NanoString GeoMx Digital Spatial Profiler (DSP) facilitates high-plex profiling at both the protein and RNA level, permitting spatial and temporal assessment of tumors in frozen or formalin-fixed, paraffin-embedded limited tissue samples [ 97 ]. This platform quantifies protein or RNA abundance by counting unique indexing oligos assigned to each target of interest, using oligonucleotides to study a higher number of biomarkers. Additionally, DSP is a non-destructive technique, allowing the same slides to be used for subsequent studies after the assay is completed [ 97 ].

Spatial co-indexing of transcriptomes and epitopes (Spatial-CITE-seq) offers high-plex protein and whole-transcriptome co-mapping. This approach involves the staining of a tissue slide with a cocktail of approximately 200–300 antibody-derived tags (ADTs), followed by deterministic in-tissue barcoding of both DNA tags and mRNAs. Each tag contains a unique spatial address code AiBj (i = 1 − 50, j = 1 − 50), co-indexing all protein epitopes and the transcriptome. Barcoded cDNAs are subsequently retrieved, refined, and amplified via PCR to create two NGS libraries for paired-end sequencing of ADTs and mRNAs. This process enables computational reconstruction of spatial protein or gene-expression maps [ 98 ].

Integration of spatial transcriptomics and metabolomics

Gene expression and metabolite distribution in tissues are influenced by a variety of factors, including cell type, microenvironment, signaling pathways, and gene regulation. To elucidate the interplay among these factors, it is essential to employ methods that can simultaneously measure molecular evidence of different patterns in tissues while preserving spatial distribution information. Researchers have developed a spatial multimodal analysis (SMA) protocol that combines spatially resolved transcriptomics and mass spectrometry imaging (MSI) in a single tissue slice, while maintaining the specificity and sensitivity of both analytical methods [ 88 ]. This integrated approach reveals associations and heterogeneities between transcriptomes and metabolomes across different tissue regions. Combining spatial transcriptomic and metabolomic data, Vicari et al. identified a reduced proportion of midbrain dopaminergic neurons (MBDOP2) in the lesioned substantia nigra pars compacta and ventral tegmental area, and specified the localization of multiple neurotransmitters and metabolites, including taurine, 3-methoxytyramine, 3,4-dihydroxy-phenylacetaldehyde (DOPAL), 3,4-dihydroxy-phenylacetic acid, norepinephrine, serotonin, histidine, tocopherol, and gamma-aminobutyric acid [ 88 ]. Oral submucous fibrosis (OSF) is a well‐established precancerous lesion, but the molecular mechanisms underlying its malignant transformation into oral squamous cell carcinoma (OSCC) remain unclear [ 99 ]. Yuan et al. integrated spatial transcriptomics and metabolomics to obtain spatial location information on cancer cells, fibroblasts, and immune cells, as well as the transcriptomic and metabolomic landscapes of OSF-derived OSCC tissues. Moreover, they revealed the malignant progression from in situ carcinoma (ISC) to partial epithelial-mesenchymal transformation (pEMT), and identified significant metabolic reprogramming, including abnormal polyamine metabolism, which may play a key role in promoting tumorigenesis and immune escape [ 100 ]. Zheng et al. [ 101 ] combined spatial transcriptomic and metabolic analyses to reveal metabolic heterogeneity and complex transcriptome regulation in injured human brain tissue, facilitating the design of reagents for functional analysis of specific genes. The simultaneous application of these advanced technologies reveals the spatial composition of functional maps within tissues, heterogeneous distribution of cell populations, and differential gene expression in different locations. This comprehensive spatial expression mapping of genes holds significant research value and potential for advancing our understanding of complex biological systems.

Integration of spatial transcriptomics, genomics, and proteomics

The integration of spatial multi-omics aims to expand our understanding of mechanistic relationships across different omics layers and uncover molecular roles essential for cellular function by jointly profiling the transcriptome, genome, epigenome, proteome, and metabolome. Spatially resolved joint analysis of multi-omics can facilitate the identification of novel cell subtypes and measurement of intracellular and intercellular molecular interactions [ 102 ]. Therefore, the need for advanced spatial multi-omics methods has become increasingly important. Multi-omics in situ pairwise sequencing (MiP-seq) is a high-throughput targeted in situ sequencing technique that simultaneously detects multiplexed DNA, RNA, proteins, and biomolecules at subcellular resolution, providing comprehensive data for studying cellular functions and disease mechanisms [ 103 ]. The in situ detection of proteins and biomolecules is achieved using padlocking probes that target antibody-conjugated nucleic acids, while the detection of DNA and RNA is accomplished through direct padlock probes targeting nucleic acids [ 103 ]. Compared to current in situ sequencing methods, MiP-seq utilizes a pairwise-sequencing strategy and dual barcoded padlock probes, markedly increasing decoding capacity and requiring fewer sequencing rounds (10 N vs. 4 N ). Consequently, MiP-seq can reduce sequencing time by approximately 50%, lower sequencing and imaging costs, and minimize laser damage, thereby improving signal decoding accuracy, a key issue in in situ sequencing [ 104 ]. MiP-seq has been applied to mouse brain tissue, enabling the in situ detection of Rbfox3 and Nr4a1 gene loci, which are located on different chromosomes and spatially localized within the nucleus. MiP-seq has also been used to study PK-15 cells co-infected with porcine circovirus 2 (PCV2) and classical swine fever virus (CSFV), simultaneously detecting mRNA from eight cytokine or chemokine genes and two virus-specific proteins (CSFV E2 protein and PCV2 Cap protein) by binding antibodies to nucleic acids [ 103 ]. Thus, MiP-seq demonstrates versatility and high sensitivity in multi-omics in situ analysis, detecting specific DNA sequences, RNA transcripts, and proteins at single-cell resolution, and is a powerful tool for studying cell function, disease mechanisms, and cell–cell interactions in complex biological systems.

Applications of spatial multi-omics

Deciphering spatial-specific atlas production of molecular and cellular profiles.

A comprehensive spatial-specific atlas of molecular and cellular profiles in both healthy and diseased states is essential for developing new therapeutic targets and disease interventions (Fig.  3 A). Spatial transcriptomics combined with single-cell sequencing has been widely used to decipher molecular profiles. Fang et al. constructed a spatial atlas of the human middle and superior temporal gyrus using MERFISH, revealing differences in the cellular composition of these cortical regions between humans and mice [ 105 ]. Single-nucleus RNA-seq (snRNA-seq), single-nucleus assay for transposase-accessible chromatin with sequencing (snATAC-seq) [ 106 ], and spatial transcriptomics have been applied to generate a spatially resolved multi-omics single-cell atlas of the entire human maternal–fetal interface, including the myometrium, enabling resolution of the full trajectory of trophoblast differentiation [ 107 ]. Kuppe et al. used snRNA-seq and spatial transcriptomics to create an integrative high-resolution map of cardiac remodeling, enhancing the spatial resolution of cell-type composition and providing spatially resolved insights into the cardiac transcriptome and epigenome with identification of distinct cellular zones of injury, repair, and remodeling [ 106 ]. Advanced spatial epigenome-transcriptome co-sequencing has revealed how epigenetic mechanisms control transcriptional phenotypes and cell dynamics at both spatial and genome-wide levels, providing new insights into spatial epigenetic initiation, differentiation, and gene regulation within tissue structures. Spatial ATAC-RNA-seq and spatial CUT&Tag-RNA-seq were first introduced in analyzing mouse embryos, successfully distinguishing each organ with epigenetic and transcriptome data [ 88 ]. In some mouse brain tissue regions, the epigenetic signature of certain genes persisted with development, but the gene expression was different. In addition, the results of the joint analysis also found that epigenetic regulation and gene expression in different regions of the brain of young mice have unexpected correlations, and that different epigenetic features can cooperate with each other to regulate gene expression. The integration of spatial multi-omics not only opens a new field of spatial omics but also provides novel research avenues for biological and biomedical research.

figure 3

Applications of spatial-based technologies. Spatial multi-omics technology is employed to investigate various cell biology. This diagram provides an overview of the application of spatial multi-omics. A Spatial-based molecular and cellular atlas. B Spatial-based heterogeneity in human diseases. C Spatial-related crosstalk in tumor immunology. D Spatial trajectory and lineage tracking in human diseases. E Potential targets for therapeutic applications. F Reproduction and development research

Spatial multi-omics decodes spatial-based heterogeneity in human diseases

The complex interactions among tumor cells, surrounding tissues, infiltrating innate immune cells, and adaptive immune cells create a unique environment characterized by inter-related, coexisting, and competitive dynamics [ 108 ]. The characteristics of this tumor immune microenvironment vary significantly due to both intrinsic (e.g., tumor type) and extrinsic factors (e.g., environment). Tumor heterogeneity plays a crucial role in enabling tumor cells to adapt to changes in the microenvironment, thereby promoting tumor resistance and progression (Fig.  3 B).

Tumor heterogeneity includes both intratumor and intertumor heterogeneity [ 109 ]. Metastatic prostate cancer exhibits a wide spectrum of diverse phenotypes, but the extent of these heterogeneities has not yet been established [ 110 ]. Brady et al. integrated spatial transcriptomics and proteomics to analyze multiple discrete areas of metastases, discovering heterogeneity among tumors at different metastatic sites and within the same site. They also identified significant intra-patient heterogeneity in regions with varying androgen receptor (AR) and neuroendocrine activity. Most metastases lacked significant inflammatory infiltrates and PD1, PD-L1, and CTLA4 expression, while the B7-H3/CD276 immune checkpoint protein was highly expressed, particularly in metastatic prostate cancers with high AR activity [ 111 ]. These findings correlate with the clinical observation that metastatic prostate cancers often fail to respond to immune checkpoint blockade therapies such as anti-CTLA4, PD1, and PD-L1 antibodies, suggesting that B7-H3/CD276 could be a potential therapeutic target. Non-small cell lung cancer (NSCLC) is characterized by substantial heterogeneity among individual tumors and within regions of a single tumor [ 112 ]. Intratumor heterogeneity has been shown to contribute to treatment failure and drug resistance through the expansion of pre-existing resistant subclones [ 113 , 114 ]. Previous studies using multi-region profiling to decode the spatial patterns of heterogeneity were limited by the small number of regions analyzed per tumor [ 115 ]. Wu et al. employed multi-region matrix-assisted laser desorption ionization-time of flight (MALDI-TOF), cyclic immunofluorescence (CyCIF), and multi-region single-cell copy number sequencing to conduct spatial multi-omics analysis of tumors from 147 lung adenocarcinoma patients. They developed a novel analysis approach to quantify intratumor spatial heterogeneity: clustered geographic diversification (GD), where molecularly similar cells cluster together, and random GD, where molecularly similar cells are randomly distributed. Patients with random GD exhibited higher recurrence rates and risk of death, characterized by fewer tumor-interacting endothelial cells, higher infiltrating immune cells, and similar GD patterns observed in both proteomic and genomic data [ 116 ], providing insights into spatial heterogeneity and innovative ideas for cancer research. A non-targeted MALDI-MSI analysis [ 117 ] followed by spatial segmentation using different algorithms allowed to highlight molecular heterogeneity among glioblastomas. Three sub-regions were identified (A, B and C regions). Duhamel et al. performed a spatially resolved proteomic analysis to decode the biological pathways involved in these three regions: region A is enriched in genes related to neurotransmission and synaptogenesis; proteins overexpressed in region B are associated with immune infiltration; region C identified proteins involved in RNA processing and metabolism. Finally, they identified PPP1R12A and RPS14 are favorable prognostic markers while ALCAM, ANXA11, and AltProt IP_652563 are unfavorable prognostic markers [ 118 ]. These results highlight the potential of spatial proteomics and spatial metabolomics to decipher the molecular heterogeneity of glioblastoma and identify markers associated with survival.

Understanding how reprogrammed metabolic networks impact tumor growth is crucial for identifying metabolic vulnerabilities that improve cancer treatment. Sun et al. [ 119 ] combined mass spectrometry imaging-based spatial metabolomics and lipid-omics with microarray-based spatial transcriptomics [ 120 ] to visualize intratumor metabolic heterogeneity and cell metabolic interactions within the same gastric cancer sample. They imaged tumor-associated metabolic reprogramming at metabolic-transcriptional levels, linking marker metabolites, lipids, and genes within metabolic pathways and colocalizing them in heterogeneous cancer tissues. The integrated data revealed unique transcriptional features and significant immune-metabolic changes at the tumor invasion frontier. Furthermore, glutamine was overutilized in tumor tissue, genes related to lipids, fatty acid synthesis (FA), and fatty acid elongation were enriched in the tumor tissue region, and long chain polyunsaturated fatty acids were significantly up-regulated in borderline lymphoid tissue, even exceeding levels in tumor tissues [ 121 ]. These findings enhance our understanding of tumor molecular mechanisms and potential targets for cancer therapy. Spatial multi-omics technology accurately depicts gene expression in different tumor tissue locations, addressing the lack of spatial context in single-cell sequencing. Thus, the advancement of spatial omics provides essential support for exploring tumor immune microenvironment dynamics and identifying corresponding therapeutic targets.

Novel insights from spatial multi-omics analyze spatial-related crosstalk in tumor immunology

Spatial multi-omics has provided new perspectives on the complex interactions within the tumor microenvironment. Tumor tissue comprises various cell types, including epithelial, endothelial, fibroblast, vascular smooth muscle, resident immune, and infiltrating immune cells, all of which interact within a 3D environment to support cancer cell growth [ 122 ] (Fig.  3 C). By integrating mass spectrometry imaging-based spatial metabolomics and lipidomics with microarray-based spatial transcriptomics, researchers have identified a distinct interface at the junction of tumors and neighboring tissues, termed cluster9, within which peritumoral lymphoid tissue (PLT) and distal lymphoid tissue (DLT) are defined [ 121 ]. The PLT exhibits significantly increased uptake and metabolism of glutamine, as well as certain fatty acids, essential for tumor energy metabolism and signaling. Genes associated with fatty acid synthesis, such as FASN, SCD, and ELOVL, as well as ALOX5AP, which promotes arachidonic acid metabolism into leukotriene inflammatory mediators, are also up-regulated in PLT. These results suggest that PLT has a stronger inflammatory response than DLT and inhibits tumor cell proliferation [ 121 ]. Identification of this crosstalk between PLT and tumor cells has enhanced our understanding of tumor molecular mechanisms. ScRNA-seq studies on glioblastomas have highlighted the dynamic plasticity across cellular states [ 123 ], including mesenchymal-like (MES-like), neural progenitor cell-like (NPC-like), astrocyte-like (AC-like), and oligodendrocytic precursor cell-like (OPC-like) states, which are markers of malignant brain tumors [ 124 ]. However, single-cell analysis provides only indirect inferences about cell interactions, often neglecting the role of the local microenvironment in tumorigenesis. Ravi et al. [ 125 ] utilized spatial transcriptomics, metabolomics, and proteomics to quantify the relationship between tumor cells and myeloid and lymphoid cells, discovering increased interactions in inflammation-related gene-rich areas and confirming enhanced interactions between tumor cells and virus-free compartments within transcriptionally defined reactive immune regions. Annika et al. [ 126 ] combined spatial multi-omics and scRNA-seq data from epithelial and stromal compartments to examine immune cell composition during intestinal damage and regeneration, finding that activated B cells decreased and disrupted the essential crosstalk between stromal and epithelial cells during mucosal healing. Spatiotemporal multi-omics allows for consideration of the microenvironment in cell–cell crosstalk studies, enhancing the accuracy of research findings.

Spatial trajectory and lineage tracking in human diseases

Lineage tracking technology is crucial for studying the developmental trajectory and differentiation process of cells (Fig.  3 D). This technology can help determine how individual cells differentiate from a founder cell and how they evolve during development and disease [ 127 ]. Traditionally, lineage tracing involves labeling cells with heritable marks and tracking the trajectory of their offspring. The diversity of cell types produced from a founder cell reflects its differentiation potential. To predict the potential and evolutionary trajectory of founder cells, a wide array of markers is needed for accurate cell type classification. However, the limited availability of markers can mask the variability within cell subsets expressing the selected marker genes [ 128 ], potentially biasing the interpretation of organ complexity. Spatial transcriptomics not only enables comprehensive transcriptomic analysis of thousands of cells but also offers considerable insights into the spatiotemporal relationships among cells. This approach enhances cell-type identification, deepening our understanding of organizational complexity [ 129 ]. By constructing transcriptional atlases of adult tissues and developing embryos, spatial transcriptomics reveals the molecular mechanisms underlying differentiation from stem cells to mature cells. This detailed record elucidates the sequence of events and molecular mechanisms by which cells attain their final identity in embryogenesis or tissue regeneration. It also provides clues to the origins of developmental pathologies and cancer, allowing intervention in pathogenic pathways and replication of cell differentiation processes in vitro [ 130 ]. Densely sampling cells at various stages can describe state manifolds, which visualize the continuum of cell state changes in a multidimensional space and the trajectory of cell differentiation. To understand the instantaneous state of the cell, it is necessary to consider its molecular composition, inter-relationships, tissue position, and physical and regulatory interactions with surrounding cells. This comprehensive approach provides deeper insights into the state and function of cells [ 130 ]. Given the complexity of cells within different species, lineage tracing has expanded to include additional approaches, such as tracer dyes, cell transplantation, and in vivo genetic recombination. Advances in confocal and light-sheet microscopy have enabled the direct tracking of individual cell division patterns in complex vertebrates. However, these methods are limited to only a few measurements of cell state. Recent spatial transcriptomics approaches overcome spectral limitations by allowing genome-scale measurements in fixed in situ samples. High-throughput sequencing employs DNA sequence barcodes to encode clonal information, which can later be read and integrated with other sequence-based omics data. Zhang et al. applied single-cell and spatial transcriptomics to demonstrate extensive diversification of cells from a few multipotent progenitors to numerous differentiated cell states, including several novel cell populations. Furthermore, they identified lineage-specific clusters radiating from the center of six mesenchymal states and active transcription factor network modules associated with the progression of each lineage. They also observed that chondrocyte lineages increased over time, shifting from progenitor cells to more mature clusters [ 131 ]. Bao et al. [ 132 ] revealed that microglia and perivascular macrophages exhibit parallel differentiation processes, although the developmental origins of other tissue-resident macrophages require further exploration using single-cell and spatial transcriptomics. Spatial multi-omics have been applied in several fields, such as tumor progression, immune-associated diseases and metabolism-related disorders. Renal fibrosis, a critical pathological feature in chronic kidney disease progression, has significant global health implications. Spatial multi-omics techniques, such as Cut&Tag with DBiT-Seq [ 133 ], have been crucial in elucidating the complex epigenetic reprogramming during the transition from acute kidney injury to chronic kidney disease, underscoring the importance of multi-omics in understanding and addressing renal fibrosis pathogenesis [ 134 ]. The integration of imaging and sequencing-based omics has led to significant progress in spatial technologies, enabling spatially resolved single-cell detection [ 135 ]. These technologies preserve spatial resolution and large fields of view, allowing for detailed analysis of the microenvironment, spatial neighborhoods, and niche networks in kidney injury. Compatibility with formalin-fixed, paraffin-embedded tissue also facilitates the establishment of kidney injury cohorts, filling a critical gap in prognostic research [ 136 ].

Investigation of new therapies via spatial multi-omics

Targeting nucleotide metabolism is a well-established metabolic therapy in clinical oncology and practice [ 137 ]. However, efforts to target non-nucleotide metabolism in clinical trials have faced challenges due to drug toxicity, inconsistent dietary interventions, lack of biomarkers, and imprecise combination treatments, collectively leading to suboptimal trial outcomes. Additionally, cells within the TME can significantly influence treatment efficacy and undergo substantial changes during tumor progression and treatment response [ 138 ]. Therefore, developing biomarker-guided personalized precision metabolic therapies and targeted metabolic reprogramming is critical to improve the sensitivity of cancer therapy. Rational combinations of chemotherapy, radiation therapy, and other targeted therapies should also be considered. Integrating spatial multi-omics could enhance our understanding of tumor metabolic regulation, offering new therapeutic targets and identifying diagnostic and prognostic markers for various diseases.

Through multi-omics analysis of patients with triple-negative breast cancer (TNBC), researchers discovered that Clostridiales and the associated metabolite trimethylamine N-oxide (TMAO) induce pyroptosis in tumor cells by activating the endoplasmic reticulum stress kinase PERK, which amplifies CD8 + T cell-mediated antitumor immunity in vivo. These findings suggest that microbial metabolites, such as TMAO or its precursor choline, could serve as a new therapeutic strategy to enhance the efficacy of TNBC treatment [ 139 ], offering insights into the crosstalk between microbiota and metabolite immunology. Metastasis remains the leading cause of death in patients with breast cancer; however, the dynamic changes in dissemination evolution remain poorly understood. High-resolution technologies, such as spatial transcriptomics and metabolomics, have been used to map the metabolic landscape. Combined spatial transcriptomics and scRNA-seq have revealed metabolic changes in tumor cells during their transition from the primary site to the leading edge and metastatic lymph nodes, highlighting the potential of incorporating metabolic therapies in treating breast cancer with lymph node metastasis [ 140 ]. Eclipta prostrata L. [ 141 ] has long been used in traditional medicine for its liver-protective properties. Wedelolactone (WEL) and demethylwedelolactone (DWEL) are the primary coumarins found in E. prostrata L. Using a mature thioacetamide (TAA)-induced zebrafish model, Chen et al. integrated spatial metabolomics and transcriptomics and discovered that both WEL and DWEL can improve metabolic disorders induced by nonalcoholic fatty liver disease (NAFLD), primarily through the regulatory effects of WEL on steroid biosynthesis and fatty acid metabolism. Their study successfully mapped the biological distribution and metabolic characteristics of these compounds in zebrafish, revealing the unique mechanisms of WEL and DWEL in improving NAFLD and proposing a multi-omics platform to develop highly effective compounds that improve therapeutic outcomes [ 142 ]. Previous studies have highlighted the role of ferroptosis in a variety of neurological diseases [ 143 ], although its precise role in multiple sclerosis (MS) remained uncertain. Wu et al. integrated data from snRNA‐seq, spatial transcriptomics, and spatial proteomics to define a computational metric of ferroptosis levels and identify the ferroptosis landscape in neuroimmunity and neurodegeneration in MS patients [ 144 ]. Results showed that active lesion edges exhibited the highest ferroptosis scores, associated with phagocyte system activation, while remyelination lesions had the lowest scores. Elevated ferroptosis scores were also observed in cortical neurons, linked to multiple neurodegenerative disease-related pathways [ 144 ], while significant co-localization was detected between ferroptosis scores, neurodegeneration, and microglia. They also established a diagnostic model for MS based on 24 ferroptosis-related genes in peripheral blood. These findings suggest that ferroptosis may play a dual role in MS, associated with both neuroimmunological and neurodegenerative processes, making it a promising therapeutic target and diagnostic marker for MS. Vedolizumab (VDZ) is known to inhibit lymphocyte trafficking to the intestine and is effective in treating ulcerative colitis (UC). However, its broader effects on other cell subsets are less understood. Using comprehensive spatial transcriptomic and proteomic phenotyping, Mennillo et al. identified mononuclear phagocytes as an important cell type impacted by anti-integrin therapy in UC and revealed changes in the spatial distribution of cell subpopulations in tissues before and after VDZ treatment [ 145 ]. Notably, they highlighted the cellular and genetic factors of UC and VDZ therapy, potentially aiding in the development of more precise treatment strategies and the prediction of treatment responses (Fig.  3 E).

Multi-omics in reproduction and development research

Mammalian fertilization begins with the fusion of an oocyte and a sperm cell [ 146 ], with the reproductive system creating an environment for embryonic development (Fig.  3 F). In-depth exploration of the reproductive system requires an understanding of the function of each cell type and their interactions. Spatial multi-omics techniques have been used to examine interactions between adjacent cells and gametes or embryos within the natural tissue environment, preserving the spatial context of the analyzed cells. These technologies have the potential to transform our understanding of mammalian reproduction [ 147 ]. Winkler et al. used scRNA-seq and spatial transcriptomics to profile the remodeling of the female reproductive tract during the estrous cycle, decidualization, and aging and discovered that fibroblasts play a central and organ-specific role in female reproductive tract remodeling by coordinating extracellular matrix (ECM) recombination and inflammation. They also revealed the unexpected costs of repeated remodeling required during reproduction and illustrated how estrus, pregnancy, and aging collectively shape the female reproductive tract [ 148 ]. Yang et al. conducted scRNA-seq, scATAC-seq, and spatial transcriptomic analyses of fetal samples from gestational week (GW) 13–18, generating a large-scale multi-omics atlas of the developing human fetal cerebellum. They found that PARM1 exhibits inconsistent distribution in human and mouse granulosa cells, and identified gene regulatory networks that control the diversity of Purkinje cells and unipolar brush cells [ 149 ]. These key regulatory factors can be harnessed in vitro to generate small brain cells for future clinical applications and enhance our understanding of the link between molecular variation and cell types in neurodevelopmental disorders. Li et al. employed scRNA-seq, spatial transcriptomics, and hybridization-based in situ sequencing to analyze 16 human embryonic and fetal spinal cord samples from post-conceptional weeks 5–12, providing a comprehensive atlas of developmental cells and identifying novel molecular targets and genetic regulation of childhood spinal cancer stem cells [ 150 ] (Table  3 ).

Perspectives

The rapidly evolving field of spatial omics technologies aims to achieve higher resolution, deeper coverage, greater multiplexity, and enhanced versatility in analyzing diverse samples, including formalin-fixed, paraffin-embedded, fresh-frozen, and living tissues. These advancements enable 3D reconstruction of larger tissue regions and comprehensive analysis of spatiotemporal multi-omics, enhancing our understanding of the complex molecular mechanisms underlying cellular interactions within tissues. Effective acquisition, manipulation, analysis, and visualization of spatial omics data are critical components for their successful application. Integrating datasets from different omics modalities is essential to unlock their synergistic potential, although this is challenging due to differing spatial features of the data. Consequently, there is an urgent need for specialized hardware and software to visualize these complex datasets effectively. Key steps include normalizing data matrices, removing low-quality data, improving signal-to-noise ratios, smoothing data to increase sensitivity, and eliminating unwanted technical and biological variations. Developing an independent benchmark of spatial omics integration algorithms should greatly assist researchers in selecting appropriate integration strategies and designing experiments. Without suitable analytical tools, even costly experiments can yield unusable data. To mitigate bias, the scientific community must provide open datasets for comparative analysis of tissues and develop novel methods for accurate detection or capture efficiency. The path to widespread adoption of these technologies remains long. A thorough understanding of the cellular and molecular mechanisms within specific normal or pathogenic microenvironments is crucial for advancing personalized precision medicine. This approach is anticipated to become the primary treatment option in the near future. Expected advancements include increased throughput, reduced costs, integration of more detection modes, and enhanced sensitivity and specificity. Ultimately, multi-omics techniques with spatial single-cell resolution will revolutionize our understanding of cell biology.

Conclusions

The integration of multi-omics with spatial analysis is a rapidly evolving field that holds great promise for a wide range of applications. Spatial multi-omics enables a deeper understanding of complex biological systems, providing novel insights into disease mechanisms, drug target identification, and biomarker discovery. However, integrating multi-omics data presents technical challenges, necessitating advanced computational and statistical methods. Moreover, the interpretation of spatial multi-omics data is further complicated by spatially varying environmental factors and technical noise. Thus, the development of sophisticated computational tools and analytical methods capable of managing large-scale spatial multi-omics datasets is essential for fully leveraging the potential of this approach.

Availability of data and materials

No datasets were generated or analysed during the current study.

Baysoy A, Bai Z, Satija R, Fan R. The technological landscape and applications of single-cell multi-omics. Nat Rev Mol Cell Biol. 2023. https://doi.org/10.1038/s41580-023-00615-w .

Article   PubMed   PubMed Central   Google Scholar  

Wang WJ, Chu LX, He LY, Zhang MJ. Spatial transcriptomics: recent developments and insights in respiratory research. Mil Med Res. 2023. https://doi.org/10.1186/s40779-023-00471-x .

Rao A, Barkley D, França GS, Yanai I. Exploring tissue architecture using spatial transcriptomics. Nature. 2021. https://doi.org/10.1038/s41586-021-03634-9 .

Marx V. Method of the Year: spatially resolved transcriptomics. Nat Methods. 2021. https://doi.org/10.1038/s41592-020-01033-y .

Wang M, Hu Q, Lv T, Wang Y. High-resolution 3D spatiotemporal transcriptomic maps of developing Drosophila embryos and larvae. Dev Cell. 2022. https://doi.org/10.1016/j.devcel.2022.04.006 .

Jin Y, Zuo Y, Li G, Liu W. Advances in spatial transcriptomics and its applications in cancer research. Mol Cancer. 2024. https://doi.org/10.1186/s12943-024-02040-9 .

Choe K, Pak U, Pang Y, Hao W. Advances and challenges in spatial transcriptomics for developmental biology. Biomolecules. 2023. https://doi.org/10.3390/biom13010156 .

Cao J, Li C, Cui Z, Deng S. Spatial transcriptomics: a powerful tool in disease understanding and drug discovery. Theranostics. 2024. https://doi.org/10.7150/thno.95908 .

Park H-E, Jo SH, Lee RH, Macks CP. Spatial transcriptomics: technical aspects of recent developments and their applications in neuroscience and cancer research. Advanced Science. 2023. https://doi.org/10.1002/advs.202206939 .

Piskadlo E, Bastian Th, Eichenberger LG, Chao JA. Design, labeling, and application of probes for RNA smFISH. In: Scheiffele P, Mauger O, editors. Alternative Splicing: Methods and Protocols. Springer; 2022. p. 173–83. https://doi.org/10.1007/978-1-0716-2521-7_10 .

Chapter   Google Scholar  

Lee C, Roberts SE, Gladfelter AS. Quantitative spatial analysis of transcripts in multinucleate cells using single-molecule FISH. Methods. 2016. https://doi.org/10.1016/j.ymeth.2015.12.007 .

Article   PubMed   Google Scholar  

Park HE, Jo SH, Lee RH, Macks CP. Spatial transcriptomics: technical aspects of recent developments and their applications in neuroscience and cancer research. Adv Sci (Weinh). 2023. https://doi.org/10.1002/advs.202206939 .

Shah S, Lubeck E, Zhou W, Cai L. seqFISH accurately detects transcripts in single cells and reveals robust spatial organization in the hippocampus. Neuron. 2017. https://doi.org/10.1016/j.neuron.2017.05.008 .

Shah S, Lubeck E, Zhou W, Cai L. In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus. Neuron. 2016. https://doi.org/10.1016/j.neuron.2016.10.001 .

Kalhor K, Chen CJ, Lee HS, Cai M. Mapping human tissues with highly multiplexed RNA in situ hybridization. Nat Commun. 2024. https://doi.org/10.1038/s41467-024-46437-y .

Ke R, Mignardi M, Pacureanu A, Svedlund J. In situ sequencing for RNA analysis in preserved tissue and cells. Nat Methods. 2013. https://doi.org/10.1038/nmeth.2563 .

Lee JH, Daugharthy ER, Scheiman J, Kalhor R. Highly multiplexed subcellular RNA sequencing in situ. Science. 2014. https://doi.org/10.1126/science.1250212 .

Tang X, Chen J, Zhang X, Liu X. Improved in situ sequencing for high-resolution targeted spatial transcriptomic analysis in tissue sections. J Genet Genom. 2023. https://doi.org/10.1016/j.jgg.2023.02.004 .

Article   Google Scholar  

Larsson C, Grundberg I, Söderberg O, Nilsson M. In situ detection and genotyping of individual mRNA molecules. Nat Methods. 2010. https://doi.org/10.1038/nmeth.1448 .

Liao J, Lu X, Shao X, Zhu L. Uncovering an organ’s molecular architecture at single-cell resolution by spatially resolved transcriptomics. Trends Biotechnol. 2021. https://doi.org/10.1016/j.tibtech.2020.05.006 .

Lee JH, Daugharthy ER, Scheiman J, Kalhor R. Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues. 2015. Nat Protoc. https://doi.org/10.1038/nprot.2014.191 .

Wang X, Allen WE, Wright MA, Sylwestrak EL. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. 2018. Science. https://doi.org/10.1126/science.aat5691 .

Koch L. Transcriptomics in intact tissues. 2018. Nat Rev Genet. https://doi.org/10.1038/s41576-018-0045-7 .

Robles-Remacho A, Sanchez-Martin RM, Diaz-Mochon JJ. Spatial transcriptomics: emerging technologies in tissue gene expression profiling. Anal Chem. 2023. https://doi.org/10.1021/acs.analchem.3c02029 .

Chen X, Sun Y-C, Church GM, Lee JH. Efficient in situ barcode sequencing using padlock probe-based BaristaSeq. Nucleic Acids Res. 2018. https://doi.org/10.1093/nar/gkx1206 .

Fang S, Chen B, Zhang Y, Sun H. Computational approaches and challenges in spatial transcriptomics. Genom Proteom Bioinform. 2023. https://doi.org/10.1016/j.gpb.2022.10.001 .

Lubeck E, Coskun AF, Zhiyentayev T, Ahmad M. Single-cell in situ RNA profiling by sequential hybridization. Nat Methods. 2014. https://doi.org/10.1038/nmeth.2892 .

Littman R, Hemminger Z, Foreman R, Arneson D. Joint cell segmentation and cell type annotation for spatial transcriptomics. Mol Syst Biol. 2021. https://doi.org/10.15252/msb.202010108 .

Park J, Choi W, Tiesmeyer S, Long B. Cell segmentation-free inference of cell types from in situ transcriptomics data. Nat Commun. 2021. https://doi.org/10.1038/s41467-021-23807-4 .

Cullum R, Alder O, Hoodless PA. The next generation: using new sequencing technologies to analyse gene regulation. Respirology. 2011. https://doi.org/10.1111/j.1440-1843.2010.01899.x .

Van Dijk EL, Auger H, Jaszczyszyn Y, Thermes C. Ten years of next-generation sequencing technology. Trends Genet. 2014. https://doi.org/10.1016/j.tig.2014.07.001 .

Yang Y, Xie B, Yan J. Application of next-generation sequencing technology in forensic science. Genom Proteom Bioinform. 2014. https://doi.org/10.1016/j.gpb.2014.09.001 .

Nguyen L, Burnett L. Automation of molecular-based analyses: a primer on massively parallel sequencing. Clin Biochem Rev. 2014;35(3):169.

PubMed   PubMed Central   Google Scholar  

Van Dijk EL, Auger H, Jaszczyszyn Y, Thermes C. Ten years of next-generation sequencing technology. Trends In Genetics : TIG. 2014. https://doi.org/10.1016/j.tig.2014.07.001 .

Rothberg JM, Leamon JH. The development and impact of 454 sequencing. Nat Biotechnol. 2008. https://doi.org/10.1038/nbt1485 .

Liu L, Li Y, Li S, Hu N. Comparison of next-generation sequencing systems. J Biomed Biotechnol. 2012. https://doi.org/10.1155/2012/251364 .

Margulies M, Egholm M, Altman WE, Attiya S. Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005;437(7057):376–80.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Gilles A, Meglécz E, Pech N, Ferreira S. Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing. BMC Genomics. 2011. https://doi.org/10.1186/1471-2164-12-245 .

Xuan J, Yu Y, Qing T, Guo L. Next-generation sequencing in the clinic: promises and challenges. Cancer Lett. 2013. https://doi.org/10.1016/j.canlet.2012.11.025 .

Van Dijk EL, Jaszczyszyn Y, Naquin D, Thermes C. The third revolution in sequencing technology. Trends Genet TIG. 2018. https://doi.org/10.1016/j.tig.2018.05.008 .

Kumar KR, Cowley MJ, Davis RL. Next-generation sequencing and emerging technologies. Semin Thromb Hemost. 2019. https://doi.org/10.1055/s-0039-1688446 .

Udaondo Z, Sittikankaew K, Uengwetwanit T, Wongsurawat T. Comparative analysis of PacBio and oxford nanopore sequencing technologies for transcriptomic landscape identification of penaeus monodon. Life (Basel). 2021. https://doi.org/10.3390/life11080862 .

Piskadlo E, Eichenberger BT, Giorgetti L, Chao JA. Design, labeling, and application of probes for RNA smFISH. Methods Mol Biol. 2022. https://doi.org/10.1007/978-1-0716-2521-7_10 .

Hosea R, Hillary S, Naqvi S, Wu S. The two sides of chromosomal instability: drivers and brakes in cancer. Signal Transduct Target Ther. 2024. https://doi.org/10.1038/s41392-024-01767-7 .

Zhao T, Chiang ZD, Morriss JW, Lafave LM. Spatial genomics enables multi-modal study of clonal heterogeneity in tissues. Nature. 2022. https://doi.org/10.1038/s41586-021-04217-4 .

Bouwman BAM, Crosetto N, Bienko M. The era of 3D and spatial genomics. Trends Genet. 2022. https://doi.org/10.1016/j.tig.2022.05.010 .

Payne AC, Chiang ZD, Reginato PL, Mangiameli SM. In situ genome sequencing resolves DNA sequence and structure in intact biological samples. 2021. Science. https://doi.org/10.1126/science.aay3446 .

Rodriques SG, Stickels RR, Goeva A, Martin CA. Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019. https://doi.org/10.1126/science.aaw1219 .

Stickels RR, Murray E, Kumar P, Li J. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat Biotechnol. 2021. https://doi.org/10.1038/s41587-020-0739-1 .

Gerstung M, Jolly C, Leshchiner I, Dentro SC. The evolutionary history of 2,658 cancers. Nature. 2020. https://doi.org/10.1038/s41586-019-1907-7 .

Bauer NC, Doetsch PW, Corbett AH. Mechanisms regulating protein localization. Traffic. 2015. https://doi.org/10.1111/tra.12310 .

Banworth MJ, Li G. Consequences of Rab GTPase dysfunction in genetic or acquired human diseases. Small GTPases. 2018. https://doi.org/10.1080/21541248.2017.1397833 .

Bridges RJ, Bradbury NA. Cystic fibrosis, cystic fibrosis transmembrane conductance regulator and drugs: insights from cellular trafficking. Handb Exp Pharmacol. 2018. https://doi.org/10.1007/164_2018_103 .

Christopher JA, Stadler C, Martin CE, Morgenstern M. Subcellular proteomics. Nat Rev Methods Primers. 2021. https://doi.org/10.1038/s43586-021-00029-y .

Fossati A, Li C, Uliana F, Wendt F. PCprophet: a framework for protein complex prediction and differential analysis using proteomic data. Nat Methods. 2021. https://doi.org/10.1038/s41592-021-01107-5 .

Mou M, Pan Z, Lu M, Sun H. Application of machine learning in spatial proteomics. J Chem Inf Model. 2022. https://doi.org/10.1021/acs.jcim.2c01161 .

Orre LM, Vesterlund M, Pan Y, Arslan T. SubCellBarCode: proteome-wide mapping of protein localization and relocalization. Mol Cell. 2019. https://doi.org/10.1016/j.molcel.2018.11.035 .

Mund A, Brunner AD, Mann M. Unbiased spatial proteomics with single-cell resolution in tissues. Mol Cell. 2022. https://doi.org/10.1016/j.molcel.2022.05.022 .

Mund A, Coscia F, Kriston A, Hollandi R. Deep Visual Proteomics defines single-cell identity and heterogeneity. Nat Biotechnol. 2022. https://doi.org/10.1038/s41587-022-01302-5 .

Lundberg E, Borner GHH. Spatial proteomics: a powerful discovery tool for cell biology. Nat Rev Mol Cell Biol. 2019. https://doi.org/10.1038/s41580-018-0094-y .

Xu J, Liu Y. A guide to visualizing the spatial epigenome with super-resolution microscopy. FEBS J. 2019. https://doi.org/10.1111/febs.14938 .

Chen X, Xu H, Shu X, Song CX. Mapping epigenetic modifications by sequencing technologies. Cell Death Differ. 2023. https://doi.org/10.1038/s41418-023-01213-1 .

Zhang Y, Sun Z, Jia J, Du T. Overview of histone modification. Adv Exp Med Biol. 2021. https://doi.org/10.1007/978-981-15-8104-5_1 .

Wang KC, Chang HY. Epigenomics: technologies and applications. Circ Res. 2018. https://doi.org/10.1161/CIRCRESAHA.118.310998 .

Schueder F, Bewersdorf J. Omics goes spatial epigenomics. Cell. 2022. https://doi.org/10.1016/j.cell.2022.10.014 .

Baker SA, Rutter J. Metabolites as signalling molecules. Nat Rev Mol Cell Biol. 2023. https://doi.org/10.1038/s41580-022-00572-w .

Chen Y, Li EM, Xu LY. Guide to metabolomics analysis: a bioinformatics workflow. Metabolites. 2022. https://doi.org/10.3390/metabo12040357 .

Alseekh S, Aharoni A, Brotman Y, Contrepois K. Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices. Nat Methods. 2021. https://doi.org/10.1038/s41592-021-01197-1 .

Lee DY, Bowen BP, Northen TR. Mass spectrometry-based metabolomics, analysis of metabolite-protein interactions, and imaging. Biotechniques. 2010. https://doi.org/10.2144/000113451 .

Gertsman I, Barshop BA. Promises and pitfalls of untargeted metabolomics. J Inherit Metab Dis. 2018. https://doi.org/10.1007/s10545-017-0130-7 .

Roberts LD, Souza AL, Gerszten RE, Clish CB. Targeted metabolomics. Curr Protoc Mol Biol. 2012. https://doi.org/10.1002/0471142727.mb3002s98 .

Ghaste M, Mistrik R, Shulaev V. Applications of Fourier transform ion cyclotron resonance (FT-ICR) and orbitrap based high resolution mass spectrometry in metabolomics and lipidomics. Int J Mol Sci. 2016. https://doi.org/10.3390/ijms17060816 .

Dona AC, Kyriakides M, Scott F, Shephard EA. A guide to the identification of metabolites in NMR-based metabonomics/metabolomics experiments. Comput Struct Biotechnol J. 2016. https://doi.org/10.1016/j.csbj.2016.02.005 .

Dudley E, Yousef M, Wang Y, Griffiths WJ. Targeted metabolomics and mass spectrometry. Adv Protein Chem Struct Biol. 2010. https://doi.org/10.1016/B978-0-12-381264-3.00002-3 .

Patti GJ, Yanes O, Siuzdak G. Innovation: metabolomics: the apogee of the omics trilogy. Nat Rev Mol Cell Biol. 2012. https://doi.org/10.1038/nrm3314 .

Zhou B, Xiao JF, Tuli L, Ressom HW. LC-MS-based metabolomics. Mol Biosyst. 2012. https://doi.org/10.1039/c1mb05350g .

Thomas SN, French D, Jannetto PJ, Rappold BA. Liquid chromatography-tandem mass spectrometry for clinical diagnostics. Nat Rev Methods Primers. 2022. https://doi.org/10.1038/s43586-022-00175-x .

Duncan KD, Fyrestam J, Lanekoff I. Advances in mass spectrometry based single-cell metabolomics. Analyst. 2019. https://doi.org/10.1039/c8an01581c .

Zamboni N, Saghatelian A, Patti GJ. Defining the metabolome: size, flux, and regulation. Mol Cell. 2015. https://doi.org/10.1016/j.molcel.2015.04.021 .

Kuhl C, Tautenhahn R, Böttcher C, Larson TR. CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. Anal Chem. 2012. https://doi.org/10.1021/ac202450g .

Mahieu NG, Huang X, Chen Y Jr, Patti GJ. Credentialing features: a platform to benchmark and optimize untargeted metabolomic methods. Anal Chem. 2014. https://doi.org/10.1021/ac503092d .

Bingol K. Recent advances in targeted and untargeted metabolomics by NMR and MS/NMR methods. High Throughput. 2018. https://doi.org/10.3390/ht7020009 .

Rueedi R, Mallol R, Raffler J, Lamparter D. Metabomatching: using genetic association to identify metabolites in proton NMR spectroscopy. PLoS Comput Biol. 2017. https://doi.org/10.1371/journal.pcbi.1005839 .

Chen S, Lake BB, Zhang K. High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell. Nat Biotechnol. 2019. https://doi.org/10.1038/s41587-019-0290-0 .

Bian X, Wang W, Abudurexiti M, Zhang X. Integration analysis of single-cell multi-omics reveals prostate cancer heterogeneity. Adv Sci (Weinh). 2024. https://doi.org/10.1002/advs.202305724 .

Chen A, Liao S, Cheng M, Ma K. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022. https://doi.org/10.1016/j.cell.2022.04.003 .

Lu T, Ang CE, Zhuang X. Spatially resolved epigenomic profiling of single cells in complex tissues. Cell. 2022. https://doi.org/10.1016/j.cell.2022.09.035 .

Vicari M, Mirzazadeh R, Nilsson A, Shariatgorji R. Spatial multimodal analysis of transcriptomes and metabolomes in tissues. Nat Biotechnol. 2023. https://doi.org/10.1038/s41587-023-01937-y .

Deng Y, Bartosovic M, Ma S, Zhang D. Spatial profiling of chromatin accessibility in mouse and human tissues. Nature. 2022. https://doi.org/10.1038/s41586-022-05094-1 .

Deng Y, Bartosovic M, Kukanja P, Zhang D. Spatial-CUT&Tag: spatially resolved chromatin modification profiling at the cellular level. Science. 2022. https://doi.org/10.1126/science.abg7216 .

Liu Y, Yang M, Deng Y, Su G. High-spatial-resolution multi-omics sequencing via deterministic barcoding in tissue. Cell. 2020. https://doi.org/10.1016/j.cell.2020.10.026 .

Chen C, Wang J, Pan D, Wang X. Applications of multi-omics analysis in human diseases. MedComm. 2023. https://doi.org/10.1002/mco2.315.6 .

Takei Y, Yun J, Zheng S, Ollikainen N. Integrated spatial genomics reveals global architecture of single nuclei. Nature. 2021. https://doi.org/10.1038/s41586-020-03126-2 .

Slyper M, Porter CBM, Ashenberg O, Waldman J. A single-cell and single-nucleus RNA-Seq toolbox for fresh and frozen human tumors. Nat Med. 2020. https://doi.org/10.1038/s41591-020-0844-1 .

Stoeckius M, Hafemeister C, Stephenson W, Houck-Loomis B. Simultaneous epitope and transcriptome measurement in single cells. Nat Methods. 2017. https://doi.org/10.1038/nmeth.4380 .

Vickovic S, Lötstedt B, Klughammer J, Mages S. SM-Omics is an automated platform for high-throughput spatial multi-omics. Nat Commun. 2022. https://doi.org/10.1038/s41467-022-28445-y .

Hernandez S, Lazcano R, Serrano A, Powell S. Challenges and opportunities for immunoprofiling using a spatial high-plex technology: The NanoString GeoMx(®) digital spatial profiler. Front Oncol. 2022. https://doi.org/10.3389/fonc.2022.890410 .

Liu Y, Distasio M, Su G, Asashima H. Spatial-CITE-seq: spatially resolved high-plex protein and whole transcriptome co-mapping. Res Sq. 2022. https://doi.org/10.21203/rs.3.rs-1499315/v1 .

Qin X, Ning Y, Zhou L, Zhu Y. Oral submucous fibrosis: etiological mechanism, malignant transformation, therapeutic approaches and targets. Int J Mol Sci. 2023. https://doi.org/10.3390/ijms24054992 .

Zhi Y, Wang Q, Zi M, Zhang S. Spatial transcriptomic and metabolomic landscapes of oral submucous fibrosis-derived oral squamous cell carcinoma and its tumor microenvironment. Adv Sci (Weinh). 2024. https://doi.org/10.1002/advs.202306515 .

Zheng P, Zhang N, Ren D, Yu C. Integrated spatial transcriptome and metabolism study reveals metabolic heterogeneity in human injured brain. Cell Rep Med. 2023. https://doi.org/10.1016/j.xcrm.2023.101057 .

Tang L. Spatially resolved multiomics. Nat Methods. 2023. https://doi.org/10.1038/s41592-023-02110-8 .

Wu X, Xu W, Deng L, Li Y. Spatial multi-omics at subcellular resolution via high-throughput in situ pairwise sequencing. Nat Biomed Eng. 2024. https://doi.org/10.1038/s41551-024-01205-7 .

Muster B, Rapp A, Cardoso MC. Systematic analysis of DNA damage induction and DNA repair pathway activation by continuous wave visible light laser micro-irradiation. AIMS Genet. 2017. https://doi.org/10.3934/genet.2017.1.47 .

Fang R, Xia C, Close JL, Zhang M. Conservation and divergence of cortical cell organization in human and mouse revealed by MERFISH. Science. 2022. https://doi.org/10.1126/science.abm1741 .

Kuppe C, Ramirez Flores RO, Li Z, Hayat S. Spatial multi-omic map of human myocardial infarction. Nature. 2022. https://doi.org/10.1038/s41586-022-05060-x .

Arutyunyan A, Roberts K, Troulé K, Wong FCK. Spatial multiomics map of trophoblast development in early pregnancy. Nature. 2023. https://doi.org/10.1038/s41586-023-05869-0 .

Hsieh W-C, Budiarto BR, Wang Y-F, Lin C-Y. Spatial multi-omics analyses of the tumor immune microenvironment. J Biomed Sci. 2022. https://doi.org/10.1186/s12929-022-00879-y .

Prasetyanti PR, Medema JP. Intra-tumor heterogeneity from a cancer stem cell perspective. Mol Cancer. 2017. https://doi.org/10.1186/s12943-017-0600-4 .

Haffner MC, Zwart W, Roudier MP, True LD. Genomic and phenotypic heterogeneity in prostate cancer. Nat Rev Urol. 2021. https://doi.org/10.1038/s41585-020-00400-w .

Brady L, Kriner M, Coleman I, Morrissey C. Inter- and intra-tumor heterogeneity of metastatic prostate cancer determined by digital spatial gene expression profiling. Nat Commun. 2021. https://doi.org/10.1038/s41467-021-21615-4 .

Jamal-Hanjani M, Wilson GA, Mcgranahan N, Birkbak NJ. Tracking the evolution of non-small-cell lung cancer. N Engl J Med. 2017. https://doi.org/10.1056/NEJMoa1616288 .

Black JRM, Mcgranahan N. Genetic and non-genetic clonal diversity in cancer evolution. Nat Rev Cancer. 2021. https://doi.org/10.1038/s41568-021-00336-2 .

Martínez-Ruiz C, Black JRM, Puttick C, Hill MS. Genomic-transcriptomic evolution in lung cancer and metastasis. Nature. 2023. https://doi.org/10.1038/s41586-023-05706-4 .

Gerlinger M, Rowan AJ, Horswell S, Math M. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med. 2012. https://doi.org/10.1056/NEJMoa1113205 .

Wu HJ, Temko D, Maliga Z, Moreira AL. Spatial intra-tumor heterogeneity is associated with survival of lung adenocarcinoma patients. Cell Genom. 2022. https://doi.org/10.1016/j.xgen.2022.100165 .

Ren Z, Qin L, Chen L, Xu H. Spatial lipidomics of EPSPS and PAT transgenic and non-transgenic soybean seeds using matrix-assisted laser desorption/ionization mass spectrometry imaging. J Agric Food Chem. 2023. https://doi.org/10.1021/acs.jafc.3c01377 .

Duhamel M, Drelich L, Wisztorski M, Aboulouard S. Spatial analysis of the glioblastoma proteome reveals specific molecular signatures and markers of survival. Nat Commun. 2022. https://doi.org/10.1038/s41467-022-34208-6 .

Neumann JM, Niehaus K, Neumann N, Knobloch HC. A new technological approach in diagnostic pathology: mass spectrometry imaging-based metabolomics for biomarker detection in urachal cancer. Lab Invest. 2021. https://doi.org/10.1038/s41374-021-00612-7 .

Moncada R, Barkley D, Wagner F, Chiodin M. Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas. Nat Biotechnol. 2020. https://doi.org/10.1038/s41587-019-0392-8 .

Sun C, Wang A, Zhou Y, Chen P. Spatially resolved multi-omics highlights cell-specific metabolic remodeling and interactions in gastric cancer. Nat Commun. 2023. https://doi.org/10.1038/s41467-023-38360-5 .

Bechtel TJ, Reyes-Robles T, Fadeyi OO, Oslund RC. Strategies for monitoring cell–cell interactions. Nat Chem Biol. 2021. https://doi.org/10.1038/s41589-021-00790-x .

Neftel C, Laffy J, Filbin MG, Hara T. An integrative model of cellular states, plasticity, and genetics for glioblastoma. Cell. 2019. https://doi.org/10.1016/j.cell.2019.06.024 .

Couturier CP, Ayyadhury S, Le PU, Nadaf J. Single-cell RNA-seq reveals that glioblastoma recapitulates a normal neurodevelopmental hierarchy. Nat Commun. 2020. https://doi.org/10.1038/s41467-020-17186-5 .

Ravi VM, Will P, Kueckelhaus J, Sun N. Spatially resolved multi-omics deciphers bidirectional tumor-host interdependence in glioblastoma. Cancer Cell. 2022. https://doi.org/10.1016/j.ccell.2022.05.009 .

Frede A, Czarnewski P, Monasterio G, Tripathi KP. B cell expansion hinders the stroma-epithelium regenerative cross talk during mucosal healing. Immunity. 2022. https://doi.org/10.1016/j.immuni.2022.11.002 .

Zhang Y, Zeng F, Han X, Weng J. Lineage tracing: technology tool for exploring the development, regeneration, and disease of the digestive system. Stem Cell Res Ther. 2020. https://doi.org/10.1186/s13287-020-01941-y .

Kester L, Van Oudenaarden A. Single-cell transcriptomics meets lineage tracing. Cell Stem Cell. 2018. https://doi.org/10.1016/j.stem.2018.04.014 .

Vanhorn S, Morris SA. Next-generation lineage tracing and fate mapping to interrogate development. Dev Cell. 2021. https://doi.org/10.1016/j.devcel.2020.10.021 .

Wagner DE, Klein AM. Lineage tracing meets single-cell omics: opportunities and challenges. Nat Rev Genet. 2020. https://doi.org/10.1038/s41576-020-0223-2 .

Zhang B, He P, Lawrence JEG, Wang S. A human embryonic limb cell atlas resolved in space and time. Nature. 2023. https://doi.org/10.1038/s41586-023-06806-x .

Bao Y, Wang G, Li H. Approaches for studying human macrophages. Trends Immunol. 2024. https://doi.org/10.1016/j.it.2024.02.007 .

Su G, Qin X, Enninful A, Bai Z. Spatial multi-omics sequencing for fixed tissue via DBiT-seq. STAR Protoc. 2021. https://doi.org/10.1016/j.xpro.2021.100532 .

Liao X, Scheidereit E, Kuppe C. New tools to study renal fibrogenesis. Curr Opin Nephrol Hypertens. 2024. https://doi.org/10.1097/mnh.0000000000000988 .

Park J, Kim J, Lewy T, Rice CM. Spatial omics technologies at multimodal and single cell/subcellular level. Genome Biol. 2022. https://doi.org/10.1186/s13059-022-02824-6 .

Li Z, Lu Y, Yang L. Imaging and spatial omics of kidney injury: significance, challenges, advances and perspectives. Med Rev. 2023. https://doi.org/10.1515/mr-2023-0046 .

Mullen NJ, Singh PK. Nucleotide metabolism: a pan-cancer metabolic dependency. Nat Rev Cancer. 2023. https://doi.org/10.1038/s41568-023-00557-7 .

Arner EN, Rathmell JC. Metabolic programming and immune suppression in the tumor microenvironment. Cancer Cell. 2023. https://doi.org/10.1016/j.ccell.2023.01.009 .

Wang H, Rong X, Zhao G, Zhou Y. The microbial metabolite trimethylamine N-oxide promotes antitumor immunity in triple-negative breast cancer. Cell Metab. 2022. https://doi.org/10.1016/j.cmet.2022.02.010 .

Liu Y-M, Ge J-Y, Chen Y-F, Liu T. Combined single-cell and spatial transcriptomics reveal the metabolic evolvement of breast cancer during early dissemination. Adv Sci. 2023. https://doi.org/10.1002/advs.202205395 .

Tang HH, Li HL, Li YX, You Y. Protective effects of a traditional Chinese herbal formula Jiang-Xian HuGan on Concanavalin A-induced mouse hepatitis via NF-κB and Nrf2 signaling pathways. J Ethnopharmacol. 2018. https://doi.org/10.1016/j.jep.2018.02.003 .

Chen P, Zhu Z, Geng H, Cui X. Integrated spatial metabolomics and transcriptomics decipher the hepatoprotection mechanisms of wedelolactone and demethylwedelolactone on non-alcoholic fatty liver disease. J Pharm Anal. 2024. https://doi.org/10.1016/j.jpha.2023.11.017 .

Rothammer N, Woo MS, Bauer S, Binkle-Ladisch L. G9a dictates neuronal vulnerability to inflammatory stress via transcriptional control of ferroptosis. Sci Adv. 2022. https://doi.org/10.1126/sciadv.abm5500 .

Wu T, Ning S, Zhang H, Cao Y. Role of ferroptosis in neuroimmunity and neurodegeneration in multiple sclerosis revealed by multi-omics data. J Cell Mol Med. 2024. https://doi.org/10.1111/jcmm.18396 .

Mennillo E, Kim YJ, Lee G, Rusu I. Single-cell and spatial multi-omics highlight effects of anti-integrin therapy across cellular compartments in ulcerative colitis. Nat Commun. 2024. https://doi.org/10.1038/s41467-024-45665-6 .

Xu R, Li C, Liu X, Gao S. Insights into epigenetic patterns in mammalian early embryos. Protein Cell. 2021. https://doi.org/10.1007/s13238-020-00757-z .

Zhang X, Cao Q, Rajachandran S, Grow EJ. Dissecting mammalian reproduction with spatial transcriptomics. Hum Reprod Update. 2023. https://doi.org/10.1093/humupd/dmad017 .

Winkler I, Tolkachov A, Lammers F, Lacour P. The cycling and aging mouse female reproductive tract at single-cell resolution. Cell. 2024. https://doi.org/10.1016/j.cell.2024.01.021 .

Yang F, Zhao Z, Zhang D, Xiong Y. Single-cell multi-omics analysis of lineage development and spatial organization in the human fetal cerebellum. Cell Discov. 2024. https://doi.org/10.1038/s41421-024-00656-1 .

Li X, Andrusivova Z, Czarnewski P, Langseth CM. Profiling spatiotemporal gene expression of the developing human spinal cord and implications for ependymoma origin. Nat Neurosci. 2023. https://doi.org/10.1038/s41593-023-01312-9 .

Chen KH, Boettiger AN, Moffitt JR, Wang S. RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells. Science. 2015. https://doi.org/10.1126/science.aaa6090 .

Su JH, Zheng P, Kinrot SS, Bintu B. Genome-scale imaging of the 3D organization and transcriptional activity of chromatin. Cell. 2020. https://doi.org/10.1016/j.cell.2020.07.032 .

Jiang F, Zhou X, Qian Y, Zhu M. Simultaneous profiling of spatial gene expression and chromatin accessibility during mouse brain development. Nat Methods. 2023. https://doi.org/10.1038/s41592-023-01884-1 .

Liu S, Iorgulescu JB, Li S, Borji M. Spatial maps of T cell receptors and transcriptomes reveal distinct immune niches and interactions in the adaptive immune response. Immunity. 2022. https://doi.org/10.1016/j.immuni.2022.09.002 .

Liu Y, Distasio M, Su G, Asashima H. High-plex protein and whole transcriptome co-mapping at cellular resolution with spatial CITE-seq. Nat Biotechnol. 2023. https://doi.org/10.1038/s41587-023-01676-0 .

Srivastava AK, Wang Y, Huang R, Skinner C. Human genome meeting 2016: Houston, TX, USA. 28 February–2 March 2016. Hum Genom. 2016. https://doi.org/10.1186/s40246-016-0063-5 .

Shah S, Takei Y, Zhou W, Lubeck E. dynamics and spatial genomics of the nascent transcriptome by intron seqFISH. Cell. 2018. https://doi.org/10.1016/j.cell.2018.05.035 .

Xia C, Fan J, Emanuel G, Hao J. Spatial transcriptome profiling by MERFISH reveals subcellular RNA compartmentalization and cell cycle-dependent gene expression. Proc Natl Acad Sci U S A. 2019. https://doi.org/10.1073/pnas.1912459116 .

Vicari M, Mirzazadeh R, Nilsson A, Shariatgorji R. Spatial multimodal analysis of transcriptomes and metabolomes in tissues. Nat Biotechnol. 2024. https://doi.org/10.1038/s41587-023-01937-y .

Liu YM, Ge JY, Chen YF, Liu T. Combined single-cell and spatial transcriptomics reveal the metabolic evolvement of breast cancer during early dissemination. Adv Sci (Weinh). 2023. https://doi.org/10.1002/advs.202205395 .

Download references

Acknowledgements

Thank you for drawing materials from SMART—Servier Medical ART and SciDraw | Scientific Drawings.

The National Key R&D Program of China (2021YFC2701201 to P.W.), the Natural Science Foundation of China (82072895 and 82372929 to P.W., 82141106 to M.D., 82203453 to C. C.), and Foundation of Tongji Hospital (24-2KYC13057-08 to C. C.).

Author information

Authors and affiliations.

Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

Xiaojie Liu, Ting Peng, Miaochun Xu, Shitong Lin, Binghan Liu, Yashi Xu & Peng Wu

Department of Gynecologic Oncology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

Bai Hu, Tian Chu, Wencheng Ding, Li Li & Canhui Cao

National Clinical Research Center for Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

Xiaojie Liu, Ting Peng, Miaochun Xu, Shitong Lin, Bai Hu, Tian Chu, Binghan Liu, Yashi Xu, Wencheng Ding, Li Li, Canhui Cao & Peng Wu

Cancer Biology Research Center (Key Laboratory of the Ministry of Education), Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

You can also search for this author in PubMed   Google Scholar

Contributions

P. W. and X. L. conceived the whole article and completed the original draft preparation. P. W. and C. C. performed major reviews and edited the structure of the article. The primary drawing effort was done by T. P. and C. X. During subsequent revisions of the article, S. L., B. H., T. C., B. L., Y. X., W. D., and L. L. have touched up the language of the article and participated in the revision process of the later article. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Canhui Cao or Peng Wu .

Ethics declarations

Consent for publication.

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .

Reprints and permissions

About this article

Cite this article.

Liu, X., Peng, T., Xu, M. et al. Spatial multi-omics: deciphering technological landscape of integration of multi-omics and its applications. J Hematol Oncol 17 , 72 (2024). https://doi.org/10.1186/s13045-024-01596-9

Download citation

Received : 22 May 2024

Accepted : 09 August 2024

Published : 24 August 2024

DOI : https://doi.org/10.1186/s13045-024-01596-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Spatial multi-omics
  • Heterogeneity
  • Spatial-specific atlas
  • Lineage tracking
  • New therapy
  • Reproduction

Journal of Hematology & Oncology

ISSN: 1756-8722

essay introduction about genes

Royal Society of Chemistry

Fungal diversity and key functional gene abundance in Iowa bioretention cells: implications for stormwater remediation potential †

ORCID logo

First published on 20th August 2024

Stormwater bioretention cells are green stormwater infrastructure systems that can help mitigate flooding and remove contaminants. Plants and bacteria improve nutrient removal and degrade organic contaminants; however, the roles of fungi in bioretention cells are less known. Although mycorrhizal fungi aid in plant growth/improve nutrient uptake, there is a notable lack of research investigating fungal diversity in bioretention cells. Other types of fungi could benefit bioretention cells ( e.g. , white rot fungi degrade recalcitrant contaminants). This study addresses the knowledge gap of fungal function and diversity within stormwater bioretention cells. We collected multiple soil samples from 27 different bioretention cells in temperate-climate eastern Iowa USA, characterized soil physicochemical parameters, sequenced the internal transcribed spacer (ITS) amplicon to identify fungal taxa from extracted DNA, and measured functional gene abundances for two fungal laccases ( Cu1 , Cu1A ) and a fungal nitrite reductase gene ( nirKf ). Fungal biodegradation functional genes were present in bioretention soils (mean copies per g: 7.4 × 10 5 nirKf , 3.2 × 10 6 Cu1 , 4.0 × 10 8 Cu1A ), with abundance of fungal laccase and fungal nitrite reductase genes significantly positively correlated with soil pH and organic matter (Pearson's R : >0.39; rho < 0.05). PERMANOVA analysis determined soil characteristics were not significant explanatory variables for community composition (beta diversity). In contrast, planting specifications significantly impacted fungal diversity; the presence/absence of a few planting types and predominant vegetation type in the cell explained 89% of variation in fungal diversity. These findings further emphasize the importance of plants and media as key design parameters for bioretention cells, with implications for fungal bioremediation of captured stormwater contaminants.

Bioretention cells capture and treat urban runoff laden with organic contaminants, and fungi may be a useful bioremediation approach. We characterized fungal communities in eastern Iowa USA, quantified biodegradation function genes, and correlated plant/soil types with fungal diversity. The taxa that include white/brown rot fungi capable of biodegrading recalcitrant contaminants were highly represented, with plant type a significant driver of fungal diversity. Fungal laccase/nitrite reductase functional genes were significantly correlated with soil pH and organic matter. We provide new evidence that diverse fungi are present within bioretention cells and elucidate plantings as key influences. Implications include optimizing planting/media types in green stormwater infrastructure design, and potential for fungal bioaugmentation/inoculation of bioretention to improve stormwater contaminant removal.

1. Introduction

To mitigate limitations from abiotic removal mechanisms, maximized contaminant sorption/filtration can be coupled with biotic uptake and transformation processes in microbes and plants. Plant selection is important for functioning bioretention cells; plants with deeper rooting zones can prevent clogging and improve infiltration. 16 Phosphorus and nitrogen removal can also improve in planted systems, though efficacy may depend on plant type. 17 Increasing evidence demonstrates plants can impact trace organic contaminant removal in bioretention. For example, plants can remove and transform benzotriazole hydroponically, 18 and benzotriazole phytotransformation products were detected in bench-scale bioretention mesocosms 19 as well as a field-scale tracer study. 20 In addition to plants, organisms such as bacteria and fungi also affect contaminant fate within bioretention cells. Bacteria have mostly been studied in context of nutrients within bioretention cells but have also demonstrated contributions to organic contaminant removal. One recent study suggests biochar-amended bioretention cells, with the contributions of active microbial biofilms, could extend the lifetime of the system by 1.8–2.3 times and enhance removal of trace contaminants like atrazine and neonicotinoids. 21

Although there is growing evidence on the roles of plants and bacteria within bioretention cells, fungi have received very limited consideration. Within these limited studies, mycorrhizal fungi have garnered the most attention, as they associate with and often provide benefits to plants such as nutrients and protection against pathogens. 22,23 Mycorrhizal fungi have been observed in bioretention cells, 24 and initial studies indicate functional improvements such as increased plant growth and stress resistance, improved nutrient uptake and heavy metal removal. 25–27 Beyond mycorrhizal fungi, fungal enzymes such as lignin peroxidases or laccases could enhance biotransformation of contaminants; indeed, many of these enzymes are not substrate-specific and have widely demonstrated the break-down of recalcitrant contaminants such as polychlorinated biphenyls, 28,29 carbamazepine, 30,31 and azo dyes. 32 Despite the potential of fungi to enhance bioretention function, we found very few studies reporting on fungal diversity within green stormwater infrastructure bioswales. These initial findings indicated fungal communities in bioswales (bioswales serve similar functions to bioretention cells) were distinct from park/tree pit soils, likely influenced by differences in micro-environment/habitat, 33 and that community differences were correlated with plant species differences. 34

Though emerging studies on microbial communities in green stormwater infrastructure provide important initial insights on fungal diversity, more work is needed to understand the role of fungi in green stormwater infrastructure, particularly bioretention cells. The single study probing the connection between plant species/physiology and microbial community differences focused on plant transpiration rates as the relevant physiological trait. Other traits, such as rooting depth, or non-plant variables such as runoff type ( e.g. , parking lot vs. road), remain unexplored. Abundance of relevant bacterial biotransformation genes such as denitrification genes, 35,36 bphA , 37 and naphthalene degradation genes 38,39 have been quantified within bioretention cells or mesocosms, but no studies have specifically investigated relevant fungal biotransformation genes such as laccases and peroxidases in bioretention cells. Thus, there is a paucity of information on fungal communities and function within bioretention cells and how fungi (native or bioaugmented) could enhance removal of dissolved organic contaminants. To address these knowledge gaps, we measured fungal diversity and key fungal functional gene abundance in 27 different bioretention cells in eastern Iowa (in the temperate Midwestern United States). The objective of this study was to determine driving factors of fungal diversity and functional gene abundance in representative Iowa bioretention cells. We hypothesized that bioretention plant rooting depth and soil characteristics would significantly influence fungal diversity and functional gene abundance.

2.1 Bioretention cell soil sampling

2.2 soil dna extraction and sequencing, 2.3 bioinformatics pipeline, 2.4 statistical analysis for sequencing data.

To test relationships between site characteristic variables and the beta diversity distance matrix, permutational multivariate analysis of variance (PERMANOVA) was used. PERMANOVA is a non-parametric method of hypothesis testing for multivariate variation that can be applied to distance matrices such as the rCLR-transformed counts table with the Aitchison distance. 52 The PERMANOVA test assumes sample “exchangeability” (data can be rearranged without changing the overall distribution), and random permutations of the data are compared to determine if the centroid of the random permutations are equivalent. The null hypothesis of the test states that there are no differences in the positioning of the centroids when comparing the random permutations of the data to the distance matrix. 46 PERMANOVA is considered a powerful statistical tool for microbiome and ecology data, as data do not need to be normally distributed. However, PERMANOVA can be affected by uneven/unbalanced sampling for groups (which applied for many of our site characteristics). To account for this, we conducted tests for homogeneity of multivariate dispersions (PERMDISP). 52 Soil characteristics, planting variables, and physical location were analyzed using PERMANOVA (adonis2 function in the R ‘vegan’ package). 53 Soil characteristics (continuous variables) and planting/site categorical variables were analyzed in separate PERMANOVA analyses.

2.5 Fungal functional gene analysis

3. results and discussion, 3.1 fungal diversity in bioretention cells.

Results from PERMANOVA analysis ( Table 1 ) provide insight into the variables that influence differences in community composition (beta diversity). Soil characteristics were not significant explanatory variables for beta diversity. Rather, a few key planting variables explained 89% of the variability in the distance matrix: the presence of native grasses ( R 2 = 0.43), the presence of forbs ( R 2 = 0.12), and the type of plant determined to be predominant in the cell ( R 2 = 0.24) based on sampling field notes and site images. Presence/absence of different plants were coded as Boolean variables. For example, if native grasses such as switchgrass ( Panicum virgatum ) were recorded as present within the cell and confirmed in pictures, presence of native grasses was coded as TRUE (otherwise coded as FALSE.) Presence/absence variables do not necessarily indicate the plant is dominant, but simply that it was growing within the cell. In contrast, the type of plant determined to be “dominant” in a cell was based on the abundance of the plant within a cell; for example, if the cell consisted of mostly Black-eyed Susan ( Rudbeckia fulgida ) and other forbs, the dominant planting type was classified as “forb”. Some cells had multiple planting types that could be considered dominant (Fig. S.2 † ) and were classified as “mixed”. PERMDISP analyses of each of the statistically significant PERMANOVA variables indicate homogeneous dispersions within groups for the presence of forbs variable and the dominant planting variable, but not for the native grasses nor legumes variables (Table S.11 and Fig. S.15–S.18 † for PERMDISP results). For the native grasses/legumes variables, the significant results from the PERMDISP indicate that effects from within group dispersion (variances) could be confounding the results from the PERMANOVA; in essence, although the PERMANOVA is not ‘invalid’ per se , caution must be taken when interpreting results. The statistically significant PERMDISP implicates that within-group variance is also driving some of the differences seen in beta diversity and not solely the effect from presence of native grasses. These within-group variances are further explored using other visualizations (Fig. S.19 and S.20 † ). For the other two variables, the results of the PERMDISP further confirm the appropriateness of the PERMANOVA test, and within-group variances do not need to be considered when evaluating effects on beta diversity from the variables.

Model variables Total R explained by the model p-Value
“Soils model” soil pH, OM (%), NO3, ammonium, Zn, Mn, Cu, P 0.14 0.927
“Plantings model” rooting depth, presence of different plant types (forbs**, weeds, turf grass, trees, legumes, native grass**, decorative cultivars, weeds), dominant planting*, city 0.89 0.003
“Geography model” city**, sampling location, runoff type (parking lot, roof, minor road, major road) 0.75 0.008

In the visualization of the PCA for the distance matrix, we colored points by the variables explaining the most variation: type of dominant planting, presence of native grasses, and presence of forbs ( Fig. 1 ). Legumes, while a statistically significant variable from the PERMANOVA model, were excluded from the PCA analysis due to the low R 2 (0.04). The PCA ordination explained 79% of the variance in PC1, and 21% of the variance in PC2. The high values of variance explained is not uncommon for this type of ordination in microbiome datasets. 43,51 When examining the PCA for the native grasses, we can consider the results from both the PERMANOVA and PERMDISP tests ( i.e. , visually check for differences in within-group variation). The cells lacking native grasses visually appear to have heterogenous dispersion (spread unevenly within the group); given the significance of other planting variables, we posit that other planting types within the cells with no native grasses could be influencing results, in addition to the effect of not having native grasses. Within the six sites with no native grasses present, there were three within the group that are turf grass cells, two that were dominated by legumes, and one that had mostly weeds but legumes present. These planting types are physiologically different, with turf grass having short rooting depths and less transpiration than legumes. These differences could explain the significant result of the PERMDISP for the presence of native grasses.

Principal component analysis plots using the rCLR-transformed data from DEICODE, colored by: (A) dominant planting type, (B) presence of native grasses, (C) presence of native forbs.

The cluster dendrogram ( Fig. 2 ) based on the beta diversity rCLR-transformed Aitchison distance matrix is another visualization method for relatedness between sampling locations. The two North Liberty cells appear to be distinct, exhibiting a high branch height and clustering separately from all other samples. These two cells were the only cells with legumes as the dominant planting. Given the small sample size for this dominant planting group, it is challenging to attribute the difference to just one variable and should be considered as an area for future research. In contrast to the legume-dominant cells, the cells sampled from Cedar Rapids do not all cluster together, despite having some similarities in plantings within those sampling locations. Another interesting separation on the dendrogram is the “cvcell14top” vs. “cvcell14bottom” sample. At this cell, we sampled at two different depths in the soil cores – one at the top 10 cm and one at the bottom 10 cm of the corer. Though the sample size for depth is limited, this preliminary indication of difference with depth suggests that soil depth could be another variable of interest.

Dendrogram showing beta diversity differences based on the rCLR-transformed data with Aitchison distances. Labels are each sampling site, with the first two letter abbreviations standing for city names (‘nl’ = North Liberty, ‘cv’ = Coralville, ‘cr’ = Cedar Rapids, ‘ic’ = Iowa City). The height and branching of the dendrogram indicate differences, where samples clustered on the same branch at “shorter” heights are more similar, and samples that have “high” branch heights are less similar.

Altogether, the strong contributions from the planting categorical variables strengthens the evidence from Gill et al. 33 who reported that differences in micro-environment drove community differences. In a separate study on influence of plant transpiration rates on fungal/bacterial diversity in bioswales, Brodsky et al. 34 found that plants with higher transpiration rates were associated with higher fungal and bacterial diversity. Though planting categorical variables appear to explain much of the variation in the data for our study, we cannot rule out the possibility that further variability could be explained by site-specific characteristics not considered here, such as redox conditions, presence of a saturated zone, size of the cell/runoff volume, specific pollutant loading, cell age, cell maintenance strategies/frequency, etc. We also note that certain fungi are likely excluded from this study based on the sampling design. Examples of excluded fungi are endomycorrhizal fungi and plant pathogens that would inhabit within the vegetation of the bioretention cells, because only soil samples were analyzed. Additionally, fungal community variation with depth was not studied, as soil samples were collected near the subsurface of the cell, excepting cell 14. This limitation could exclude fungi that can survive under low oxygen conditions. Thus, while subsurface soil diversity was studied, more studies may be needed to further understand fungal diversity in bioretention cells.

Plots of mean relative abundance data ( Fig. 3 , S.19 and 20 † ) provide another way to compare the fungal communities between our study and Gill et al. 's and insight into taxonomic information. One key difference is the abundance of Ascomycota (a diverse phylum with the largest number of fungal species); in Gill et al. 's study, Ascomycota represent over 50% of the fungal sequences in all sample types. 33 A different study on microbiomes of urban greenspaces (non-engineered soils) had 40–50% abundance for Ascomycota . 57 Surprisingly, Ascomycota abundance did not fall within the 40–50% range for all samples. Only forb- and turf-dominant cells exhibit close to 50% abundance of Ascomycota , with abundance at or below 30% in legume-, trees/shrubs-, and weed-dominant cells. Another unexpected result is the high percentage of Basidiomycota relative to Ascomycota within our samples, especially in the weed-dominant cells. Basidiomycota include a multitude of fungal taxa that fall on the “white/brown rot continuum”. 58 These fungal taxa could aid in contaminant removal within bioretention cells, as demonstrated in our previous work. 59

Relative abundance of fungal phyla for bioretention cells, grouped by the dominant planting type in each cell. Number of cells for each category is as follows: forb (n = 2), legume (n = 2), mixed (n = 4), native grass (n = 13), tree/shrub (n = 2), turf (n = 3), weed (n = 2).

The distribution of other phyla is also notable. Interestingly, the early-diverging phylum, Aphelidiomycota , was detected with a relative abundance around 2% in forb-, legume-, and native grass-dominant samples. Aphelidiomycota , once considered protists and/or a sister group to fungi, parasitize algae and diatoms and may have a role in leaf litter decomposition. 60,61 The majority of fungi with an arbuscular mycorrhizal lifestyle are found in the Glomeromycota phylum. Within our samples, Glomeromycota were scarce in most samples, except in cells with trees/shrubs (5% abundance), and were only detected at or above 1% in samples with dominant plantings of trees/shrubs, turf, and weeds. Surprisingly, the two cells with dominant legumes (the North Liberty cells that also grouped separately in the dendrogram) had negligible Glomeromycota relative abundance, despite evidence that legumes host higher arbuscular mycorrhizal diversity relative to some native grass species. 62 Indeed, the opposite was true within our results. Thus, there is a possible need for mycorrhizal inoculation within bioretention cells to enhance benefits from mycorrhizal fungi, as well as a need to further investigate the presence of mycorrhizal fungi in legume-dominant bioretention cells.

3.2 Functional gene quantification

(A) Aggregated quantified functional gene copies per gram dry soil quantified from all of the bioretention sites (n = 28, including the composite sample from the bottom 10 cm of the core). Each dot represents the mean of triplicate extractions from composited soil samples collected from each site. The black bars represent the median value for each functional gene for all sites. Each gene set was significantly different from the others (p < 0.0001; Cu1A > Cu1 > nirKf). (B) Functional gene copies separated by dominant planting type. Plot bars represent mean with standard error. The functional gene quantities were not different between dominant plant type (p > 0.8) for a given functional gene. Additional information on the sites and planting type designations is in the ESI.

Soil characteristics correlated with functional gene abundance in surprising ways ( Fig. 5 ). Despite the involvement of fungal nirK in the nitrogen cycle, neither soil nitrate nor soil ammonium was significantly or strongly correlated with nirKf gene abundance. Somewhat similarly, none of the functional gene abundances were correlated with soil copper content, despite both laccase and nirKf being copper-containing enzymes. All functional genes were significantly positively correlated ( p < 0.05) with soil pH and percent organic matter. Additionally, we found that percent organic matter (OM) and pH result in statistically significant differences in gene abundance for each functional gene (Fig. S.24; † note that OM/pH graphs would be equivalent and only one is depicted, as the samples group in the same manner for both variables). When we compared functional gene abundance for samples with percent OM less than the median to samples with percent OM greater than the median, we found statistically significant differences between the means, as indicated by the 95% confidence intervals excluding zero for all functional genes. We posit that the correlation between functional genes and percent organic matter as well as the differences in gene abundances with percent OM are likely because higher organic matter could result in a higher overall fungal richness. 66 Indeed, based on the correlation matrix, it appears that percent OM and pH are driving soil characteristics for fungal species carrying these functional genes. These findings in stormwater bioretention cells are consistent with related literature from the fields of mycology/soil science. For example, soil fungal communities in the natural environment are known to vary with depth and organic matter content, 67,68 with greater richness and diversity in the organic horizon 69 that decreases with depth. 70,71 Additionally, soil pH has been shown to affect fungal abundance and composition, e.g. , resulting in changes in nutrient cycling through stimulated heterotrophic nitrification. 72 Nevertheless, we caveat the interpretation of pH with the results in that pH values were generally circumneutral (mean of 7.4 ± 0.06); thus large deviations in soil pH are not expected, whereas percent organic matter can vary depending on the proportion of compost added to the geomedia. In the context of Spearman's correlations between functional gene abundance and site characteristic categorical variables were generally not statistically significant (Fig. S.25 † ); Cu1 was weakly negatively correlated with the presence of forbs ( p = 0.044, rho = −0.383) and nirKf was weakly positively correlated with smaller-scale local geography ( p = 0.042, rho = 0.388). Local geography was categorized based on within-city areas ( e.g. , “Iowa River Landing”) whereas geography was categorized based on city. It is possible that the fungi represented by the selected functional genes are widespread within bioretention cells, which could explain the weak correlation with planting variables and significant correlation with soil organic matter and pH.

Pearson's correlation matrix for soil physicochemical characteristics and functional gene abundance. Median functional gene abundances were calculated for each site and used in the analysis. Numerical values are correlation coefficients for statistically significant values (rho < 0.05).

Our work demonstrated the utility of measuring fungal functional genes in bioretention cells. Pairing functional gene abundance measurements with fungal sequencing allows for a quantitative metric of key processes along with knowledge of the fungal community. One challenge of using DNA extracts from soil is that there is a possibility for detection of “relic” DNA, or DNA from organisms no longer alive. 73 This could be potentially problematic if a study's goal is to capture a fully representative picture of bioretention function. An alternative could be RNA extraction, although this process has substantial methodological challenges. Nonetheless, qPCR is a useful method to understand how bioretention design parameters ( e.g. , amount of compost addition) enhances or reduces desirable biotic functions such as contaminant transformation. Future studies could investigate impacts from bioretention saturated zones, further optimize qPCR primer specificity, and target a wider range of fungal functional genes. Altogether, our study increases understanding of factors influencing fungal diversity and functional gene abundance in bioretention cells and revealed fruitful areas of further research.

In a broader context, there is only limited existing work investigating the role of microbial (and even less on fungal) diversity or functional genes and their role in green stormwater infrastructure (GSI). Several papers studying GSI practices in New York City have characterized microbial community diversity in vegetated swales, right-of-way bioswales, tree pits, and other practices. In one investigation, 37 bacterial communities in engineered soils were compositionally distinct from non-engineered soils/not-engineered site. Specifically, bioswales exhibited distinct, phylogenetically diverse communities, including taxa associated with nutrient cycling and contaminants biodegradation. Bioswale soils also had a significantly greater diversity of genes involved in several functional pathways, including carbon and nitrogen cycling, as well as for degradation of a diverse suite of contaminants ( e.g. , noxZ and amoA for N cycling; bphA and monooxygenases for contaminant biodegradation). GSI design had significant impacts on measured microbial variables. 74 Nevertheless, a subsequent NYC study indicated bacterial communities in GSI had levels of diversity similar to nonurban soils. 75 A study of microbial (bacteria, archaea) and fungal diversity of urban greenspaces in France 76 reported fungal richness increased in urban agriculture soils while bacterial richness was lower in public leisure areas. Similar to our findings, trace metals nor PAH contents in their study explained variations in microbial communities; organic carbon and C/N best predicted overall biomass. A study in the southwest US studying divergence of microbial communities in GSI soils concluded that diversifying vegetation can lead to positive feedback cycles between plants and microbes. 77 These positive feedback cycles can result in more resilient systems and higher carbon storage, strengthening evidence for the importance of vegetation–microbe relationships in GSI. Indeed, a recent study on greenroofs 78 in the Midwestern US reported greenroof fungal community composition was distinct between their nearby ground areas and between cities, positively correlated with plant cover. The overall paucity of work characterizing fungal (and even bacterial) communities in GSI practices represents a critical need—especially in the context of potential to bioremediation captured stormwater contaminants 37–39,59 —and warrants further investigation.

4. Conclusions

Our work also demonstrates potential for in situ fungal biotransformation within GSI practices. We noted the unexpectedly high representation of Basidiomycota within our samples, a fungal taxa that fall on the “white/brown rot continuum”. 58 These fungal taxa could aid in contaminant removal within bioretention cells, as demonstrated in our previous work 59 that white-rot fungi biodegrade toxic tire-wear compounds. White-rot fungi are able to biodegrade a wide suite of other recalcitrant organic contaminants that may be present in stormwater ( e.g. , some pesticides 80–82 ) and may otherwise accumulate in bioretention cells. Indeed, a recent study revealed that more recalcitrant, higher molecular weight PAHs were accumulating at the mid-depth below the surface of a bioretention cell where biodegradation may be limited, despite presence of PAH-ring cleaving dioxygenase bacterial genes at the surface. 15 This suggests potential utility for in situ fungal bioremediation in GSI, and indicates value in investigating fungal bioaugmentation/inoculation opportunities of native/beneficial bioremediating fungi. 83 Another area for further exploration would be the impact of bioretention geomedia amendments to fungal diversity in bioretention cells, as the addition of biochar can impact beta diversity of fungi due to alterations in soil characteristics. 84 With growing interest in synergizing coupled contaminant sorption to geomedia with subsequent biological degradation in stormwater systems, 21,83 probing the relationships between fungi and bioretention geomedia will be critical. Our novel reporting of fungal denitrification genes in bioretention systems also points towards a possible role of fungi for enhanced nutrient cycling in GSI; there is growing interest in using passive treatment systems for non-point nitrogen pollution control 85–88 ( e.g. , bioretention, woodchip bioreactors). Overall, our work points to great value in further understanding the fungal communities of GSI and potential for improved contaminant degradation.

Data availability

Conflicts of interest, acknowledgements.

  • G. LeFevre, K. Paus, P. Natarajan, J. Gulliver, P. Novak and R. Hozalski, Review of Dissolved Pollutants in Urban Storm Water and Their Removal and Fate in Bioretention Cells, J. Environ. Eng. , 2015, 141 (1), 4014050,  DOI: 10.1061/(ASCE)EE.1943-7870.0000876 .
  • S. Spahr, M. Teixidó, D. L. Sedlak and R. G. Luthy, Hydrophilic Trace Organic Contaminants in Urban Stormwater: Occurrence, Toxicological Relevance, and the Need to Enhance Green Stormwater Infrastructure, Environ. Sci.: Water Res. Technol. , 2020, 6 (1), 15–44,   10.1039/C9EW00674E .
  • J. R. Masoner, D. W. Kolpin, I. M. Cozzarelli, L. B. Barber, D. S. Burden, W. T. Foreman, K. J. Forshay, E. T. Furlong, J. F. Groves, M. L. Hladik, M. E. Hopton, J. B. Jaeschke, S. H. Keefe, D. P. Krabbenhoft, R. Lowrance, K. M. Romanok, D. L. Rus, W. R. Selbig, B. H. Williams and P. M. Bradley, Urban Stormwater: An Overlooked Pathway of Extensive Mixed Contaminants to Surface and Groundwaters in the United States, Environ. Sci. Technol. , 2019, 53 (17), 10070–10081,  DOI: 10.1021/acs.est.9b02867 .
  • B. Seiwert, P. Klöckner, S. Wagner and T. Reemtsma, Source-Related Smart Suspect Screening in the Aqueous Environment: Search for Tire-Derived Persistent and Mobile Trace Organic Contaminants in Surface Waters, Anal. Bioanal. Chem. , 2020, 412 (20), 4909–4919,  DOI: 10.1007/s00216-020-02653-1 .
  • W. F. Hunt, R. G. Traver, A. P. Davis, C. H. Emerson, K. A. Collins and J. H. Stagge, Low Impact Development Practices: Designing to Infiltrate in Urban Environments, in Effects of Urbanization on Groundwater , American Society of Civil Engineers, Reston, VA, 2010, pp. 308–343  Search PubMed .
  • A. P. Davis, W. F. Hunt, R. G. Traver and M. Clar, Bioretention Technology: Overview of Current Practice and Future Needs, J. Environ. Eng. , 2009, 135 (3), 109–117,  DOI: 10.1061/(ASCE)0733-9372(2009)135:3(109) .
  • J. E. Grebel, S. K. Mohanty, A. A. Torkelson, A. B. Boehm, C. P. Higgins, R. M. Maxwell, K. L. Nelson and D. L. Sedlak, Engineered Infiltration Systems for Urban Stormwater Reclamation, Environ. Eng. Sci. , 2013, 30 (8), 437–454,  DOI: 10.1089/ees.2012.0312 .
  • T. F. M. Rodgers, L. Wu, X. Gu, S. Spraakman, E. Passeport and M. L. Diamond, Stormwater Bioretention Cells Are Not an Effective Treatment for Persistent and Mobile Organic Compounds (PMOCs), Environ. Sci. Technol. , 2022, 56 (10), 6349–6359,  DOI: 10.1021/acs.est.1c07555 .
  • K. Zhang, A. Randelovic, D. Page, D. T. McCarthy and A. Deletic, The Validation of Stormwater Biofilters for Micropollutant Removal Using in Situ Challenge Tests, Ecol. Eng. , 2014, 67 , 1–10,  DOI: 10.1016/j.ecoleng.2014.03.004 .
  • T. F. M. Rodgers, S. Spraakman, Y. Wang, C. Johannessen, R. C. Scholes and A. Giang, Bioretention Design Modifications Increase the Simulated Capture of Hydrophobic and Hydrophilic Trace Organic Compounds, Environ. Sci. Technol. , 2024, 58 (12), 5500–5511,  DOI: 10.1021/acs.est.3c10375 .
  • S. K. Mohanty, R. Valenca, A. W. Berger, I. K. M. Yu, X. Xiong, T. M. Saunders and D. C. W. Tsang, Plenty of Room for Carbon on the Ground: Potential Applications of Biochar for Stormwater Treatment, Sci. Total Environ. , 2018, 625 , 1644–1658,  DOI: 10.1016/j.scitotenv.2018.01.037 .
  • R. A. Tirpak, A. N. Afrooz, R. J. Winston, R. Valenca, K. Schiff and S. K. Mohanty, Conventional and Amended Bioretention Soil Media for Targeted Pollutant Treatment: A Critical Review to Guide the State of the Practice, Water Res. , 2021, 189 , 116648,  DOI: 10.1016/j.watres.2020.116648 .
  • S. Spahr, M. Teixidó, S. S. Gall, J. C. Pritchard, N. Hagemann, B. Helmreich and R. G. Luthy, Performance of Biochars for the Elimination of Trace Organic Contaminants and Metals from Urban Stormwater, Environ. Sci.: Water Res. Technol. , 2022, 8 (6), 1287–1299,   10.1039/D1EW00857A .
  • B. A. Ulrich, M. Loehnert and C. P. Higgins, Improved Contaminant Removal in Vegetated Stormwater Biofilters Amended with Biochar, Environ. Sci.: Water Res. Technol. , 2017, 3 (4), 726–734,   10.1039/C7EW00070G .
  • C. Yuan, A. P. Davis, D. Kaya and B. V. Kjellerup, Distribution and Biodegradation Potential of Polycyclic Aromatic Hydrocarbons (PAHs) Accumulated in Media of a Stormwater Bioretention, Chemosphere , 2023, 336 , 139188,  DOI: 10.1016/j.chemosphere.2023.139188 .
  • A. Skorobogatov, J. He, A. Chu, C. Valeo and B. van Duin, The Impact of Media, Plants and Their Interactions on Bioretention Performance: A Review, Sci. Total Environ. , 2020, 715 , 136918,  DOI: 10.1016/j.scitotenv.2020.136918 .
  • H. S. Fowdar, B. E. Hatt, T. Cresswell, J. J. Harrison, P. L. M. Cook and A. Deletic, Phosphorus Fate and Dynamics in Greywater Biofiltration Systems, Environ. Sci. Technol. , 2017, 51 (4), 2280–2287,  DOI: 10.1021/acs.est.6b04181 .
  • G. H. LeFevre, C. E. Müller, R. J. Li, R. G. Luthy and E. S. Sattely, Rapid Phytotransformation of Benzotriazole Generates Synthetic Tryptophan and Auxin Analogs in Arabidopsis, Environ. Sci. Technol. , 2015, 49 (18), 10959–10968,  DOI: 10.1021/acs.est.5b02749 .
  • J. Pritchard, Y.-M. Cho, N. Ashoori, J. Wolfand, J. Sutton, M. Carolan, E. Gamez, K. Doan, J. Wiley and R. Luthy, Benzotriazole Uptake and Removal in Vegetated Biofilter Mesocosms Planted with Carex Praegracilis, Water , 2018, 10 (11), 1605,  DOI: 10.3390/w10111605 .
  • X. Gu, T. F. M. Rodgers, S. Spraakman, T. Van Seters, R. Flick, M. L. Diamond, J. Drake and E. Passeport, Trace Organic Contaminant Transfer and Transformation in Bioretention Cells: A Field Tracer Test with Benzotriazole, Environ. Sci. Technol. , 2021, 55 (18), 12281–12290,  DOI: 10.1021/acs.est.1c01062 .
  • A. C. Portmann, G. H. LeFevre, R. Hankawa, D. Werner and C. P. Higgins, The Regenerative Role of Biofilm in the Removal of Pesticides from Stormwater in Biochar-Amended Biofilters, Environ. Sci.: Water Res. Technol. , 2022, 8 (5), 1092–1110,   10.1039/D1EW00870F .
  • L. Tedersoo, M. Bahram and M. Zobel, How Mycorrhizal Associations Drive Plant Population and Community Biology, Science , 2020, 367 (6480)  DOI: 10.1126/science.aba1223 .
  • A. Berruti, E. Lumini, R. Balestrini and V. Bianciotto, Arbuscular Mycorrhizal Fungi as Natural Biofertilizers: Let's Benefit from Past Successes, Front. Microbiol. , 2016, 6 (JAN), 1559,  DOI: 10.3389/fmicb.2015.01559 .
  • B. K. Winfrey, B. E. Hatt and R. F. Ambrose, Arbuscular Mycorrhizal Fungi in Australian Stormwater Biofilters, Ecol. Eng. , 2017, 102 , 483–489,  DOI: 10.1016/j.ecoleng.2017.02.041 .
  • C. Poor, C. Balmes, M. Freudenthaler and A. Martinez, Role of Mycelium in Bioretention Systems: Evaluation of Nutrient and Metal Retention in Mycorrhizae-Inoculated Mesocosms, J. Environ. Eng. , 2018, 144 (6), 04018034,  DOI: 10.1061/(ASCE)EE.1943-7870.0001373 .
  • Y. M. Palacios, R. Gleadow, C. Davidson, W. Gan and B. Winfrey, Do Mycorrhizae Increase Plant Growth and Pollutant Removal in Stormwater Biofilters?, Water Res. , 2021, 202 , 117381,  DOI: 10.1016/j.watres.2021.117381 .
  • Y. M. Palacios and B. K. Winfrey, Three Mechanisms of Mycorrhizae That May Improve Stormwater Biofilter Performance, Ecol. Eng. , 2021, 159 , 106085,  DOI: 10.1016/j.ecoleng.2020.106085 .
  • I. Kamei, S. Sonoki, K. Haraguchi and R. Kondo, Fungal Bioconversion of Toxic Polychlorinated Biphenyls by White-Rot Fungus, Phlebia Brevispora, Appl. Microbiol. Biotechnol. , 2006, 73 (4), 932–940,  DOI: 10.1007/s00253-006-0529-9 .
  • Č. Novotný, B. R. M. Vyas, P. Erbanová, A. Kubátová and V. Šašek, Removal of PCBs by Various White Rot Fungi in Liquid Cultures, Folia Microbiol. , 1997, 42 (2), 136–140,  DOI: 10.1007/BF02898723 .
  • N. Golan-Rozen, B. Chefetz, J. Ben-Ari, J. Geva and Y. Hadar, Transformation of the Recalcitrant Pharmaceutical Compound Carbamazepine by Pleurotus Ostreatus: Role of Cytochrome P450 Monooxygenase and Manganese Peroxidase, Environ. Sci. Technol. , 2011, 45 (16), 6800–6805,  DOI: 10.1021/es200298t .
  • N. Golan-Rozen, B. Seiwert, C. Riemenschneider, T. Reemtsma, B. Chefetz and Y. Hadar, Transformation Pathways of the Recalcitrant Pharmaceutical Compound Carbamazepine by the White-Rot Fungus Pleurotus Ostreatus: Effects of Growth Conditions, Environ. Sci. Technol. , 2015, 49 (20), 12351–12362,  DOI: 10.1021/acs.est.5b02222 .
  • Y.-C. Toh, J. J. L. Yen, J. P. Obbard and Y.-P. Ting, Decolourisation of Azo Dyes by White-Rot Fungi (WRF) Isolated in Singapore, Enzyme Microb. Technol. , 2003, 33 (5), 569–575,  DOI: 10.1016/S0141-0229(03)00177-7 .
  • A. S. Gill, K. Purnell, M. I. Palmer, J. Stein and K. L. McGuire, Microbial Composition and Functional Diversity Differ Across Urban Green Infrastructure Types, Front. Microbiol. , 2020, 11 , 912,  DOI: 10.3389/fmicb.2020.00912 .
  • O. L. Brodsky, K. L. Shek, D. Dinwiddie, S. G. Bruner, A. S. Gill, J. M. Hoch, M. I. Palmer and K. L. McGuire, Microbial Communities in Bioswale Soils and Their Relationships to Soil Properties, Plant Species, and Plant Physiology, Front. Microbiol. , 2019, 10 , 2368,  DOI: 10.3389/fmicb.2019.02368 .
  • L. J. Waller, G. K. Evanylo, L.-A. H. Krometis, M. S. Strickland, T. Wynn-Thompson and B. D. Badgley, Engineered and Environmental Controls of Microbial Denitrification in Established Bioretention Cells, Environ. Sci. Technol. , 2018, 52 (9), 5358–5366,  DOI: 10.1021/acs.est.7b06704 .
  • X. Chen, E. Peltier, B. S. M. M. Sturm and C. B. Young, Nitrogen Removal and Nitrifying and Denitrifying Bacteria Quantification in a Stormwater Bioretention System, Water Res. , 2013, 47 (4), 1691–1700,  DOI: 10.1016/j.watres.2012.12.033 .
  • A. S. Gill, A. Lee and K. L. McGuire, Phylogenetic and Functional Diversity of Total (DNA) and Expressed (RNA) Bacterial Communities in Urban Green Infrastructure Bioswale Soils, Appl. Environ. Microbiol. , 2017, 83 (16)  DOI: 10.1128/AEM.00287-17 .
  • G. H. LeFevre, R. M. Hozalski and P. J. Novak, The Role of Biodegradation in Limiting the Accumulation of Petroleum Hydrocarbons in Raingarden Soils, Water Res. , 2012, 46 (20), 6753–6762,  DOI: 10.1016/j.watres.2011.12.040 .
  • G. H. LeFevre, P. J. Novak and R. M. Hozalski, Fate of Naphthalene in Laboratory-Scale Bioretention Cells: Implications for Sustainable Stormwater Management, Environ. Sci. Technol. , 2012, 46 (2), 995–1002,  DOI: 10.1021/es202266z .
  • K. Ihrmark, I. T. M. Bödeker, K. Cruz-Martinez, H. Friberg, A. Kubartova, J. Schenck, Y. Strid, J. Stenlid, M. Brandström-Durling, K. E. Clemmensen and B. D. Lindahl, New Primers to Amplify the Fungal ITS2 Region - Evaluation by 454-Sequencing of Artificial and Natural Communities, FEMS Microbiol. Ecol. , 2012, 82 (3), 666–677,  DOI: 10.1111/j.1574-6941.2012.01437.x .
  • T. J. White, T. Bruns, S. Lee and J. Taylor, Amplification and Direct Sequencing of Fungal Ribosomal RNA Genes for Phylogenetics, in PCR Protocols , Elsevier, 1990, pp. 315–322  Search PubMed .
  • S. Anslan, M. Bahram, I. Hiiesalu and L. Tedersoo, PipeCraft: Flexible Open-Source Toolkit for Bioinformatics Analysis of Custom High-Throughput Amplicon Sequencing Data, Mol. Ecol. Resour. , 2017, 17 (6), e234–e240,  DOI: 10.1111/1755-0998.12692 .
  • L. Tedersoo, M. Bahram, L. Zinger, R. H. Nilsson, P. G. Kennedy, T. Yang, S. Anslan and V. Mikryukov, Best Practices in Metabarcoding of Fungi: From Experimental Design to Results, Mol. Ecol. , 2022, 31 (10), 2769–2795,  DOI: 10.1111/mec.16460 .
  • L. Joos, S. Beirinckx, A. Haegeman, J. Debode, B. Vandecasteele, S. Baeyen, S. Goormachtig, L. Clement and C. De Tender, Daring to Be Differential: Metabarcoding Analysis of Soil and Plant-Related Microbial Communities Using Amplicon Sequence Variants and Operational Taxonomical Units, BMC Genomics , 2020, 21 (1), 733,  DOI: 10.1186/s12864-020-07126-4 .
  • T. Rognes, T. Flouri, B. Nichols, C. Quince and F. Mahé, VSEARCH: A Versatile Open Source Tool for Metagenomics, PeerJ , 2016, 4 (10), e2584,  DOI: 10.7717/peerj.2584 .
  • J. Bengtsson-Palme, M. Ryberg, M. Hartmann, S. Branco, Z. Wang, A. Godhe, P. De Wit, M. Sánchez-García, I. Ebersberger, F. de Sousa, A. S. Amend, A. Jumpponen, M. Unterseher, E. Kristiansson, K. Abarenkov, Y. J. K. Bertrand, K. Sanli, K. M. Eriksson, U. Vik, V. Veldre and R. H. Nilsson, Improved Software Detection and Extraction of ITS1 and ITS2 from Ribosomal ITS Sequences of Fungi and Other Eukaryotes for Analysis of Environmental Sequencing Data, Methods Ecol. Evol. , 2013, 4 (10), 914–919,  DOI: 10.1111/2041-210X.12073 .
  • T. G. Frøslev, R. Kjøller, H. H. Bruun, R. Ejrnæs, A. K. Brunbjerg, C. Pietroni and A. J. Hansen, Algorithm for Post-Clustering Curation of DNA Amplicon Data Yields Reliable Biodiversity Estimates, Nat. Commun. , 2017, 8 (1), 1188,  DOI: 10.1038/s41467-017-01312-x .
  • R. C. Edgar, UNCROSS2: Identification of Cross-Talk in 16S RRNA OTU Tables, bioRxiv , 2018, p. 400762,  DOI: 10.1101/400762 .
  • K. Abarenkov, A. Zirk, T. Piirmann, R. Pöhönen, F. Ivanov, R. H. Nilsson and U. Kõljalg, UNITE General FASTA Release for Fungi , UNITE community, 2023,  DOI: 10.15156/BIO/2938067 .
  • L. Tedersoo, S. Sánchez-Ramírez, U. Kõljalg, M. Bahram, M. Döring, D. Schigel, T. May, M. Ryberg and K. Abarenkov, High-Level Classification of the Fungi and a Tool for Evolutionary Ecological Analyses, Fungal Diversity , 2018, 90 (1), 135–159,  DOI: 10.1007/s13225-018-0401-0 .
  • C. Martino, J. T. Morton, C. A. Marotz, L. R. Thompson, A. Tripathi, R. Knight and K. Zengler, A Novel Sparse Compositional Technique Reveals Microbial Perturbations, mSystems , 2019, 4 (1), e00016–e00019,  DOI: 10.1128/mSystems.00016-19 .
  • M. J. Anderson, Permutational Multivariate Analysis of Variance (PERMANOVA), in Wiley StatsRef: Statistics Reference Online , Wiley, 2017, pp. 1–15  Search PubMed .
  • J. Oksanen, G. L. Simpson, F. G. Blanchet, R. Kindt, P. Legendre, P. R. Minchin, R. B. O'Hara, P. Solymos, M. H. H. Stevens, E. Szoecs, H. Wagner, M. Barbour, M. Bedward, B. Bolker, D. Borcard, G. Carvalho, M. Chirico, M. De Caceres, S. Durand, H. B. A. Evangelista, R. FitzJohn, M. Friendly, B. Furneaux, G. Hannigan, M. O. Hill, L. Lahti, D. McGlinn, M.-H. Ouellette, E. Ribeiro Cunha, T. Smith, A. Stier, C. J. F. Ter Braak and J. Weedon, Vegan: Community Ecology Package , 2022  Search PubMed .
  • P. Luis, G. Walther, H. Kellner, F. Martin and F. Buscot, Diversity of Laccase Genes from Basidiomycetes in a Forest Soil, Soil Biol. Biochem. , 2004, 36 (7), 1025–1036,  DOI: 10.1016/j.soilbio.2004.02.017 .
  • H. Kellner, P. Luis and F. Buscot, Diversity of Laccase-like Multicopper Oxidase Genes in Morchellaceae: Identification of Genes Potentially Involved in Extracellular Activities Related to Plant Litter Decay, FEMS Microbiol. Ecol. , 2007, 61 (1), 153–163,  DOI: 10.1111/j.1574-6941.2007.00322.x .
  • W. Wei, K. Isobe, Y. Shiratori, T. Nishizawa, N. Ohte, Y. Ise, S. Otsuka and K. Senoo, Development of PCR Primers Targeting Fungal NirK to Study Fungal Denitrification in the Environment, Soil Biol. Biochem. , 2015, 81 , 282–286,  DOI: 10.1016/j.soilbio.2014.11.026 .
  • M. Delgado-Baquerizo, D. J. Eldridge, Y.-R. Liu, B. Sokoya, J.-T. Wang, H.-W. Hu, J.-Z. He, F. Bastida, J. L. Moreno, A. R. Bamigboye, J. L. Blanco-Pastor, C. Cano-Díaz, J. G. Illán, T. P. Makhalanyane, C. Siebe, P. Trivedi, E. Zaady, J. P. Verma, L. Wang, J. Wang, T. Grebenc, G. F. Peñaloza-Bojacá, T. U. Nahberger, A. L. Teixido, X.-Q. Zhou, M. Berdugo, J. Duran, A. Rodríguez, X. Zhou, F. Alfaro, S. Abades, C. Plaza, A. Rey, B. K. Singh, L. Tedersoo and N. Fierer, Global Homogenization of the Structure and Function in the Soil Microbiome of Urban Greenspaces, Sci. Adv. , 2021, 7 (28), eabg5809,  DOI: 10.1126/sciadv.abg5809 .
  • R. Riley, A. A. Salamov, D. W. Brown, L. G. Nagy, D. Floudas, B. W. Held, A. Levasseur, V. Lombard, E. Morin, R. Otillar, E. A. Lindquist, H. Sun, K. M. LaButti, J. Schmutz, D. Jabbour, H. Luo, S. E. Baker, A. G. Pisabarro, J. D. Walton, R. A. Blanchette, B. Henrissat, F. Martin, D. Cullen, D. S. Hibbett and I. V. Grigoriev, Extensive Sampling of Basidiomycete Genomes Demonstrates Inadequacy of the White-Rot/Brown-Rot Paradigm for Wood Decay Fungi, Proc. Natl. Acad. Sci. U. S. A. , 2014, 111 (27), 9923–9928,  DOI: 10.1073/pnas.1400592111 .
  • E. A. Wiener and G. H. LeFevre, White Rot Fungi Produce Novel Tire Wear Compound Metabolites and Reveal Underappreciated Amino Acid Conjugation Pathways, Environ. Sci. Technol. Lett. , 2022, 9 (5), 391–399,  DOI: 10.1021/acs.estlett.2c00114 .
  • P. M. Letcher and M. J. Powell, A Taxonomic Summary of Aphelidiaceae, IMA Fungus , 2019, 10 (1), 4,  DOI: 10.1186/s43008-019-0005-7 .
  • J. Li and C. Yang, Inconsistent Response of Soil Bacterial and Fungal Communities in Aggregates to Litter Decomposition during Short-Term Incubation, PeerJ , 2019, 2019 (11), e8078,  DOI: 10.7717/peerj.8078 .
  • A. H. Eom, D. C. Hartnett and G. W. T. Wilson, Host Plant Species Effects on Arbuscular Mycorrhizal Fungal Communities in Tallgrass Prairie, Oecologia , 2000, 122 (3), 435–444,  DOI: 10.1007/s004420050050 .
  • H. Xu, R. Sheng, X. Xing, W. Zhang, H. Hou, Y. Liu, H. Qin, C. Chen and W. Wei, Characterization of Fungal NirK-Containing Communities and N2O Emission From Fungal Denitrification in Arable Soils, Front. Microbiol. , 2019, 10 (FEB), 117,  DOI: 10.3389/fmicb.2019.00117 .
  • Y. Lin, G. Ye, J. Luo, H. J. Di, S. Lindsey, J. Fan, D. Liu and W. Ding, Long-Term Organic Fertilization Regulates the Abundance of Major Nitrogen-Cycling-Related Genes in Aggregates from an Acidic Ultisol, Appl. Soil Ecol. , 2021, 165 , 104014,  DOI: 10.1016/j.apsoil.2021.104014 .
  • Y. Ma, J. L. Zilles and A. D. Kent, An Evaluation of Primers for Detecting Denitrifiers via Their Functional Genes, Environ. Microbiol. , 2019, 21 (4), 1196–1210,  DOI: 10.1111/1462-2920.14555 .
  • M. del M. Montiel-Rozas, Á. López-García, P. Madejón and E. Madejón, Native Soil Organic Matter as a Decisive Factor to Determine the Arbuscular Mycorrhizal Fungal Community Structure in Contaminated Soils, Biol. Fertil. Soils , 2017, 53 (3), 327–338,  DOI: 10.1007/s00374-017-1181-5 .
  • J. P. van Leeuwen, I. Djukic, J. Bloem, T. Lehtinen, L. Hemerik, P. C. de Ruiter and G. J. Lair, Effects of Land Use on Soil Microbial Biomass, Activity and Community Structure at Different Soil Depths in the Danube Floodplain, Eur. J. Soil Biol. , 2017, 79 , 14–20,  DOI: 10.1016/j.ejsobi.2017.02.001 .
  • F. Ekelund, R. Rønn and S. Christensen, Distribution with Depth of Protozoa, Bacteria and Fungi in Soil Profiles from Three Danish Forest Sites, Soil Biol. Biochem. , 2001, 33 (4), 475–481,  DOI: 10.1016/S0038-0717(00)00188-7 .
  • X. Luo, K. Liu, Y. Shen, G. Yao, W. Yang, P. E. Mortimer and H. Gui, Fungal Community Composition and Diversity Vary With Soil Horizons in a Subtropical Forest, Front. Microbiol. , 2021, 12  DOI: 10.3389/fmicb.2021.650440 .
  • C. Du, Z. Geng, Q. Wang, T. Zhang, W. He, L. Hou and Y. Wang, Variations in Bacterial and Fungal Communities through Soil Depth Profiles in a Betula Albosinensis Forest, J. Microbiol. , 2017, 55 (9), 684–693,  DOI: 10.1007/s12275-017-6466-8 .
  • T. Yang, J. M. Adams, Y. Shi, H. Sun, L. Cheng, Y. Zhang and H. Chu, Fungal Community Assemblages in a High Elevation Desert Environment: Absence of Dispersal Limitation and Edaphic Effects in Surface Soil, Soil Biol. Biochem. , 2017, 115 , 393–402,  DOI: 10.1016/j.soilbio.2017.09.013 .
  • Y. Zhang, S. Dai, X. Huang, Y. Zhao, J. Zhao, Y. Cheng, Z. Cai and J. Zhang, PH-Induced Changes in Fungal Abundance and Composition Affects Soil Heterotrophic Nitrification after 30 Days of Artificial PH Manipulation, Geoderma , 2020, 366 , 114255,  DOI: 10.1016/j.geoderma.2020.114255 .
  • P. Carini, P. J. Marsden, J. W. Leff, E. E. Morgan, M. S. Strickland and N. Fierer, Relic DNA Is Abundant in Soil and Obscures Estimates of Soil Microbial Diversity, Nat. Microbiol. , 2017, 2 (3), 16242,  DOI: 10.1038/nmicrobiol.2016.242 .
  • M. Deeb, P. M. Groffman, J. L. Joyner, G. Lozefski, A. Paltseva, B. Lin, K. Mania, D. L. Cao, J. McLaughlin, T. Muth, B. Prithiviraj, J. Kerwin and Z. Cheng, Soil and Microbial Properties of Green Infrastructure Stormwater Management Systems, Ecol. Eng. , 2018, 125 , 68–75,  DOI: 10.1016/j.ecoleng.2018.10.017 .
  • J. L. Joyner, J. Kerwin, M. Deeb, G. Lozefski, B. Prithiviraj, A. Paltseva, J. McLaughlin, P. Groffman, Z. Cheng and T. R. Muth, Green Infrastructure Design Influences Communities of Urban Soil Bacteria, Front. Microbiol. , 2019  DOI: 10.3389/fmicb.2019.00982 .
  • A. Christel, S. Dequiedt, N. Chemidlin-Prevost-Bouré, F. Mercier, J. Tripied, G. Comment, C. Djemiel, L. Bargeot, E. Matagne, A. Fougeron, J.-B. Mina Passi, L. Ranjard and P.-A. Maron, Urban Land Uses Shape Soil Microbial Abundance and Diversity, Sci. Total Environ. , 2023, 883 , 163455,  DOI: 10.1016/j.scitotenv.2023.163455 .
  • V. Buzzard, J. Gil-Loaiza, N. Graf Grachet, H. Talkington, C. Youngerman, M. M. Tfaily and L. K. Meredith, Green Infrastructure Influences Soil Health: Biological Divergence One Year after Installation, Sci. Total Environ. , 2021, 801 , 149644,  DOI: 10.1016/j.scitotenv.2021.149644 .
  • A. G. Droz, R. R. Coffman, A. C. Eagar and C. B. Blackwood, Drivers of Fungal Diversity and Community Biogeography Differ between Green Roofs and Adjacent Ground-Level Green Space, Environ. Microbiol. , 2022, 24 (12), 5809–5824,  DOI: 10.1111/1462-2920.16190 .
  • C. P. Muerdter, C. K. Wong and G. H. LeFevre, Emerging Investigator Series: The Role of Vegetation in Bioretention for Stormwater Treatment in the Built Environment: Pollutant Removal, Hydrologic Function, and Ancillary Benefits, Environ. Sci.: Water Res. Technol. , 2018, 4 (5), 592–612,   10.1039/c7ew00511c .
  • J. M. Wolfand, C. Seller, C. D. Bell, Y.-M. Cho, K. Oetjen, T. S. Hogue and R. G. Luthy, Occurrence of Urban-Use Pesticides and Management with Enhanced Stormwater Control Measures at the Watershed Scale, Environ. Sci. Technol. , 2019, 53 (7), 3634–3644,  DOI: 10.1021/acs.est.8b05833 .
  • J. M. Wolfand, G. H. LeFevre and R. G. Luthy, Metabolization and Degradation Kinetics of the Urban-Use Pesticide Fipronil by White Rot Fungus Trametes Versicolor, Environ. Sci.: Processes Impacts , 2016, 18 (10), 1256–1265,   10.1039/C6EM00344C .
  • L. M. M. Weaver, N. L. Alexander, M. A. Cubeta, D. R. U. Knappe and T. N. Aziz, Degradation of Imidacloprid by Phanerodontia Chrysosporium on Wood Chips for Stormwater Treatment, Environ. Sci.: Water Res. Technol. , 2023, 9 (12), 3333–3343,   10.1039/D3EW00545C .
  • D. S. Tanmoy and G. H. LeFevre, Development of Composite Alginate Bead Media with Encapsulated Sorptive Materials and Microorganisms to Bioaugment Green Stormwater Infrastructure, Environ. Sci.: Water Res. Technol. , 2024, 10 (8), 1890–1907,   10.1039/D4EW00289J .
  • Q. Yao, J. Liu, Z. Yu, Y. Li, J. Jin, X. Liu and G. Wang, Three Years of Biochar Amendment Alters Soil Physiochemical Properties and Fungal Community Composition in a Black Soil of Northeast China, Soil Biol. Biochem. , 2017, 110 , 56–67,  DOI: 10.1016/j.soilbio.2017.03.005 .
  • B. J. Halaburka, G. H. LeFevre and R. G. Luthy, Quantifying the Temperature Dependence of Nitrate Reduction in Woodchip Bioreactors: Experimental and Modeled Results with Applied Case-Study, Environ. Sci.: Water Res. Technol. , 2019, 5 (4), 782–797,   10.1039/C8EW00848E .
  • B. J. Halaburka, G. H. LeFevre and R. G. Luthy, Evaluation of Mechanistic Models for Nitrate Removal in Woodchip Bioreactors, Environ. Sci. Technol. , 2017, 51 (9), 5156–5164,  DOI: 10.1021/acs.est.7b01025 .
  • J. Li, T. B. Culver, P. P. Persaud and J. M. Hathaway, Developing Nitrogen Removal Models for Stormwater Bioretention Systems, Water Res. , 2023, 120381,  DOI: 10.1016/j.watres.2023.120381 .
  • E. V. Lopez-Ponnada, T. J. Lynn, S. J. Ergas and J. R. Mihelcic, Long-Term Field Performance of a Conventional and Modified Bioretention System for Removing Dissolved Nitrogen Species in Stormwater Runoff, Water Res. , 2020, 170 , 115336,  DOI: 10.1016/j.watres.2019.115336 .
Electronic supplementary information (ESI) available. See DOI:
  • Search Menu

Sign in through your institution

  • Browse content in Arts and Humanities
  • Browse content in Archaeology
  • Anglo-Saxon and Medieval Archaeology
  • Archaeological Methodology and Techniques
  • Archaeology by Region
  • Archaeology of Religion
  • Archaeology of Trade and Exchange
  • Biblical Archaeology
  • Contemporary and Public Archaeology
  • Environmental Archaeology
  • Historical Archaeology
  • History and Theory of Archaeology
  • Industrial Archaeology
  • Landscape Archaeology
  • Mortuary Archaeology
  • Prehistoric Archaeology
  • Underwater Archaeology
  • Zooarchaeology
  • Browse content in Architecture
  • Architectural Structure and Design
  • History of Architecture
  • Residential and Domestic Buildings
  • Theory of Architecture
  • Browse content in Art
  • Art Subjects and Themes
  • History of Art
  • Industrial and Commercial Art
  • Theory of Art
  • Biographical Studies
  • Byzantine Studies
  • Browse content in Classical Studies
  • Classical History
  • Classical Philosophy
  • Classical Mythology
  • Classical Numismatics
  • Classical Literature
  • Classical Reception
  • Classical Art and Architecture
  • Classical Oratory and Rhetoric
  • Greek and Roman Papyrology
  • Greek and Roman Epigraphy
  • Greek and Roman Law
  • Greek and Roman Archaeology
  • Late Antiquity
  • Religion in the Ancient World
  • Social History
  • Digital Humanities
  • Browse content in History
  • Colonialism and Imperialism
  • Diplomatic History
  • Environmental History
  • Genealogy, Heraldry, Names, and Honours
  • Genocide and Ethnic Cleansing
  • Historical Geography
  • History by Period
  • History of Emotions
  • History of Agriculture
  • History of Education
  • History of Gender and Sexuality
  • Industrial History
  • Intellectual History
  • International History
  • Labour History
  • Legal and Constitutional History
  • Local and Family History
  • Maritime History
  • Military History
  • National Liberation and Post-Colonialism
  • Oral History
  • Political History
  • Public History
  • Regional and National History
  • Revolutions and Rebellions
  • Slavery and Abolition of Slavery
  • Social and Cultural History
  • Theory, Methods, and Historiography
  • Urban History
  • World History
  • Browse content in Language Teaching and Learning
  • Language Learning (Specific Skills)
  • Language Teaching Theory and Methods
  • Browse content in Linguistics
  • Applied Linguistics
  • Cognitive Linguistics
  • Computational Linguistics
  • Forensic Linguistics
  • Grammar, Syntax and Morphology
  • Historical and Diachronic Linguistics
  • History of English
  • Language Evolution
  • Language Reference
  • Language Acquisition
  • Language Variation
  • Language Families
  • Lexicography
  • Linguistic Anthropology
  • Linguistic Theories
  • Linguistic Typology
  • Phonetics and Phonology
  • Psycholinguistics
  • Sociolinguistics
  • Translation and Interpretation
  • Writing Systems
  • Browse content in Literature
  • Bibliography
  • Children's Literature Studies
  • Literary Studies (American)
  • Literary Studies (Romanticism)
  • Literary Studies (European)
  • Literary Studies (Eco-criticism)
  • Literary Studies (Asian)
  • Literary Studies (Modernism)
  • Literary Studies - World
  • Literary Studies (1500 to 1800)
  • Literary Studies (19th Century)
  • Literary Studies (20th Century onwards)
  • Literary Studies (African American Literature)
  • Literary Studies (British and Irish)
  • Literary Studies (Early and Medieval)
  • Literary Studies (Fiction, Novelists, and Prose Writers)
  • Literary Studies (Gender Studies)
  • Literary Studies (Graphic Novels)
  • Literary Studies (History of the Book)
  • Literary Studies (Plays and Playwrights)
  • Literary Studies (Poetry and Poets)
  • Literary Studies (Postcolonial Literature)
  • Literary Studies (Queer Studies)
  • Literary Studies (Science Fiction)
  • Literary Studies (Travel Literature)
  • Literary Studies (War Literature)
  • Literary Studies (Women's Writing)
  • Literary Theory and Cultural Studies
  • Mythology and Folklore
  • Shakespeare Studies and Criticism
  • Browse content in Media Studies
  • Browse content in Music
  • Applied Music
  • Dance and Music
  • Ethics in Music
  • Ethnomusicology
  • Gender and Sexuality in Music
  • Medicine and Music
  • Music Cultures
  • Music and Media
  • Music and Religion
  • Music and Culture
  • Music Education and Pedagogy
  • Music Theory and Analysis
  • Musical Scores, Lyrics, and Libretti
  • Musical Structures, Styles, and Techniques
  • Musicology and Music History
  • Performance Practice and Studies
  • Race and Ethnicity in Music
  • Sound Studies
  • Browse content in Performing Arts
  • Browse content in Philosophy
  • Aesthetics and Philosophy of Art
  • Epistemology
  • Feminist Philosophy
  • History of Western Philosophy
  • Metaphysics
  • Moral Philosophy
  • Non-Western Philosophy
  • Philosophy of Language
  • Philosophy of Mind
  • Philosophy of Perception
  • Philosophy of Science
  • Philosophy of Action
  • Philosophy of Law
  • Philosophy of Religion
  • Philosophy of Mathematics and Logic
  • Practical Ethics
  • Social and Political Philosophy
  • Browse content in Religion
  • Biblical Studies
  • Christianity
  • East Asian Religions
  • History of Religion
  • Judaism and Jewish Studies
  • Qumran Studies
  • Religion and Education
  • Religion and Health
  • Religion and Politics
  • Religion and Science
  • Religion and Law
  • Religion and Art, Literature, and Music
  • Religious Studies
  • Browse content in Society and Culture
  • Cookery, Food, and Drink
  • Cultural Studies
  • Customs and Traditions
  • Ethical Issues and Debates
  • Hobbies, Games, Arts and Crafts
  • Natural world, Country Life, and Pets
  • Popular Beliefs and Controversial Knowledge
  • Sports and Outdoor Recreation
  • Technology and Society
  • Travel and Holiday
  • Visual Culture
  • Browse content in Law
  • Arbitration
  • Browse content in Company and Commercial Law
  • Commercial Law
  • Company Law
  • Browse content in Comparative Law
  • Systems of Law
  • Competition Law
  • Browse content in Constitutional and Administrative Law
  • Government Powers
  • Judicial Review
  • Local Government Law
  • Military and Defence Law
  • Parliamentary and Legislative Practice
  • Construction Law
  • Contract Law
  • Browse content in Criminal Law
  • Criminal Procedure
  • Criminal Evidence Law
  • Sentencing and Punishment
  • Employment and Labour Law
  • Environment and Energy Law
  • Browse content in Financial Law
  • Banking Law
  • Insolvency Law
  • History of Law
  • Human Rights and Immigration
  • Intellectual Property Law
  • Browse content in International Law
  • Private International Law and Conflict of Laws
  • Public International Law
  • IT and Communications Law
  • Jurisprudence and Philosophy of Law
  • Law and Politics
  • Law and Society
  • Browse content in Legal System and Practice
  • Courts and Procedure
  • Legal Skills and Practice
  • Legal System - Costs and Funding
  • Primary Sources of Law
  • Regulation of Legal Profession
  • Medical and Healthcare Law
  • Browse content in Policing
  • Criminal Investigation and Detection
  • Police and Security Services
  • Police Procedure and Law
  • Police Regional Planning
  • Browse content in Property Law
  • Personal Property Law
  • Restitution
  • Study and Revision
  • Terrorism and National Security Law
  • Browse content in Trusts Law
  • Wills and Probate or Succession
  • Browse content in Medicine and Health
  • Browse content in Allied Health Professions
  • Arts Therapies
  • Clinical Science
  • Dietetics and Nutrition
  • Occupational Therapy
  • Operating Department Practice
  • Physiotherapy
  • Radiography
  • Speech and Language Therapy
  • Browse content in Anaesthetics
  • General Anaesthesia
  • Clinical Neuroscience
  • Browse content in Clinical Medicine
  • Acute Medicine
  • Cardiovascular Medicine
  • Clinical Genetics
  • Clinical Pharmacology and Therapeutics
  • Dermatology
  • Endocrinology and Diabetes
  • Gastroenterology
  • Genito-urinary Medicine
  • Geriatric Medicine
  • Infectious Diseases
  • Medical Toxicology
  • Medical Oncology
  • Pain Medicine
  • Palliative Medicine
  • Rehabilitation Medicine
  • Respiratory Medicine and Pulmonology
  • Rheumatology
  • Sleep Medicine
  • Sports and Exercise Medicine
  • Community Medical Services
  • Critical Care
  • Emergency Medicine
  • Forensic Medicine
  • Haematology
  • History of Medicine
  • Browse content in Medical Skills
  • Clinical Skills
  • Communication Skills
  • Nursing Skills
  • Surgical Skills
  • Browse content in Medical Dentistry
  • Oral and Maxillofacial Surgery
  • Paediatric Dentistry
  • Restorative Dentistry and Orthodontics
  • Surgical Dentistry
  • Medical Ethics
  • Medical Statistics and Methodology
  • Browse content in Neurology
  • Clinical Neurophysiology
  • Neuropathology
  • Nursing Studies
  • Browse content in Obstetrics and Gynaecology
  • Gynaecology
  • Occupational Medicine
  • Ophthalmology
  • Otolaryngology (ENT)
  • Browse content in Paediatrics
  • Neonatology
  • Browse content in Pathology
  • Chemical Pathology
  • Clinical Cytogenetics and Molecular Genetics
  • Histopathology
  • Medical Microbiology and Virology
  • Patient Education and Information
  • Browse content in Pharmacology
  • Psychopharmacology
  • Browse content in Popular Health
  • Caring for Others
  • Complementary and Alternative Medicine
  • Self-help and Personal Development
  • Browse content in Preclinical Medicine
  • Cell Biology
  • Molecular Biology and Genetics
  • Reproduction, Growth and Development
  • Primary Care
  • Professional Development in Medicine
  • Browse content in Psychiatry
  • Addiction Medicine
  • Child and Adolescent Psychiatry
  • Forensic Psychiatry
  • Learning Disabilities
  • Old Age Psychiatry
  • Psychotherapy
  • Browse content in Public Health and Epidemiology
  • Epidemiology
  • Public Health
  • Browse content in Radiology
  • Clinical Radiology
  • Interventional Radiology
  • Nuclear Medicine
  • Radiation Oncology
  • Reproductive Medicine
  • Browse content in Surgery
  • Cardiothoracic Surgery
  • Gastro-intestinal and Colorectal Surgery
  • General Surgery
  • Neurosurgery
  • Paediatric Surgery
  • Peri-operative Care
  • Plastic and Reconstructive Surgery
  • Surgical Oncology
  • Transplant Surgery
  • Trauma and Orthopaedic Surgery
  • Vascular Surgery
  • Browse content in Science and Mathematics
  • Browse content in Biological Sciences
  • Aquatic Biology
  • Biochemistry
  • Bioinformatics and Computational Biology
  • Developmental Biology
  • Ecology and Conservation
  • Evolutionary Biology
  • Genetics and Genomics
  • Microbiology
  • Molecular and Cell Biology
  • Natural History
  • Plant Sciences and Forestry
  • Research Methods in Life Sciences
  • Structural Biology
  • Systems Biology
  • Zoology and Animal Sciences
  • Browse content in Chemistry
  • Analytical Chemistry
  • Computational Chemistry
  • Crystallography
  • Environmental Chemistry
  • Industrial Chemistry
  • Inorganic Chemistry
  • Materials Chemistry
  • Medicinal Chemistry
  • Mineralogy and Gems
  • Organic Chemistry
  • Physical Chemistry
  • Polymer Chemistry
  • Study and Communication Skills in Chemistry
  • Theoretical Chemistry
  • Browse content in Computer Science
  • Artificial Intelligence
  • Computer Architecture and Logic Design
  • Game Studies
  • Human-Computer Interaction
  • Mathematical Theory of Computation
  • Programming Languages
  • Software Engineering
  • Systems Analysis and Design
  • Virtual Reality
  • Browse content in Computing
  • Business Applications
  • Computer Security
  • Computer Games
  • Computer Networking and Communications
  • Digital Lifestyle
  • Graphical and Digital Media Applications
  • Operating Systems
  • Browse content in Earth Sciences and Geography
  • Atmospheric Sciences
  • Environmental Geography
  • Geology and the Lithosphere
  • Maps and Map-making
  • Meteorology and Climatology
  • Oceanography and Hydrology
  • Palaeontology
  • Physical Geography and Topography
  • Regional Geography
  • Soil Science
  • Urban Geography
  • Browse content in Engineering and Technology
  • Agriculture and Farming
  • Biological Engineering
  • Civil Engineering, Surveying, and Building
  • Electronics and Communications Engineering
  • Energy Technology
  • Engineering (General)
  • Environmental Science, Engineering, and Technology
  • History of Engineering and Technology
  • Mechanical Engineering and Materials
  • Technology of Industrial Chemistry
  • Transport Technology and Trades
  • Browse content in Environmental Science
  • Applied Ecology (Environmental Science)
  • Conservation of the Environment (Environmental Science)
  • Environmental Sustainability
  • Environmentalist Thought and Ideology (Environmental Science)
  • Management of Land and Natural Resources (Environmental Science)
  • Natural Disasters (Environmental Science)
  • Nuclear Issues (Environmental Science)
  • Pollution and Threats to the Environment (Environmental Science)
  • Social Impact of Environmental Issues (Environmental Science)
  • History of Science and Technology
  • Browse content in Materials Science
  • Ceramics and Glasses
  • Composite Materials
  • Metals, Alloying, and Corrosion
  • Nanotechnology
  • Browse content in Mathematics
  • Applied Mathematics
  • Biomathematics and Statistics
  • History of Mathematics
  • Mathematical Education
  • Mathematical Finance
  • Mathematical Analysis
  • Numerical and Computational Mathematics
  • Probability and Statistics
  • Pure Mathematics
  • Browse content in Neuroscience
  • Cognition and Behavioural Neuroscience
  • Development of the Nervous System
  • Disorders of the Nervous System
  • History of Neuroscience
  • Invertebrate Neurobiology
  • Molecular and Cellular Systems
  • Neuroendocrinology and Autonomic Nervous System
  • Neuroscientific Techniques
  • Sensory and Motor Systems
  • Browse content in Physics
  • Astronomy and Astrophysics
  • Atomic, Molecular, and Optical Physics
  • Biological and Medical Physics
  • Classical Mechanics
  • Computational Physics
  • Condensed Matter Physics
  • Electromagnetism, Optics, and Acoustics
  • History of Physics
  • Mathematical and Statistical Physics
  • Measurement Science
  • Nuclear Physics
  • Particles and Fields
  • Plasma Physics
  • Quantum Physics
  • Relativity and Gravitation
  • Semiconductor and Mesoscopic Physics
  • Browse content in Psychology
  • Affective Sciences
  • Clinical Psychology
  • Cognitive Psychology
  • Cognitive Neuroscience
  • Criminal and Forensic Psychology
  • Developmental Psychology
  • Educational Psychology
  • Evolutionary Psychology
  • Health Psychology
  • History and Systems in Psychology
  • Music Psychology
  • Neuropsychology
  • Organizational Psychology
  • Psychological Assessment and Testing
  • Psychology of Human-Technology Interaction
  • Psychology Professional Development and Training
  • Research Methods in Psychology
  • Social Psychology
  • Browse content in Social Sciences
  • Browse content in Anthropology
  • Anthropology of Religion
  • Human Evolution
  • Medical Anthropology
  • Physical Anthropology
  • Regional Anthropology
  • Social and Cultural Anthropology
  • Theory and Practice of Anthropology
  • Browse content in Business and Management
  • Business Ethics
  • Business Strategy
  • Business History
  • Business and Technology
  • Business and Government
  • Business and the Environment
  • Comparative Management
  • Corporate Governance
  • Corporate Social Responsibility
  • Entrepreneurship
  • Health Management
  • Human Resource Management
  • Industrial and Employment Relations
  • Industry Studies
  • Information and Communication Technologies
  • International Business
  • Knowledge Management
  • Management and Management Techniques
  • Operations Management
  • Organizational Theory and Behaviour
  • Pensions and Pension Management
  • Public and Nonprofit Management
  • Social Issues in Business and Management
  • Strategic Management
  • Supply Chain Management
  • Browse content in Criminology and Criminal Justice
  • Criminal Justice
  • Criminology
  • Forms of Crime
  • International and Comparative Criminology
  • Youth Violence and Juvenile Justice
  • Development Studies
  • Browse content in Economics
  • Agricultural, Environmental, and Natural Resource Economics
  • Asian Economics
  • Behavioural Finance
  • Behavioural Economics and Neuroeconomics
  • Econometrics and Mathematical Economics
  • Economic History
  • Economic Systems
  • Economic Methodology
  • Economic Development and Growth
  • Financial Markets
  • Financial Institutions and Services
  • General Economics and Teaching
  • Health, Education, and Welfare
  • History of Economic Thought
  • International Economics
  • Labour and Demographic Economics
  • Law and Economics
  • Macroeconomics and Monetary Economics
  • Microeconomics
  • Public Economics
  • Urban, Rural, and Regional Economics
  • Welfare Economics
  • Browse content in Education
  • Adult Education and Continuous Learning
  • Care and Counselling of Students
  • Early Childhood and Elementary Education
  • Educational Equipment and Technology
  • Educational Strategies and Policy
  • Higher and Further Education
  • Organization and Management of Education
  • Philosophy and Theory of Education
  • Schools Studies
  • Secondary Education
  • Teaching of a Specific Subject
  • Teaching of Specific Groups and Special Educational Needs
  • Teaching Skills and Techniques
  • Browse content in Environment
  • Applied Ecology (Social Science)
  • Climate Change
  • Conservation of the Environment (Social Science)
  • Environmentalist Thought and Ideology (Social Science)
  • Management of Land and Natural Resources (Social Science)
  • Natural Disasters (Environment)
  • Pollution and Threats to the Environment (Social Science)
  • Social Impact of Environmental Issues (Social Science)
  • Sustainability
  • Browse content in Human Geography
  • Cultural Geography
  • Economic Geography
  • Political Geography
  • Browse content in Interdisciplinary Studies
  • Communication Studies
  • Museums, Libraries, and Information Sciences
  • Browse content in Politics
  • African Politics
  • Asian Politics
  • Chinese Politics
  • Comparative Politics
  • Conflict Politics
  • Elections and Electoral Studies
  • Environmental Politics
  • Ethnic Politics
  • European Union
  • Foreign Policy
  • Gender and Politics
  • Human Rights and Politics
  • Indian Politics
  • International Relations
  • International Organization (Politics)
  • Irish Politics
  • Latin American Politics
  • Middle Eastern Politics
  • Political Behaviour
  • Political Economy
  • Political Institutions
  • Political Methodology
  • Political Communication
  • Political Philosophy
  • Political Sociology
  • Political Theory
  • Politics and Law
  • Politics of Development
  • Public Policy
  • Public Administration
  • Qualitative Political Methodology
  • Quantitative Political Methodology
  • Regional Political Studies
  • Russian Politics
  • Security Studies
  • State and Local Government
  • UK Politics
  • US Politics
  • Browse content in Regional and Area Studies
  • African Studies
  • Asian Studies
  • East Asian Studies
  • Japanese Studies
  • Latin American Studies
  • Middle Eastern Studies
  • Native American Studies
  • Scottish Studies
  • Browse content in Research and Information
  • Research Methods
  • Browse content in Social Work
  • Addictions and Substance Misuse
  • Adoption and Fostering
  • Care of the Elderly
  • Child and Adolescent Social Work
  • Couple and Family Social Work
  • Direct Practice and Clinical Social Work
  • Emergency Services
  • Human Behaviour and the Social Environment
  • International and Global Issues in Social Work
  • Mental and Behavioural Health
  • Social Justice and Human Rights
  • Social Policy and Advocacy
  • Social Work and Crime and Justice
  • Social Work Macro Practice
  • Social Work Practice Settings
  • Social Work Research and Evidence-based Practice
  • Welfare and Benefit Systems
  • Browse content in Sociology
  • Childhood Studies
  • Community Development
  • Comparative and Historical Sociology
  • Disability Studies
  • Economic Sociology
  • Gender and Sexuality
  • Gerontology and Ageing
  • Health, Illness, and Medicine
  • Marriage and the Family
  • Migration Studies
  • Occupations, Professions, and Work
  • Organizations
  • Population and Demography
  • Race and Ethnicity
  • Social Theory
  • Social Movements and Social Change
  • Social Research and Statistics
  • Social Stratification, Inequality, and Mobility
  • Sociology of Religion
  • Sociology of Education
  • Sport and Leisure
  • Urban and Rural Studies
  • Browse content in Warfare and Defence
  • Defence Strategy, Planning, and Research
  • Land Forces and Warfare
  • Military Administration
  • Military Life and Institutions
  • Naval Forces and Warfare
  • Other Warfare and Defence Issues
  • Peace Studies and Conflict Resolution
  • Weapons and Equipment

Genes: A Very Short Introduction (1st edn)

A newer edition of this book is available.

  • < Previous chapter

(page 110) p. 110 Conclusion: the varied concepts of the gene

  • Published: September 2014
  • Cite Icon Cite
  • Permissions Icon Permissions

There are many concepts of the gene. They range from defined sequences of DNA encoding proteins, to variant genes distinguishing individuals (markers), to unknown genes controlling quantitative traits, to hypothetical entities controlling behaviour as well as other complex characteristics. The science of genes is at its most precise and reliable when dealing with known protein coding genes. But all of the different concepts of the gene have been and continue to be important in numerous areas of human thought.

Personal account

  • Sign in with email/username & password
  • Get email alerts
  • Save searches
  • Purchase content
  • Activate your purchase/trial code
  • Add your ORCID iD

Institutional access

Sign in with a library card.

  • Sign in with username/password
  • Recommend to your librarian
  • Institutional account management
  • Get help with access

Access to content on Oxford Academic is often provided through institutional subscriptions and purchases. If you are a member of an institution with an active account, you may be able to access content in one of the following ways:

IP based access

Typically, access is provided across an institutional network to a range of IP addresses. This authentication occurs automatically, and it is not possible to sign out of an IP authenticated account.

Choose this option to get remote access when outside your institution. Shibboleth/Open Athens technology is used to provide single sign-on between your institution’s website and Oxford Academic.

  • Click Sign in through your institution.
  • Select your institution from the list provided, which will take you to your institution's website to sign in.
  • When on the institution site, please use the credentials provided by your institution. Do not use an Oxford Academic personal account.
  • Following successful sign in, you will be returned to Oxford Academic.

If your institution is not listed or you cannot sign in to your institution’s website, please contact your librarian or administrator.

Enter your library card number to sign in. If you cannot sign in, please contact your librarian.

Society Members

Society member access to a journal is achieved in one of the following ways:

Sign in through society site

Many societies offer single sign-on between the society website and Oxford Academic. If you see ‘Sign in through society site’ in the sign in pane within a journal:

  • Click Sign in through society site.
  • When on the society site, please use the credentials provided by that society. Do not use an Oxford Academic personal account.

If you do not have a society account or have forgotten your username or password, please contact your society.

Sign in using a personal account

Some societies use Oxford Academic personal accounts to provide access to their members. See below.

A personal account can be used to get email alerts, save searches, purchase content, and activate subscriptions.

Some societies use Oxford Academic personal accounts to provide access to their members.

Viewing your signed in accounts

Click the account icon in the top right to:

  • View your signed in personal account and access account management features.
  • View the institutional accounts that are providing access.

Signed in but can't access content

Oxford Academic is home to a wide variety of products. The institutional subscription may not cover the content that you are trying to access. If you believe you should have access to that content, please contact your librarian.

For librarians and administrators, your personal account also provides access to institutional account management. Here you will find options to view and activate subscriptions, manage institutional settings and access options, access usage statistics, and more.

Our books are available by subscription or purchase to libraries and institutions.

Month: Total Views:
October 2022 1
November 2022 1
January 2023 2
March 2023 1
May 2023 4
November 2023 2
April 2024 2
June 2024 2
August 2024 1
  • About Oxford Academic
  • Publish journals with us
  • University press partners
  • What we publish
  • New features  
  • Open access
  • Rights and permissions
  • Accessibility
  • Advertising
  • Media enquiries
  • Oxford University Press
  • Oxford Languages
  • University of Oxford

Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide

  • Copyright © 2024 Oxford University Press
  • Cookie settings
  • Cookie policy
  • Privacy policy
  • Legal notice

This Feature Is Available To Subscribers Only

Sign In or Create an Account

This PDF is available to Subscribers Only

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

How to Write a Personal Narrative: A Step-by-Step Guide

essay introduction about genes

“As I sat down to write this article, memories flooded back, each one a brushstroke in the painting of my past…”

That could be the beginning of your personal narrative. Writing it lets you turn your memories and experiences into stories that click with others. This type of writing goes beyond school assignments or essays for college applications; it’s a chance to get really good at sharing your life's events in ways that matter. 

In this article, we're going to explore what personal narratives are all about and guide you through a simple seven-step process to create your own. You’ll learn how to pull out moments that make your story stand out and how to tweak your writing until it’s just right. We’ve got practical examples for you to follow along, making sure you have everything you need to tell your story. 

What is a Personal Narrative?

A personal narrative is a way to tell your own story. It's a style of writing that puts your experiences front and center, inviting readers into your world. Teachers often assign personal narratives to encourage free, expressive writing. 

The personal narrative definition is wider than academic settings, though. . These narratives can also show potential employers who you are beyond your resume. At its core, writing a personal narrative is a form of storytelling, using a first-person perspective to bring real-life tales to life. Whether it's for a grade, a job, or just for fun, it's about getting your story out there.

Your Story, Perfected

Let our experts refine your personal narrative, making sure every detail shines and your story is both clear and impactful.

How to Write a Personal Narrative: Steps

In this section, we'll break down the process into manageable steps, starting with how to zero in on the right topic that speaks about who you are.

How to Write a Personal Narrative

Step 1. Choosing a Personal Narrative Topic

The first step in crafting your personal narrative is picking the perfect topic. It should be something meaningful to you, something that has not just happened, but also shaped who you are or has a significant story behind it. Here’s how to frame your personal narrative ideas:

  • Story Arc : Your narrative is like a mini-movie. Start with setting the scene, build up to the main event, and wrap up with a reflection. For example, if you’re writing about your first solo travel experience, begin with your initial feelings, describe the challenges you faced, and end with what you learned about yourself.
  • Thematic Focus : Instead of moving through time, center your narrative around a central theme. Maybe it’s about resilience, and you could link different times you had to be resilient, ending with a major life challenge.
  • A Day to Remember : Sometimes a single day can tell a lot about you. Pick a day that was particularly memorable and unpack it from start to finish. Maybe it was a seemingly ordinary day that brought unexpected lessons or joys.

Step 2. Working on Your Personal Narrative Outline

When putting together your personal narrative, starting with a solid outline can help keep your story on track. Here's how you can lay it all out:

  • Introduction: Kick things off with a hook that grabs attention, like an intriguing question or a vivid snapshot of a key moment. Set the scene and introduce the main theme.
  • Setting and Characters : Give a good sense of where your story is unfolding and who's involved. Paint a clear picture of the backdrop and the key people.
  • Plot Development : Lay out the events in the order they happened, or group them around major themes. Build up to your main event, adding conflicts or challenges as you go.
  • Climax : This is the high point of your story, where everything comes to a head. Make it a moment that has the most impact.
  • Resolution : Wrap up the main storyline, showing how things settled down after the climax.
  • Reflection : Spend some time reflecting on what happened. Share what you learned or how you changed because of the experience.

If you're looking for help crafting your personal narrative, consider checking out some legit essay writing services to get professional guidance.

Step 3. Writing the First Draft of Your Personal Narrative

Now let’s move on to the fun part! Don't worry about getting everything perfect right away — the first draft’s goal is to let your story flow naturally:

  • Start with Your Hook: Revisit the introduction you outlined and flesh it out. Begin with the attention-grabbing sentence that will make readers want to continue.
  • Let the Story Unfold: Follow your outline, but allow yourself some flexibility. As you write, new memories or details may come to mind. Embrace them! Think about what you saw, heard, and felt during these moments. Were you sitting in a sunlit room, listening to the hum of a busy street outside? Maybe you felt the chill of an autumn breeze? 
  • Stay True to Your Voice: This is your story, so let your unique voice shine through. Whether you're humorous, reflective, or serious, maintain a consistent tone that feels authentically you. Keep the tone conversational and straightforward, as if you’re telling this story to a friend. 

Once the first personal narrative draft is done, set it aside for a bit before revisiting it with fresh eyes.

Step 4. Revising Your Personal Narrative

Once the first personal narrative draft is done, set it aside for a bit before revisiting it with fresh eyes:

  • Tighten Up the Story : As you go through your draft, focus on making everything clear and to the point. If you’ve talked about how nervous you were before a big event more than once, try to combine those thoughts into one powerful sentence that really captures how you felt.
  • Keep Your Tone Consistent : Make sure your voice stays the same throughout the story. If you start off with a casual, conversational tone, like saying, “I couldn’t shake the nerves before my big test,” stick with that style instead of suddenly becoming formal later on.
  • Adjust the Pacing : Pay attention to how smoothly your story flows from one part to the next. When you’re describing a key moment, like meeting someone important or going through a major experience, give it the detail and time it deserves. Let those moments develop naturally without rushing.
  • Enhance Your Descriptions : Make your imagery more vivid to help the reader visualize your story. For example, instead of just saying, “The room was noisy,” you could say, “The room buzzed with excited chatter.” These small tweaks can make your story feel more alive and engaging.

Step 5. Adding Personal Touches

As you polish your personal narrative, focus on making it uniquely yours. You can include personal reflections on your experiences. For example, if you’re writing about a challenging project, discuss not just the struggle but how it impacted you personally and professionally.

Besides, add unique details that only you can share. Instead of generic descriptions, use specific anecdotes or sensory details, like how the scent of freshly baked cookies from your grandmother's kitchen made you feel nostalgic.

Last but not least, incorporate dialogues or direct quotes from people involved in your story to add authenticity and depth. For instance, if your mentor gave you advice, include their exact words to capture the moment’s impact. This approach will help you understand how to write a personal narrative that is both engaging and deeply personal.

Not sure where to begin? You can always buy a narrative essay from experts who can help shape your story.

Step 6: Editing for Clarity and Style

When you’re editing your personal narratives, the goal is to make sure everything flows smoothly and makes sense. Here’s how to get it just right:

  • Clarify Your Message: Check for any parts of your story that might be a bit confusing. If you talked about being excited about a project and then suddenly shifted to its challenges, make sure to connect these thoughts clearly. For instance, you might rephrase it as “I was excited about the project, but I soon faced some unexpected challenges, like tight deadlines.”
  • Simplify Complex Sentences: Break down long or complicated sentences. Instead of saying, “My enthusiasm for the project, which was incredibly high despite the difficulties I faced, was the driving force behind my perseverance,” you could simplify it to, “Even though the project was tough, my excitement kept me going.”
  • Smooth Transitions: Check how your paragraphs and sections flow together. If you jump from describing a problem to the solution without a clear link, add a transition. For example, “After struggling with the project’s challenges, I realized that asking my mentor for help was the key to overcoming the obstacles.”

Oh, and read your narrative out loud. This can help you spot any awkward phrases or spots where the story might be a bit choppy. It’s a great way to catch any issues and make those final tweaks to get everything just right.

Personal Narrative Prompts

Here are ten personal narrative prompts to get you thinking about different moments in your life:

Topic Prompt
🏆 Facing Challenges Think about a tough situation you faced and how you got through it. Maybe you conquered a big project or overcame a personal hurdle. Share what happened and what you learned from it.
🌟 A Big Change Write about something that changed your life or perspective. This could be anything from a life-changing trip to a meaningful conversation that made you see things differently.
🎓 School Memories Share a standout moment from your school years that made a big impact on you. It might be a memorable class, a special event, or something else that stuck with you.
🚀 Achieving Goals Talk about a goal you set and achieved. Explain what it was, how you worked towards it, and what reaching this goal meant to you.
🤝 Helping Others Describe a time when you helped someone out. What did you do, and how did it make you feel? It could be anything from assisting a friend to volunteering in your community.
💪 Your Strengths Reflect on a personal strength or skill you're proud of. Share how you discovered it, developed it, and how it's helped you in different areas of your life.
🎉 Fun Times Write about a fun or exciting experience you had. It could be a family celebration, a personal achievement, or just a memorable day that made you smile.
📚 Influential Media Think about a book or movie that had an impact on you. Describe what it was and how it changed the way you think or feel.
✈️ Travel Adventures Share a memorable travel experience. Whether it’s the places you visited or the people you met, talk about how the trip affected you or what you learned from it.
💬 Meaningful Conversations Write about a conversation that really stuck with you. Who were you talking to, what was it about, and how did it make a difference in your life?

Need more tips on how to get started? Check out this guide on how to start a narrative essay to kick off your writing with a strong opening.

Personal Narrative Examples

Here are a few personal narrative beginnings to spark your creativity. These snippets are designed to get you started and inspire your own storytelling.

Wrapping Up

As you finish up your story, think about how those moments shaped who you are today. It's not just about what happened, but how it changed you. When learning how to write a personal narrative, it’s important to focus on the moments that truly matter to you and tell them in your own voice. This way, your narrative can really connect with others. 

Remember, the best stories come straight from the heart, so trust yourself and let your experiences shine through!

If you're working on a personal statement, you might want to explore a personal statement service that can help you create a compelling narrative.

Turn Memories into Masterpieces

Let us transform your experiences into a beautifully crafted narrative that stands out and makes an impact.

How to Start a Personal Narrative?

Can a personal narrative be about anything, what is the format of a personal narrative.

Daniel Parker

Daniel Parker

is a seasoned educational writer focusing on scholarship guidance, research papers, and various forms of academic essays including reflective and narrative essays. His expertise also extends to detailed case studies. A scholar with a background in English Literature and Education, Daniel’s work on EssayPro blog aims to support students in achieving academic excellence and securing scholarships. His hobbies include reading classic literature and participating in academic forums.

essay introduction about genes

is an expert in nursing and healthcare, with a strong background in history, law, and literature. Holding advanced degrees in nursing and public health, his analytical approach and comprehensive knowledge help students navigate complex topics. On EssayPro blog, Adam provides insightful articles on everything from historical analysis to the intricacies of healthcare policies. In his downtime, he enjoys historical documentaries and volunteering at local clinics.

  • The New York Times. (2020, January 7). Personal Narrative Essay Winners. The New York Times. https://www.nytimes.com/2020/01/07/learning/personal-narrative-essay-winners.html

How to Write a Music Essay: Topics and Examples

IMAGES

  1. Essay Explaining How a Gene Function is Determined

    essay introduction about genes

  2. Essay for molecular genetics

    essay introduction about genes

  3. Next Generation DNA Sequencing Technologies Biology Essay

    essay introduction about genes

  4. Characterization of Gene and Its Product of Given Sequence Using

    essay introduction about genes

  5. PPT

    essay introduction about genes

  6. Introduction to Genes & DNA

    essay introduction about genes

VIDEO

  1. Genetic and Cultural Factors of Human Development

  2. The Basics of DNA and Genetics

  3. 1. What is gene? Introduction to fundamental aspects

  4. Genetics: Introduction to Genes

  5. Influence of Nature and Nurture on Personality Traits

  6. Essay Introduction

COMMENTS

  1. Introduction to Genetics

    The passing of traits from parents to offspring is known as heredity, therefore, genetics is the study of heredity. This introduction to genetics takes you through the basic components of genetics such as DNA, genes, chromosomes and genetic inheritance. Genetics is built around molecules called DNA. DNA molecules hold all the genetic ...

  2. Genetics essay

    Introduction: Genetics, the study of heredity and the variation of inherited characteristics, is a captivating field that has transformed our understanding of life itself. The intricate dance of genes orchestrates the development, functioning, and diversity of living organisms.

  3. Genetics

    Genetics forms one of the central pillars of biology and overlaps with many other areas, such as agriculture, medicine, and biotechnology. Since the dawn of civilization, humankind has recognized the influence of heredity and applied its principles to the improvement of cultivated crops and domestic animals. A Babylonian tablet more than 6,000 ...

  4. Introduction to genetics

    Genetics is the study of genes and tries to explain what they are and how they work. Genes are how living organisms inherit features or traits from their ancestors; for example, children usually look like their parents because they have inherited their parents' genes. Genetics tries to identify which traits are inherited and to explain how these traits are passed from generation to generation.

  5. Gene

    The number of genes in an organism's genome (the entire set of chromosomes) varies significantly between species. For example, whereas the human genome contains an estimated 20,000 to 25,000 genes, the genome of the bacterium Escherichia coli O157:H7 houses precisely 5,416 genes. Arabidopsis thaliana—the first plant for which a complete genomic sequence was recovered—has roughly 25,500 ...

  6. Genes: A Very Short Introduction

    Genes: A Very Short Introduction explores the concept of the gene and looks at the discovery, nature, and role of genes in both evolution and development. Explaining the nature of genetic variation in the human population, how hereditary factors were identified as molecules of DNA, and how certain specific mutations can lead to disease, it ...

  7. Introduction to Genetics

    Genetics, otherwise known as the Science of Heredity, is the study of biological information, and how this information is stored, replicated, transmitted and used by subsequent generations. The study of genetics can be sub-divided into three main areas: Transmission Genetics, Molecular Genetics, and Population Genetics. In this Introductory text, the focus is on Transmission or Classical ...

  8. Genetics: Introduction, law of inheritance and Sex Determination

    Genetics is termed as the study to understand the functioning of inheritance of traits from parents to offspring. The groundwork on which heredity stands is known as inheritance. It is defined as the procedure by which characteristics are handed down from one generation to the other. Gregor Johann Mendel is known as the "Father of Modern ...

  9. Introduction to DNA and Genetics

    Introduction to DNA and Genetics. Genetics is the study of the qualities that are inherited and transmitted to the offspring from the parents during reproduction. Parents pass traits to their offspring, making the basis of heredity. The inherited traits are coded for in genes, which are the inherited elements.

  10. Essay on Genetics (For College and Medical Students)

    1. Essay on the Meaning of Genes: The term 'gene' was coined by Danish botanist Wilhelm Johannsen in 1909. It is the basic physical and functional unit of heredity. Heredity is the transfer of characters from parents to their offspring that is why children resemble their parents.

  11. GENETICS 101

    GENETICS 101. Almost every human trait and disease has a genetic component, whether inherited or influenced by behavioral factors such as exercise. Genetic components can also modify the body's response to environmental factors such as toxins. Understanding the underlying concepts of human genetics and the role of genes, behavior, and the ...

  12. History of genetics

    The history of genetics dates from the classical era with contributions by Pythagoras, Hippocrates, Aristotle, Epicurus, and others.Modern genetics began with the work of the Augustinian friar Gregor Johann Mendel. His works on pea plants, published in 1866, provided the initial evidence that, on its rediscovery in 1900's, helped to establish the theory of Mendelian inheritance.

  13. Genetics Essay

    Introduction: This essay will explore the cell cycle, cancer, cancer, mitosis and meioses, genetic variation, Mendelian genetics, and complex genetic inheritance. Each area will be broken down in further detail, and the result should demonstrate the knowledge learned on the subjects. The Cell Cycle The cell cycle is a series of stages that ...

  14. DNA

    The configuration of the DNA molecule is highly stable, allowing it to act as a template for the replication of new DNA molecules, as well as for the production (transcription) of the related RNA (ribonucleic acid) molecule.A segment of DNA that codes for the cell's synthesis of a specific protein is called a gene.. DNA replicates by separating into two single strands, each of which serves ...

  15. Essay on DNA: Meaning, Features and Forms

    DNA stores and transmits the genetic information in cells. It forms the basis for genetic code. The genes are made of DNA and are responsible for passing on traits from generation to generation. DNA contains the genetic instructions for the development and functioning of living organisms. Thus it is the substance of heredity. Essay # 3.

  16. The genetic basis of disease

    Genetics plays a role, to a greater or lesser extent, in all diseases. Variations in our DNA and differences in how that DNA functions (alone or in combinations), alongside the environment (which encompasses lifestyle), contribute to disease processes. This review explores the genetic basis of human disease, including single gene disorders, chromosomal imbalances, epigenetics, cancer and ...

  17. Nature Vs Nurture

    Nature vs. Nurture Essay. Nature is the influence of genetics or hereditary factors in determining the individual's behavior. In other words, it is how natural factors shape the behavior or personality of an individual. In most cases, nature determines the physical characteristics which in effect influence the behavior of an individual.

  18. Essay Introduction Paragraph Example (pdf)

    Essay Introduction Paragraph Example Crafting an essay on the topic of "Essay Introduction Paragraph Example" might initially seem like a straightforward task, but delving into the intricacies of creating an effective introduction can prove to be quite challenging. The difficulty lies in striking the right balance between capturing the reader's attention, providing a concise overview of the ...

  19. Genes as DNA

    In the 1980s and 1990s the complex mechanisms by which genes control embryonic development were discovered. The complete sequencing of a typical human genome was started in the late 1990s and achieved in 2003. It showed that the genome of human beings contains about three billion base pairs of DNA. Keywords: cell division, development, DNA ...

  20. Introduction to Gene Ontology

    What Is Gene Ontology? The Gene Ontology (GO) database is an effort to create an explicitly defined dictionary to describe the roles of gene products and their relationships to each other. 1,2 "It is essentially a vocabulary that allows scientists or biologists across all domains of life to describe what they are seeing… Every term is defined and has a relation to other terms," said Paul ...

  21. Introduction

    1 Introduction. Genome editing 1 is a powerful new tool for making precise additions, deletions, and alterations to the genome—an organism's complete set of genetic material. The development of new approaches—involving the use of meganucleases; zinc finger nucleases (ZFNs); transcription activator-like effector nucleases (TALENs); and, most ...

  22. Pervasive tissue-, genetic background-, and allele-specific gene

    Introduction. Gene expression variation is extensive at all organismal levels, including among tissues [1-2], cells [3-4], or alleles [5-6] of the same individual, and underlies much of the phenotypic variation that we see among individuals, populations, and species [7-9].A long-standing challenge in evolutionary genetics has been to identify and characterize this variation.

  23. Novel genetic structures associated with adverse response to

    Introduction: The role of genetic variants in response to chemotherapy has been investigated in several studies. This study aimed to investigate genetic variants associated with response to chemotherapy in breast cancer (BC) patients. Methods: Significant variants (p < 5 × 10 −8) associated with response to chemotherapy were obtained from GWA studies.

  24. Spatial multi-omics: deciphering technological landscape of integration

    The emergence of spatial multi-omics has helped address the limitations of single-cell sequencing, which often leads to the loss of spatial context among cell populations. Integrated analysis of the genome, transcriptome, proteome, metabolome, and epigenome has enhanced our understanding of cell biology and the molecular basis of human diseases. Moreover, this approach offers profound insights ...

  25. Genes: A Very Short Introduction

    Genes: A Very Short Introduction explores the concept of the gene and looks at the discovery and nature of genes and their position in human thought today. It explains how hereditary factors were identified as molecules of DNA, the nature of genetic variation in the human population, and how certain specific mutations can lead to disease. ...

  26. Transcriptomic and Gene Expression Analysis of Chemosensory Genes from

    Overall, 47 chemosensory genes were identified (2 ORs, 1 GR, 11 IRs, 9 CSPs, and 24 OBPs). Gene expression analysis revealed the predominant presence of IRs in the legs, whereas ORs and the GR were present in the heads and/or antennae. ... Introduction. Olfaction is an important sensory system for insects and is involved in vital processes such ...

  27. Fungal diversity and key functional gene abundance in Iowa bioretention

    1. Introduction It is well-established that stormwater is a highly complex mixture of contaminants. 1 Outside of "traditional" stormwater contaminants (e.g., heavy metals, nutrients, polycyclic aromatic hydrocarbons), evidence is growing for the presence of trace mobile/hydrophilic organic contaminants within stormwater. 2-4 To prevent mobile organic contaminants from entering ground and ...

  28. Conclusion: the varied concepts of the gene

    Abstract. There are many concepts of the gene. They range from defined sequences of DNA encoding proteins, to variant genes distinguishing individuals (markers), to unknown genes controlling quantitative traits, to hypothetical entities controlling behaviour as well as other complex characteristics. The science of genes is at its most precise ...

  29. How to Write a Personal Narrative: Easy Step-by-Step Guide

    Introduction: Kick things off with a hook that grabs attention, like an intriguing question or a vivid snapshot of a key moment. Set the scene and introduce the main theme. ... and various forms of academic essays including reflective and narrative essays. His expertise also extends to detailed case studies. A scholar with a background in ...

  30. Microorganisms

    Mixed forests often increase their stability and species richness in comparison to pure stands. However, a comprehensive understanding of the effects of mixed forests on soil properties, bacterial community diversity, and soil nitrogen cycling remains elusive. This study investigated soil samples from pure Robinia pseudoacacia stands, pure Quercus variabilis stands, and mixed stands of both ...