Appendix: Glossary of Terms
Adult Stem Cell: Undifferentiated stem cells located in various differentiated tissues, with limited self-renewal and differentiation potential.
Aminoacyl tRNA Synthetase: An enzyme that recognizes specific amino acids and catalyzes the formation of an ester bond between the carboxyl group of the amino acid and the hydroxyl group of the corresponding tRNA. The aminoacylated tRNA then participates in protein synthesis in the ribosome.
Artificial Cofactor: A molecule with a structure similar to natural cofactor, capable of performing electron transfer functions.
Artificial Intelligence (AI): A branch of computer science that aims to understand the nature of intelligence and create machines that can respond in ways similar to human intelligence. AI research includes robotics, speech recognition, image recognition, natural language processing, and expert systems.
Artificial Neural Network (ANN): A mathematical model that mimics the behavior of animal neural networks, enabling complex information processing through intricate internal connections. ANN is a method of machine learning.
Base Editing: A CRISPR-Cas-based technology for targeted base substitution, primarily including cytosine base editors (CBE) and adenine base editors (ABE). CBEs and ABEs use Cas9 mutants fused to cytosine deaminase or evolved adenine deaminase, respectively, to precisely edit target bases, enabling C-to-T (G-to-A) or A-to-G (T-to-C) substitutions.
Biofoundry: A synthetic biology platform that integrates industrial automation and intelligent manufacturing concepts. It uses high-throughput, automated equipment combined with design software and machine learning to rapidly and cost-effectively complete the “design-build-test-learn” cycle, enabling large-scale, high-throughput, and intelligent synthetic biology research and industrial applications.
Biopart: A functional amino acid or nucleotide sequence, such as regulatory elements (e.g., promoters, terminators, ribosome binding sites) or structural elements (e.g., enzyme genes in natural product synthesis pathways). These are the basic building blocks of organisms and the foundational modules for constructing artificial life forms in synthetic biology.
Bioprinting: An additive manufacturing technology that integrates medicine, engineering, electronics, and biology to precisely arrange bioactive materials and cells in space, creating functional living tissues. Bioprinting uses 3D printing-like technology to combine cells, growth factors, and biomaterials to produce biomedical components that mimic or surpass the features of natural tissues, enabling tissue repair and organ transplantation.
Biosensing: The use of biological materials (e.g., enzymes, proteins, DNA, antibodies, antigens, biomembranes, microorganisms, cells) as recognition elements to convert biochemical reactions into quantifiable physical or chemical signals, enabling the detection and monitoring of biological and chemical substances.
Biosynthesis Technology: A DNA synthesis technology that does not rely on existing DNA templates, instead using DNA terminal transferase and certain DNA polymerases to directly catalyze DNA strand synthesis.
Cell Factory: A metabolically engineered cell system designed for the production of specific substances. These modified cells utilize organic or inorganic substrates as carbon sources, driving energy-dependent activation of gene transcription, translation, and protein expression to produce specific biochemical products.
Cell Therapy: A therapeutic approach that involves isolating normal or functionally specific cells, expanding or processing them in vitro, and then transplanting or infusing them into patients. These cells can enhance immunity, kill pathogens and tumor cells, promote tissue regeneration, and aid in recovery. Stem cells or immune cells are commonly used in this therapy.
Cell-Free Protein Synthesis (CFPS): The process of achieving transcription and translation, as described by the central dogma of biology, in an open in vitro system.
Chassis Cell: A synthetic biology engineering platform where functionalized elements, circuits, and pathways are integrated into a host cell for rational design.
Chimeric Antigen Receptor T Cell (CAR-T Therapy): CAR-T therapy involves genetically modifying T cells to express a chimeric antigen receptor that combines a specific antigen recognition domain with T cell activation signals. These modified T cells directly bind to specific antigens on tumor cells, activating them to release perforin, granzyme B, and other substances that kill tumor cells. Additionally, they release cytokines to recruit endogenous immune cells for tumor cell destruction. This therapy can also generate memory T cells, providing long-term, specific anti-tumor immunity.
Coenzyme Regeneration: In this strategic research, coenzyme regeneration refers to: ①The conversion of oxidized nicotinamide coenzymes to reduced forms, such as NAD(P)+ to NADPH; ②The conversion of adenosine diphosphate (ADP) to adenosine triphosphate (ATP).
Column Synthesizer: A low-throughput synthesis instrument that uses porous silica material packed in a column as the reaction carrier.
Covalent Protein Drug: A protein-based drug that forms covalent bonds with target molecules, often exhibiting stronger and more sustained effects.
CRISPR-Cas Gene Editing Technology: This technology uses a designed guide RNA (gRNA) to direct Cas nucleases (e.g., Cas9, Cas12a) or Cas complexes to cut specific genomic targets, causing double-strand breaks. The cell’s DNA repair mechanisms, such as non-homologous end joining (NHEJ) or homology-directed repair (HDR), then enable precise cutting, deletion, insertion, or replacement of target sequences.
CRISPR-Cas System: An adaptive immune system in bacteria and archaea that defends against phage and other foreign DNA invasions. It consists of CRISPR (clustered regularly interspaced short palindromic repeats) and Cas (CRISPR-associated genes). When phages invade, foreign DNA is stored in CRISPR sequences. Upon re-invasion, CRISPR is transcribed into mature crRNA, guiding Cas effector proteins to target and eliminate the invading DNA.
Deep Learning: A type of neural network with many interconnected layers, capable of accurately extracting and representing complex biological patterns due to their more complex internal structure, thereby achieving precise prediction and generation tasks.
DNA Assembly: The artificial synthesis of double-stranded DNA sequences in vitro or in vivo.
DNA Sequencing: The process of determining the order of nucleotides (adenine (A), thymine (T), cytosine (C), and guanine (G)) in a specific DNA fragment.
DNA Synthesis: The artificial synthesis of single-stranded DNA sequences in vitro, typically in the 3'→5' direction.
Electron Donor: A substance that donates electrons in electron transfer reactions and accept oxidation, also known as a hole scavenger.
Embryonic Stem Cell (ESC): Undifferentiated cells derived from the inner cell mass of a human blastocyst. ESCs can proliferate indefinitely and have the potential to differentiate into all cell types of the three germ layers.
Emergence: A phenomenon where the interaction of many small entities produces a larger entity that exhibits properties not present in its individual components.
Extracellular Matrix (ECM): A network of substances secreted by cells into the extracellular space, primarily composed of fibrous components (e.g., collagen and elastin), adhesive proteins (e.g., fibronectin), and filler molecules (e.g., glycosaminoglycans).
Fecal Transplantation of Intestinal Flora: The transfer of functional microbiota from a healthy donor’s feces into a patient’s gastrointestinal tract to reconstruct the gut microbiota and treat intestinal and extra-intestinal diseases.
Gene Editing (Genome Editing): The targeted modification of specific DNA sequences in a genome using nucleases or complexes with targeting capabilities, enabling cutting, deletion, insertion, or replacement of DNA sequences to alter genetic information and achieve new functions or phenotypes.
Genetic Circuit: A synthetic genetic device designed using regulatory elements and regulated genes based on known mechanisms of gene and protein regulatory networks. It controls the expression of target genes under specific conditions, including the reconstruction and design of genetic circuits.
Genetic Code Expansion: A technology that uses orthogonal aminoacyl-tRNA synthetases and corresponding tRNAs to encode unnatural amino acids via nonsense codons or newly created codons.
Glycosylation: Refers to the process under the catalysis of glycosyltransferase where sugars are transferred to proteins, forming glycosidic bonds with amino acid residues on the proteins. It is an important post-translational modification of proteins and plays a significant role in regulating protein functions.
Gold Nanoparticle (Colloidal Gold): A well-studied nanomaterial with particle sizes typically ranging from 1 to 100 nm. It exhibits strong light absorption, scattering, and photothermal conversion properties, high electron density, dielectric characteristics, and catalytic activity. Gold nanoparticles can bind to various biomacromolecules and have excellent biocompatibility, making them widely used in bioanalysis and nanomedicine.
High-Throughput Chip Synthesizer: A high-throughput synthesis instrument that uses chips as synthesis carriers.
Immunogenicity: The ability of an antigen to induce an immune response, stimulating specific immune cells to activate, proliferate, and differentiate, ultimately producing immune effectors such as antibodies and sensitized lymphocytes.
Induced Pluripotent Stem Cell (iPSC): A type of stem cell artificially induced through reprogramming of human cells, possessing pluripotent differentiation potential similar to embryonic stem cells.
Knowledge Graph: A visual representation of the relationships between various concepts in a text, typically presented as a network.
Machine Learning: A core component of artificial intelligence that uses computer algorithms to simulate human learning, enabling the acquisition of new knowledge and skills. It typically uses mathematical models to fit patterns in data, allowing accurate predictions about the characteristics of new data or the generation of new data based on these patterns.
Mesenchymal Stem Cell (MSC): A type of adult stem cell with self-renewal and multi-lineage differentiation potential. MSCs are non-terminally differentiated cells with characteristics of mesenchymal, endothelial, and epithelial cells. Under specific induction conditions, they can differentiate into various tissue cells, including fat, cartilage, bone, muscle, nerve, liver, cardiac muscle, and endothelial cells, while retaining multi-lineage differentiation potential after continuous passaging and cryopreservation.
Metabolic Pathway Design: The redesign and engineering of microbial metabolic pathways using their diverse biochemical reaction networks to produce high-value products from low-value or renewable resources. This process focuses on efficient production, balancing metabolic flux and cofactor regeneration, and using gene editing and dynamic regulation to overcome feedback inhibition, creating an optimal synthesis pathway.
Mirror-Image Biology System: A system composed of mirror-image bioparts, which are chiral counterparts of natural biological molecules (e.g., proteins composed entirely of D-amino acids or nucleic acids composed entirely of L-nucleotides).
Multi-Enzyme Catalysis: A cascade reaction involving two or more enzymes, where the product of the first enzyme serves as the substrate or one of the substrates for subsequent enzymes.
Non-Equilibrium State: A steady state where system variables are not constant. A steady state refers to the final state that a system enters as it evolves over time from its initial state. A non-equilibrium state is a steady state other than an equilibrium state. It includes periodic motion (oscillatory state), quasi-periodic state (ergodic state), and chaotic state.
Nonsense Codon: Also known as a stop codon, it includes the amber codon (UAG), opal codon (UGA), and ochre codon (UAA). These codons were named by their discoverers.
One-Pot: A reaction method where all reactants are mixed in a single container in solution form.
Organ Repair: The use of tissue engineering methods to form or regenerate organs for repairing organ defects.
Organoid: A three-dimensional microtissue formed by the proliferation, differentiation, and self-organization of stem cells or organ-specific progenitor cells in vitro. Organoids contain multiple cell types and specific cellular arrangements, partially reflecting the physiological structure and function of the source tissue or organ.
Organoid Engineering: The use of engineering strategies to controllably design the cellular composition and self-organization of organoids, as well as to simulate complex tissue microenvironments, enabling the in vitro construction of highly reliable 3D organ models.
Organ-on-a-Chip: A miniature cell culture device that simulates the complex microenvironment of living tissues and organs in vitro, including multiple cell types, functional tissue interfaces, biochemical factors, and biomechanical forces (e.g., fluid shear stress, tension). It reflects the response of human organs to drugs or environmental stimuli.
Orthogonal Ribosomes: Artificially modified ribosomes that can recognize specific aminoacylated tRNAs without interfering with the endogenous system.
Phase Transition: The transformation of a substance from one state to another due to continuous changes in external parameters.
Phosphoramidite Triester Synthesis: The most widely used chemical method for oligonucleotide synthesis, where oligonucleotide monomers in solution form 3’→5’ phosphodiester bonds and attach to a solid-phase carrier.
Plastid Engineering: In addition to traditional nuclear genome engineering, plastid genomes in plants can also be genetically manipulated. Since the first plastid-transgenic tobacco was created in 1990, successful plastid transformations have been increasingly reported, making plastids a hotspot in plant synthetic biology research.
Polymerase Cycle Assembly Method: A method for assembling target DNA fragments directly from synthetic oligonucleotide primers without the need for additional DNA ligase.
Prime Editing Technology: A novel gene editing technology based on the CRISPR-Cas system. The prime editing system consists of two parts: ① A fusion protein of nCas9 (H840A) and engineered reverse transcriptase; ② A prime editing guide RNA (pegRNA) containing a reverse transcription template. Without requiring double-strand breaks or exogenous DNA templates, the pegRNA guides the nCas9- reverse transcriptase fusion protein to perform single-strand DNA cutting at the target site. The reverse transcriptase template carried by the pegRNA then reverse transcribes the designed DNA fragment, enabling precise insertions or arbitrary base substitutions at the target site.
Protein Prodrug: A precursor drug that undergoes chemical or enzymatic conversion in the body to become an active protein-based drug.
Protein-Drug Conjugate: A protein derivative formed by chemically linking biological or chemical molecules (e.g., small molecules, proteins, DNA, RNA, sugars, lipids) to a target protein, creating a drug candidate.
Pyrosequencing: A novel enzyme-based sequencing technology suitable for analyzing short known sequences. It offers reproducibility and accuracy comparable to Sanger sequencing but with significantly faster speed. Pyrosequencing has the capability to simultaneously sequence and analyze a large number of samples, enables high-throughput, low-cost, rapid, real-time and intuitive analysis of single nucleotide polymorphisms (SNPs) and clinical testing.
Pyrrolysine: A lysine derivative found in methanogenic archaea, known as the 22nd genetically encoded amino acid involved in protein biosynthesis.
Quadruple Codon: Through the modification of bioorthogonal elements, certain regions of mRNA are translated into a single amino acid using four nucleotides, i.e. the “quadruple codon”, which distinct from the “triple codon” in the natural amino acid translation system. It is an expanded genetic coding system that introduces additional base pairs (e.g., X and Y) to increase codon diversity, enabling the encoding of more amino acids or unnatural amino acids. This system can expand the diversity and functionality of protein synthesis.
Quantum Dot: A low-dimensional semiconductor material composed of group IV, II-VI, IV-VI, or III-V elements, typically spherical or quasi-spherical with diameters of 2–20 nm. Quantum confinement effects give quantum dots unique optical properties, including high fluorescence quantum yield, strong photostability, broad excitation spectra, narrow emission spectra, and tunable fluorescence wavelengths. They are widely used in life sciences and semiconductor devices.
Regenerative Medicine: An interdisciplinary field that applies principles and methods from life sciences, materials science, clinical medicine, computer science, and engineering to develop theories and technologies for replacing, repairing, reconstructing, or regenerating human tissues and organs.
Sample Loading Density of High-Throughput Sequencing: The density of DNA clusters or nanoballs per unit area on a high-throughput sequencing chip.
Sanger Sequencing (Chain Termination Method): A sequencing method invented by Sanger et al. in 1977, based on the incorporation of chain-terminating dideoxynucleotides (ddNTPs) during DNA synthesis by DNA polymerase. Each sequence determination consists of a set of four separate reactions, each sequencing reaction includes all four dNTPs and a limited amount of one ddNTP.
Self-Organization: A process in which a system spontaneously forms an ordered structure without external instructions, relying on mutually agreed-upon rules and coordination among its components.
Semi-Synthetic Organism: A life form created by introducing artificial, non-natural bases into genomic DNA, enabling the replication and transcription of genetic information. Currently, Escherichia coli is commonly used for this purpose.
Sequencing by Ligase chain reaction: A DNA sequencing method that uses DNA hybridization and ligation reactions to determine DNA sequences.
Sequencing by Synthesis: The most widely used high-throughput sequencing technology, which involves sequencing DNA by extending bases with DNA polymerase.
Sequencing Read: The length of sequence obtained in a sequencing reaction. In high-throughput sequencing, it is limited by the asynchronous extension of multiple copies of molecules, while in single-molecule sequencing, it is limited by the length of the extracted DNA fragment.
Single Molecule Sequencing: Sequencing individual DNA molecules in real-time without PCR amplification, primarily through single-molecule fluorescence sequencing or nanopore sequencing.
Spatial Omics: The analysis of gene expression patterns in individual cells and the spatial relationships and biological characteristics of cell populations across time and space.
Spatial Transcriptome Resolution: The minimum distance between capture probe clusters on the surface of a carrier.
Stem Cell: Cells with varying differentiation potential and the ability to self-renew in an undifferentiated state.
Synthetic Efficiency: The efficiency of coupling one base to the next in an oligonucleotide during a single synthesis cycle.
Synthetic Error Rate: A measure of synthesis fidelity, calculated as the ratio of incorrect bases to the total number of synthesized bases.
Synthetic Flux: The maximum number of oligonucleotide species that can be synthesized in a single run.
Synthetic Hybrid Biotic-abiotic System: A system or component composed of biological (e.g., nucleic acids, proteins, viruses) and non-biological (e.g., inorganic nanomaterials) elements. These components can be ordered through molecular assembly or biomineralization, leading to enhanced properties or emergent functions. Such systems include non-cellular hybrid systems and hybrid cellular systems.
Synthetic Hybrid Cellular Biosystem: An engineered cell system that carries or contains non-bioparts (e.g., inorganic nanomaterials) as functional units. These systems can be constructed by loading chemically synthesized non-bioparts into living cells or directly synthesizing non-bioparts within cells.
Synthetic Hybrid Non-Cellular Biosystem: Functional structures formed by the ordered combination of bioparts (e.g., proteins, nucleic acids, viruses) and non-bioparts (e.g., inorganic nanomaterials) through chemical crosslinking, adsorption, mineralization, or assembly.
Synthetic Yield: The total amount of oligonucleotide product from a single synthesis unit.
Transcription Activator-like Effector Nuclease (TALEN) Technology: A gene editing technology that uses transcription activator-like effector nucleases to target and edit specific DNA sequences. TALENs consist of a programmable transcription activator-like effector (TALE) protein for DNA recognition and the nuclease domain of FokI for DNA cleavage.
Universal Cell Therapy: A form of therapy that involves genetically modifying human leukocyte antigen (HLA) or T cell receptor (TCR) molecules to avoid immune rejection in allogeneic cell therapy. This allows for the standardization, scalability, and universalization of therapeutic cells, enabling their characterization, quality control, and application like pharmaceutical drugs.
Universality Class: In statistical physics, critical exponents are used to quantitatively describe the critical behavior of physical quantities near phase transition points. Phase transitions with the same critical exponents belong to the same universality class.
Unnatural Amino Acid: Amino acids other than the 20 naturally encoded ones. Selenocysteine and pyrrolysine are also naturally encoded in specific systems, making them the 21st and 22nd natural amino acids. All other amino acids are considered unnatural.
Unnatural Base: Artificially designed and synthesized bases introduced into biological systems, including derivatives of the four natural bases (A, T, C, G) and novel bases (e.g., X, Y) that can pair complementarily and be used in transcription and translation.
Unnatural Protein: Proteins containing amino acids other than the 20 standard natural amino acids, including synthetic unnatural amino acids.
Vascularization: The formation of blood vessel-like structures in three-dimensional tissues constructed in vitro. Vascularization addresses the issue of oxygen and nutrient deficiency in organoids and is a key focus in current organoid research.
Virus Vector: A virus modified through genetic engineering to serve as a vehicle for delivering foreign genes into cells. Virus vectors efficiently transduce cells and enable long-term gene expression. Tool virus vectors are characterized by high transduction efficiency and high levels of foreign gene expression.
Zinc-Finger Nuclease (ZFN) Technology: A gene editing technology that uses zinc-finger nucleases to target and edit specific DNA sequences. ZFN consists of a programmable zinc-finger protein for DNA recognition and the nuclease domain of FokI for DNA cleavage.