Luciferins Under Construction: A Review of Known Biosynthetic Pathways
Aleksandra S. Tsarkova
Frontiers in Ecology and Evolution, Vol. 9, Article 667829 (Sep. 20, 2021)
Note: this is a single-author mini-review, Tsarkova alone, not “Tsarkova et al.” She is at the Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry / Pirogov RNRMU in Moscow, which means she is part of the same Russian institutional network as Yampolsky, Sarkisyan, Kotlobay, and the Planta LLC team that engineered the autoluminescent plants in Kotlobay 2018 and Mitiouchkina 2020. That context matters: this review is written from the perspective of a community that has actually completed a heterologous luciferin biosynthesis pathway, and it carries an implicit “we know how hard this is, we did it once” weight.
- The headline framing: only two of ten known luciferins have fully characterized biosynthesis pathways. Bacterial (lux operon, FMNH₂ + long-chain aldehyde, encoded as a single gene cluster) and fungal (caffeic acid cycle, four-gene cluster luz/h3h/hisps/cph). The remaining eight, beetle D-luciferin, coelenterazine, vargulin, Fridericia luciferin, Odontosyllis luciferin, dinoflagellate luciferin, Latia luciferin, Diplocardia luciferin, are all in varying states of partial elucidation, with structures known but full enzymatic pathways missing. This is the cleanest single-sentence motivation for any project working on firefly luciferin biosynthesis: completing the D-luciferin pathway would move it from the “partially characterized” column into the “fully characterized” column, and would be the third complete luciferin pathway in any organism. That framing matters in any pitch context where the question is “why is this project worth doing?”
- The recurring pattern: most luciferins are small modified peptides built from aromatic amino acids plus L-cysteine. Tsarkova explicitly notes that “a surprising number of luciferins [belong] to a class of small modified peptides, often comprising aromatic amino acids (L-tyrosine, L-tryptophan, and L-phenylalanine) and L-cysteine.” Coelenterazine (Phe + 2 Tyr), vargulin (Trp + Arg + Ile), Fridericia luciferin (modified Tyr + GABA + Lys + oxalate), Odontosyllis luciferin (DOPA + 2 Cys), Arachnocampa LRC (Tyr + xanthurenic acid from Trp pathway), beetle D-luciferin (BQ from Tyr/phenolic pool + 2 L-Cys). The pattern is striking: aromatic amino acid building blocks + cysteine recur across phylogenetically unrelated bioluminescence systems. This is conceptually important for any “exploit endogenous plant phenolic chemistry” framing, the p-benzoquinone + cysteine route used by fireflies isn't an exotic firefly-specific accident; it's an instance of a much broader pattern in how nature builds luminogenic substrates from common metabolic precursors.
- The “detoxification origin” hypothesis is laid out a year before de Souza 2022 experimentally confirmed it. Tsarkova explicitly proposes that “biosynthetic pathways of D-luciferin and Odontosyllis luciferin might have arisen from a mutation-induced deviation of the melanogenic pathways to provide photoprotection against free radical species, particularly reactive oxygen species,” citing Rees 1998, Timmins 2001, Dubuisson 2004, and Napolitano 2013 as the foundational papers for this view. The framing is that luciferins might originally have been antioxidant byproducts of phenolic detoxification or pheomelanogenesis-like cysteine + quinone chemistry, with light emission as an initially incidental property that was later co-opted. de Souza 2022 then provided the direct experimental confirmation in E. coli, but Tsarkova's review is the cleaner cite for the conceptual framework, especially in any pitch that wants to set up “luciferin is the byproduct of a defensive chemistry” as the headline framing.
- Concise summary of the beetle D-luciferin pathway as currently understood. Arbutin → 1,4-hydroquinone → p-benzoquinone → 1,4-Michael addition of L-cysteine → carbon-sulfur rearrangement and decarboxylation analogous to late pheomelanogenesis → addition of second L-cysteine → L-luciferin → luciferase-mediated CoA esterification → epimerization → thioester hydrolysis → D-luciferin. Tsarkova explicitly frames this as the Lampyridae pathway and notes that “luciferin biosynthetic pathways in Elateridae and Phengodidae lineages are vastly underinvestigated,” a gap that Zhang 2020 partially addressed for Elateridae but that remains genuinely open for Phengodidae (railroad worms). For any project bibliography this is the cleanest single-paragraph summary of the firefly luciferin biosynthesis pathway as it stood in 2021, with all the major intermediates in one place and explicit attribution to the supporting primary literature.
- The peroxisomal compartmentation question is not addressed in detail. Worth flagging: Tsarkova reviews biosynthetic pathways but does not deeply engage with compartmentation. The peroxisomal targeting story (Gould 1987, the canonical SKL PTS1 tag, and the implications for which biosynthesis enzymes need to be peroxisomally targeted vs cytosolic) is in Adams & Miller 2020 and Fallon 2018 but not here. So this review covers chemistry-pathway scope thoroughly but cell-biology scope only lightly.
- Oxyluciferin recycling remains an open question. Tsarkova explicitly notes that “the question of oxyluciferin recycling mechanism in Lampyridae also remains unanswered” and is dismissive of the LRE (luciferin regenerating enzyme) hypothesis: “more recent evidence suggests that LRE may perform other functions in cells and its role in D-luciferin recycling in vivo requires further clarification.” This matches Adams & Miller 2020's skepticism. Useful for the project to know: there is no consensus enzyme for converting oxyluciferin back to luciferin in fireflies, which means any heterologous tobacco system running TU1 will accumulate oxyluciferin over time and there is no obvious gene to add to the construct to recycle it. If oxyluciferin accumulation turns out to be a Phase 2 or Phase 3 problem, the field hasn't solved it yet.
- Sulfoluciferin storage corroborated by Zhang 2020 expression analysis. Tsarkova notes that the Fallon 2016 sulfoluciferin storage hypothesis was further supported by sulfotransferase expression analysis in Zhang 2020's Lamprigera yunnana and Abscondita terminalis lanterns. So LST is now considered a confirmed component of beetle luciferin metabolism, and the storage-via-sulfation strategy is increasingly understood as a recurring motif (also seen in coelenterazine, vargulin, and other marine luciferins) for managing reactive luciferin pools.
- The autonomy / self-sustained luminescence framing is explicitly invoked. In the closing paragraph, Tsarkova states: “The knowledge of all enzymes participating in bioluminescence cascades opens the possibility of engineering organisms with self-sustained luminescence, thus allowing the development of substrate-independent bioluminescence-based reporter technologies.” This is the field's standard framing of why luciferin biosynthesis pathways matter for synthetic biology, and it is articulated here by an author who literally helped engineer the only existing example of this in plants (the Mitiouchkina 2020 autoluminescent tobacco using fungal genes from Kotlobay 2018). The implicit framing is: completing the firefly D-luciferin pathway would enable a second autonomous-bioluminescence platform in plants, with different photonic and biotechnological properties than the fungal one.
Bottom line for the project: This is the single-best citation for the field-wide context that motivates the project. Three concrete uses for the bibliography. First, in any introduction, motivation, or pitch context, whether for a grant proposal, a startup deck, or a paper introduction, Tsarkova 2021 is the cleanest citation for “the state of luciferin biosynthesis knowledge as of the early 2020s,” with the headline statistic that only 2 of 10 known luciferins have fully characterized pathways. That framing makes the project's goal (completing the firefly D-luciferin pathway in a heterologous host) sound like the obvious next move in a clear field-level program rather than a one-off engineering effort. Second, the conceptual framework laid out here, luciferins as detoxification byproducts of cysteine-quinone chemistry, luciferases as repurposed metabolic enzymes from unrelated families, aromatic amino acid + cysteine as the recurring building-block motif, is the right intellectual frame for any “why this approach” discussion, because it positions the project as exploiting a deeply general pattern in how nature builds luminogenic substrates rather than betting on firefly-specific accidents. Third, Tsarkova's institutional context (Shemyakin-Ovchinnikov / Pirogov RNRMU, the same network that delivered the Mitiouchkina 2020 autoluminescent plants) makes this review an implicit precedent paper: the Russian community has done this exact thing once before with the fungal pathway, the conceptual playbook is established, and the question is whether the firefly route can match or exceed the fungal one in plant context. Cite Tsarkova 2021 as the field-context opener, with Adams & Miller 2020 as the firefly-luciferase-specific deep dive, and Fallon 2018 / Zhang 2020 as the genomics anchors. The four-paper opening salvo of any project document should be: Tsarkova for context, Fallon for genetics, Adams & Miller for enzymology, Zhang for biochemistry, with everything else in the bibliography filling in the specifics.