same same, but different – a comparative genomics story

posted on september 11, 2024   by dr. hung nguyen

dr. hung nguyen takes us behind the scenes of their latest publication 'genomic investigations of diverse corbiculate bee gut-associated gilliamella reveal conserved pathways for energy metabolism, with diverse and variable energy sources' published in access microbiology.

blob.jpg
istock/inventori

hi there! my name is hung nguyen, and i am the founder and primary investigator of my independent research organization - project genomes to functional, ecological, and evolutionary characterizations (g2feec). i started the organization after completing my phd in 2022 and dedicated it to uncovering the treasure trove that is the vast amount of genomic data available.

corbiculate bees are important pollinators in agriculture and the wild (their name references the corbiculae, i.e. pollen baskets). their microbiota often comprises a few highly conserved phylotypes, especially among honeybees and bumblebees. these symbionts are crucial in mediating host health, including the degradation of consumed plant matter and derivatives, and antagonistic roles against bee pathogens. among these phylotypes is gilliamella, which often dominates the host ileum. much research has been done on this genus in vivo and in vitro, but in silico studies have been lacking. for many years, this has meant it was unclear if differential features were characterized. sometimes, it was not even possible to identify if a certain characteristic may be present or absent. this was particularly true for energy metabolism, whereby it was unclear what was the cause of the differential patterns of fermentation, and if gilliamella may even be capable of respiration.

yet energy metabolism is crucial to understanding an organism’s physiology. it determines everything from growth and reproduction rates to what environments an organism can inhabit. in total, i analysed 95 high-quality gilliamella genomes, the results of which answered many preexisting questions and comprised several new discoveries.

first and foremost were the questions surrounding the ability to respire and how it may vary across taxa. crucially, and as suspected, i uncovered that gilliamella is not genetically capable of fully aerobic respiration, which has important implications for their activity outside the host environment. at the same time, i identified the conserved capability for effective respiration under microoxic conditions, exactly that of their host ileum environment, and very likely explains why they are so often able to dominate the bee gut microbial community they are a part of. this is a highly important discovery, as it also led to the discovery that all gilliamella should be capable of ubiquinone biosynthesis, despite lacking ubic, which was once thought to be vital in the biosynthesis pathway. as the various omics fields continued to develop, we continuously found exceptions to what were once considered ‘rules,’ and this discovery is no different. it is exciting to consider how future studies will uncover just how gilliamella replaces the function of ubic, whether with a different enzyme or via an entirely unknown mechanism.

fermentation pathways were also highly conserved across the genomes examined, which is a very important finding as that meant previous differential patterns of fermentation identified were phenotypic, not genotypic. this has significant implications for both past and future studies. it is evidence that despite apparently identical growth conditions, there were still unaccounted factors in prior laboratory experiments, factors that should continue to be investigated or at least considered in future studies.

beyond being conserved, the core energy metabolism pathways are also highly ‘robust,’ often comprising many alternate pathways allowing for the interconversion of diverse substrates. a very exciting discovery was that such robust core energy metabolism pathways could also be found in the genomes of sister genera to the gilliamella from the same order, comprising members isolated from animal guts. this may mean such conserved energy metabolism features are pre-adaptations not only to the bee gut environment but also to that of other animals!

despite the conserveness observed, the specific energy sources that can be utilized were highly variable across species and sometimes strains. i found that the robustness of the core energy metabolism pathways may itself be driving this metabolism flexibility. due to the multiple alternate conversion pathways present, it would have only taken the acquisition of a few additional genes to confer the ability for a strain to uptake and digest new types of sugars. it was known that the diversity of energy sources likely reflects adaptations to specific host species and/or environments, but now we also know why and how such adaptations could arise.

all in all, we are no longer in the dark about the genomic underpinnings of gilliamella energy metabolism. it is very exciting to see how future research will build on my discoveries here, both to confirm my findings and to better design experiments in the future, now that we know what gilliamella should and should not be capable of when it comes to energy metabolism.

fun fact – despite the clear importance of this genus of bacteria, it was entirely by chance that they would be the focus of my research. in the beginning, i just randomly picked a high-quality genome, and it happened to be that of an undescribed gilliamella species. characterizing this species in depth has, and continues to be, my main project. characterizing the energy metabolism of the entire genus was, in fact, just a spin-off of the main project, which i deemed necessary when i realized it would lead to material and meaningful contributions to the field. to me, laboratory and computational research go hand-in-hand, and when the chance arose to answer the many questions posed by laboratory investigations, it was not an opportunity i could pass. i am excited that my research can fuel future laboratory research. i hope i can keep the cycle going as both sides of research continue to complement each other to greatly further our knowledge of the fascinating biology of life.

remarkably, my research was entirely self-funded, with minimal expenditure required. much of the analysis utilized publicly available, free tools and databases: the department of energy systems biology knowledgebase (kbase), gtdb, the integrated microbial genomes & microbiomes (img/m) database and web service, megax, ncbi. i spent very little setting up my organization and website, and only paid for interactive tree of life (itol), lucidchart, and metacyc subscriptions. i want to highlight that all of these expenditures were optional. the key message? if you, like me, want to carry out self-funded, independent research… maybe it is possible! consider your options. perhaps it is truly out of reach, and so it is what it is. or perhaps… with all the available public resources out there, you can do your own independent research too!