demystifying borrelia: novel computational methods to detect and identify ospa types
posted on june 19, 2024 by jonathan t lee, phd and li hao, phd
dr jonathan t lee and dr li hao take us behind the scenes of their latest publication ‘development of a sequence-based in silico ospa typing method for borrelia burgdorferi sensu lato’ published in microbial genomics.
lyme disease is a vector-borne infectious disease caused by the bacterium borrelia burgdorferi sensu lato (b. burgdorferi s.l., or burgdorferi). infection is transferred to humans through the bites of infected ixodes ticks. lyme disease is estimated to affect over 600,000 people annually in the united states and europe, with symptoms varying in severity between cases. early indicators of lyme disease include the characteristic erythema migrans (em) rash at the bite site, that can occur with non-specific symptoms such as fatigue, fever and headache. untreated early lyme disease can progress to more serious late-stage disease that impact the skin, joints (arthritis), heart (carditis) and nervous system (neuroborreliosis). symptoms have been shown to persist post-treatment in some individuals. several lyme borrelia species are documented as pathogenic to humans, with differences in distribution between north america, europe and asia (figure 1). with the exception of em, clinical manifestations are known to differ between genospecies and consequently by geographic area. for instance, in the us and canada, where b. burgdorferi sensu stricto predominates, lyme arthritis is more common. in europe, however, neuroborreliosis is more common due to the differential pathogenesis of b. garinii. in addition, borrelial lymphocytoma and acrodermatitis chronica atrophicans (aca) occur in europe due to prevalence of b. afzelii.
a potential vaccine against lyme disease
as the most common vector-borne disease in the northern hemisphere, there is an unmet medical need for a vaccine to protect against lyme disease. as of this post, there are currently no approved human vaccines for lyme disease. pfizer and valneva are collaborating on the development of vla15, a six-valent candidate vaccine designed to help protect against the dominant genospecies causing lyme disease in north america and europe. the vla15 vaccine candidate aims to provide protection from lyme disease by inhibiting borrelia outer surface protein a (ospa), a protein expressed on the bacterial cell surface while in the tick vector. the investigational vaccine follows a mechanism of action based on antibodies against ospa that are introduced into the tick during ingestion of the blood meal. binding to ospa on borrelia in the tick midgut that aims to prevent transmission and thus infection.
history of traditional borrelia serotyping
vla15 is a multivalent vaccine candidate designed to provide protection against the six most common borrelia ospa serotypes, spread across the four dominant genospecies in north america and europe: b. burgdorferi sensu stricto (ospa serotype 1), b. afzelii (ospa serotype 2), b. garinii ospa serotypes 3, 5, 6) and b. bavariensis (ospa serotype 4). these serotypes were originally classified in the mid-1990s using a panel of monoclonal antibodies (mabs). unfortunately, the mabs used to type these initial serotypes were not made widely available and new mabs were not elaborated to classify the large number of putative remaining and emerging ospa serotypes. over the last 20 years, an ad-hoc combination of amplicon sequencing and restriction typing of ospa has been implemented by investigators. however, while these methods can broadly differentiate serotypes, they are unable to identify sequence diversity within ospa. such information is important invaccine design in order to assign serotypes, assess variation within serotypes, and predict immune response to new ospa variants. to this end, our team at pfizer vaccine research and development turned to next-generation sequencing (ngs) to provide higher sequence resolution of the ospa antigen. in doing so, we defined a formal typing system that is broadly available and amenable to assignment of the full range of known ospa sequences with accommodation for potential new variants.
introducing an in silico typing method for borrelia surveillance
our group compiled a collection of over 400 sequenced borrelia genomes spanning 11 genospecies that account for the majority of lyme disease cases in north america, europe and asia. by leveraging phylogenetic analysis and sequence alignment, we observed clusters of ospa sequences that correlated with the known ospa serotypes. additionally, we identified novel phylogenetic clusters outside of these canonical serotypes, many of which consist of human disease-causing genospecies. based on this purely sequence-based classification method, we dubbed these groups ospa in silico types (ists). ists 1-8 correspond to the mabs-determined ospa serotypes 1-8, and de-novo ist assignments (ists 9-17) were given to previously unclassified types, such as the human pathogenic genospecies b. mayonii and b. spielmanii (figure 2). we further developed and released an open-source computational pipeline, the lyme ospa in silico typing tool (github.com/pfizer-opensource/listt), to provide an automated method to determine the ospa ist of a borrelia strain based on ngs of the ospa gene. this pipeline identifies both the in silico type and genospecies by performing sequence alignment and calculating amino acid identity to known ospa variants belonging to multiple serotypes. as this method only takes the sequence of the borrelia ospa gene into consideration, it is highly sensitive to b. burgdorferi s.l. and avoids false positives that could be caused by other pathogens that do not carry ospa. additionally, this means analysis can be performed in a background of host dna, making it an ideal method for clinical surveillance as no isolation or culturing of borrelia, a sometimes arduous task, is required.
we hope that our in silico typing method will provide a valuable new tool in the lyme disease field, both in terms of borrelia research and vaccine design. the ist scheme has already been implemented by the public database for microbial genome diversity, pubmlst (pubmlst.org), in order to track the in silico types of novel strains as they are sequenced and deposited by contributors worldwide. as more data is collected over time, this scheme can be adapted to incorporate novel ospa variants and ists and provide continued monitoring of borrelia diversity.