Time course profiling of host cell response to herpesvirus infection using nanopore and synthetic long-read transcriptome sequencing

Maróti, Zoltán; Tombácz, Dóra; Moldován, Norbert; Torma, Gábor; Jefferson, Victoria A.; Csabai, Zsolt; Gulyás, Gábor; Dörmő, Ákos; Boldogkői, Miklós; Kalmár, Tibor; Meyer, Florencia; Boldogkői, Zsolt

doi:10.1038/s41598-021-93142-7

Download PDF

Article
Open access
Published: 09 July 2021

Time course profiling of host cell response to herpesvirus infection using nanopore and synthetic long-read transcriptome sequencing

Zoltán Maróti¹^na1,
Dóra Tombácz^2,3^na1,
Norbert Moldován²^na1,
Gábor Torma²,
Victoria A. Jefferson⁴,
Zsolt Csabai²,
Gábor Gulyás²,
Ákos Dörmő²,
Miklós Boldogkői²,
Tibor Kalmár¹,
Florencia Meyer⁴ &
…
Zsolt Boldogkői²

Scientific Reports volume 11, Article number: 14219 (2021) Cite this article

2287 Accesses
3 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Third-generation sequencing is able to read full-length transcripts and thus to efficiently identify RNA molecules and transcript isoforms, including transcript length and splice isoforms. In this study, we report the time-course profiling of the effect of bovine alphaherpesvirus type 1 on the gene expression of bovine epithelial cells using direct cDNA sequencing carried out on MinION device of Oxford Nanopore Technologies. These investigations revealed a substantial up- and down-regulatory effect of the virus on several gene networks of the host cells, including those that are associated with antiviral response, as well as with viral transcription and translation. Additionally, we report a large number of novel bovine transcript isoforms identified by nanopore and synthetic long-read sequencing. This study demonstrates that viral infection causes differential expression of host transcript isoforms. We could not detect an increased rate of transcriptional readthroughs as described in another alphaherpesvirus. According to our knowledge, this is the first report on the use of LoopSeq for the analysis of eukaryotic transcriptomes. This is also the first report on the application of nanopore sequencing for the kinetic characterization of cellular transcriptomes. This study also demonstrates the utility of nanopore sequencing for the characterization of dynamic transcriptomes in any organisms.

Time-course profiling of bovine alphaherpesvirus 1.1 transcriptome using multiplatform sequencing

Article Open access 24 November 2020

Norbert Moldován, Gábor Torma, … Zsolt Boldogkői

In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Article Open access 09 May 2023

Balázs Kakuk, Ákos Dörmő, … Zsolt Boldogkői

Combined nanopore and single-molecule real-time sequencing survey of human betaherpesvirus 5 transcriptome

Article Open access 14 July 2021

Balázs Kakuk, Dóra Tombácz, … Zsolt Boldogkői

Introduction

Bovine alphaherpesvirus type 1 (BoHV-1) is a large DNA virus belonging to the Alphaherpesvirinae subfamily. This virus infects cattle and causes the disease commonly known as bovine respiratory disease, which leads to severe economic losses annually worldwide¹. Like other alphaherpesviruses, such as herpes simplex virus type 1 (HSV-1), or pseudorabies virus (PRV), BoHV-1 also enters a latent state most commonly in the trigeminal ganglia following primary infection². From this state, the virus can be reactivated by various types of stress and can re-establish an acute infection³.

Short-read sequencing (SRS) technology has expanded the frontiers of genomic and transcriptomic research due to its capacity to collect vast quantities of sequencing data at a relatively low cost. However, the past decade has witnessed incredible advances in long-read sequencing (LRS) technology. Besides the Pacific Biosciences and Oxford Nanopore Technologies platforms, Loop Genomics has recently also developed an LRS technique based on single molecule synthetic long-read sequencing (LoopSeq). LRS approaches present a strategy that is able to elude the limitations of SRS, including its ineffectiveness in the identification of transcript isoforms and in distinguishing overlapping RNA molecules. Recently, LRS techniques have been widely applied for the transcriptome analysis of a variety of organisms^4,5,6,7,8, including herpesviruses^9,10,11,12. These studies have uncovered a far more complex transcriptional landscape of the examined species than previously thought. Genome-wide sequencing assays have annotated the global transcriptome of BoHV-1¹³, including microRNAs¹⁴. The effect of herpesvirus infection on host cell transcription using SRS (Illumina HiSeq) has been characterized by¹⁵. In this paper, the authors described alternative splicing and polyadenylation in human skin fibroblast cells due to the infection by HSV-1.

In this work, we carried out a time-lapse assay for the examination of the effect of BoHV-1 infection on host [bovine (Bos taurus)] cell gene expression. The transcriptome analysis was performed using MinION sequencing from Oxford Nanopore Technologies (ONT) and Illumina-based LoopSeq from Loop Genomics.

Results

Annotation of Bos taurus transcripts

In this work, we applied the following techniques for the analysis of bovine transcriptome: (1) direct cDNA sequencing (dcDNA-Seq) based on oligo(dT)-primed reverse transcription (RT), (2) amplified cDNA sequencing based on random-oligonucleotide-primed RT using nanopore sequencing on ONT MinION platform, as well as (3) synthetic long-read sequencing (LoopSeq) on Illumina platform. All of the three techniques were used for bovine transcript annotation, whereas dcDNA-Seq was used for the time-varying analysis of the effect of BoHV-1 on host cell gene expression. For transcript detection and annotation, mapped reads were analyzed using the LoRTIA software suite developed in our laboratory (https://github.com/zsolt-balazs/LoRTIA).

For the annotation of introns, transcription start sites (TSSs), and transcription end sites (TESs), we set the criterion that these sequences have to be identified by the LoRTIA suit in at least two independent bovine cell samples. With this restriction, we identified altogether 11,025 TSSs, 21,317 TESs and 139,771 introns (Supplementary Table S1). Additionally, LoRTIA produced a total of 227,672 bovine transcripts (Supplementary Data Item 1). The median length of these transcripts was 1678 nt (σ = 2386.5).

Three biological replicates were prepared for each time-point in dcDNA sequencing used for the time-lapse experiment. Seven time points post infection (p.i.) and a mock-infected sample was used in each replicate for this part of the analysis (Supplementary Fig. S1).

We identified consensus TATA boxes at a mean distance of 31.15 nt (σ = 2.96) upstream of bovine TSSs. The polyadenylation signals (PASs) were located at a mean distance of 25.35 nt (σ = 8.26) upstream of the host TES. Our data show that viral infection does not induce significant changes in the distance between promoters and TSSs as well as between PASs and TESs (Fig. 1a, b). No significant modification was found in the sequence of the ± 5 nt surrounding region of the TSS and the ± 50 nt surrounding region of bovine gene TESs during the infection (Fig. 1c, d).

To assess changes in splicing, and the usage of TSSs and TESs of the host cell during BoHV-1 infection, we evaluated transcripts represented by more than ten reads in the infected samples (n = 69,726) reported by LoRTIA. We detected altogether 130 alternatively spliced transcripts (Fig. 2a).

FOS, an immediate responder of the stress signaling pathway, is quickly degraded if its third intron is retained¹⁶. We detected a non-spliced variant of FOS in very low abundance and additional splice variants of the transcript lacking the above-mentioned exon, which were present starting from the first hour of the infection (Fig. 2b). This confirms previous reports on the presence of FOS in the early stages of viral infections^15,17.

The 3′-UTRs of genes often contain miRNA targets, contributing to mRNA degradation. Thus, shorter 3′-UTR length can lead to increased transcript stability¹⁸, whereas longer 3′-UTRs can be targeted by several miRNAs and other trans-acting elements thereby generating distinct regulation patterns¹⁹. We detected 72 transcripts with TESs located further downstream and 122 transcripts with TESs located more upstream compared to transcripts in mock samples. Superoxide dismutase 1 (SOD1) confers protection against oxidative damage²⁰, including that induced by the IFN-I signaling²¹. A 3′-UTR isoform of SOD1 detected in infected cells was shorter than that of found in the mock sample (Fig. 2c).

A previous work reported the disruption of transcript termination in the host caused by HSV-1 infection, resulting in extensive transcriptional overlaps between adjacent gene products²². According to our results, the length of polyadenylated transcripts has significantly changed starting from the second hour of the infection. The most pronounced increase in transcript length can be seen at 2 h p.i. (${\bar{\text{x}}}$ change of 134bps, p < 0.05), while the greatest decrease was detected at 12 h p.i. (${\bar{\text{x}}}$ change of − 193 bps, p < 0.05) compared to the mock samples (${\bar{\text{x}}}$ = 1365 bps). At the interim time points the median transcript length exhibited a fluctuating pattern (at 4 h a ${\bar{\text{x}}}$ change of 53 bps, at 6 h a ${\bar{\text{x}}}$ change of − 27 bps, while at 8 h a ${\bar{\text{x}}}$ change of 44 bps) (Fig. 3). In order to investigate whether disruption of transcript termination also occurs in BoHV-1-infected bovine cells and results non-polyadenylated transcripts, we carried out ONT sequencing based on random oligonucleotide-primed RT, and the obtained dataset was used for the analysis of transcription activity at the intergenic regions. Despite this library yielding a comparable measure of reads mapping to Bos taurus (n = 2,222,987), we were unable to detect any substantial amount of fragments mapping to the intergenic regions. Using LRS, we were able to differentiate between TSS isoforms. We detected 80 transcripts with upstream and 142 with downstream TSSs.

Overall host cell gene expression during the 12 h of virus infection

This study investigated the effect of viral infection on the cultured bovine cells by a time-course transcriptome analysis using ONT LRS analysis. We carried out direct cDNA sequencing using three biological replicates in each of the six time-points (1 h, 2 h, 4 h, 6 h, 8 h, 12 h) and in the mock-infected sample. We identified a total of 8342 host genes that produced more than ten transcripts in each of the three biological replicates. Applying differential expression (DE) analysis with a 0.01 false discovery rate (FDR) threshold, we identified 686 genes among the 8342 host genes that exhibited significantly altered expression levels during the course of virus infection. Genes were clustered by their expression profile and not by their absolute expression levels. In this part of the analysis, we transformed the time series of expression levels to a relative scale representing the expression changes between sampling points. This allowed to cluster the genes by their expression profiles during the course of viral transfection instead of their absolute abundance. We identified six clusters of genes with distinctive expression profiles (Fig. 4a, b and Supplementary Table S2). By analyzing mean expression profiles of gene clusters, we identified four groups of genes (clusters 2–5) that were constantly upregulated, a single group of genes where expression levels were steadily downregulated throughout the entire period of virus infection (cluster 6), and finally, one group that showed initial upregulation followed by downregulation (cluster 1).

We performed an over-representation analysis using the 8342 genes as reference with the PANTHER software tool. We summarized the results of this analysis using GO (Gene Ontology) biological processes and GO molecular functions annotation datasets in Supplementary Table S2 (an FDR < 0.05 was used). Over-represented genes were categorized into six functional groups according to the GO database (Fig. 4c) as follows: 296 genes play a role in cellular metabolism, 257 are involved in transcription and RNA decay, 242 in developmental and morphogenetic processes, 187 in immune response and host defense, 161 in translation and protein folding, whereas 61 genes are specifically associated with in viral transcription related processes.

Genes of the first cluster (n = 53) had medium expression preceding the infection (which was transiently slightly upregulated at the 1 h, and 2 h p.i. time points) followed by downregulation at later measurements. Genes in this cluster were over-represented in pathways controlling a wide variety of developmental and morphogenetic processes. Several genes coding transcription regulatory proteins present in this cluster show diminishing expression throughout the infection. Genes involved in the cytokine regulation of the immune response and inflammatory processes are also affected. The second and third cluster of genes (n = 64, n = 82 respectively) had medium expression preceding the infection that rose at each consecutive time points. The genes of these clusters were over-represented in functions and molecular processes that can be associated with viral gene expression and the virion assembly. An upregulation of genes involved in transcriptional and translational processes, as well as RNA decay was also observed. RNA decay can be an immediate response of the host cell to counteract the accumulation of viral transcripts, or it may be an effort of the virus to eliminate competing host mRNAs in order to facilitate the translation of viral transcripts^24,25. Some of the over represented genes in these cluster are the members of GO molecular functions that have overlapping sets of genes. For example, the 12 genes (RPS26, RPL5, RPL30, RPS29, RPL31, RPS6, RPL36, RPL37, RPL8, RPS10, RPS21, RPL19) that were significantly upregulated during infection are the members of both the “viral transcription” and the “SRP-dependent co-translational protein targeting to membrane” pathways. Many of these genes are also members of the “nuclear-transcribed mRNA catabolic process, nonsense-mediated decay” pathway.

Genes in the fourth cluster (n = 64) had low relative expression preceding the infection. These genes were upregulated following a sigmoid curve during the infection. Genes in the fourth cluster were not significantly over-represented in any of the GO molecular functions or GO biological processes. The fifth cluster of genes (n = 88) had zero or negligible expression preceding infection but showed an exponential increase in expression during the course of infection. The over-represented genes of this cluster were associated with anti-viral cellular and defense responses such as the type I interferon signaling pathways. The sixth cluster of genes (n = 335) consisted of a huge variety of host genes with high expression preceding viral infection that showed sharp downregulation during the infection. These genes were over-represented in pathways associated with protein folding, cell cycle regulation and mitochondrial processes including aerobic respiration.

Key response host genes

We performed DE analysis (with FDR = 0.01) on the Mock and 1-h expression values to describe the immediate response of host cells. We identified 6 bovine genes that were significantly down regulated and 19 genes that were significantly up regulated in the three biological replicates (Table 1). Over expression analysis revealed no significant association to either of the GO biological processes or GO molecular functions using the subset of up and down regulated genes or the whole set of genes. However, STRING association analysis revealed 4 networks between these genes. The first gene network (GADD45B, GADD45A, DDIT3, ATF3, IFRD1, CARM1, SQSTM1) contains genes that are associated to host DNA damage response, transcription regulation. Furthermore, one gene plays a role in selective automacrophagy. The second network consists of two of the interferon gamma stimulated genes; IRF9, a transcription factor that plays an essential role in anti-viral activity and MT2A, a metallothionein protein. The third network consists of two genes (SRSF5 and HNRNPDL) associated with pre-mRNA processing, transport and splicing regulation. The cytokine IL11which regulates hematopoietic cells was part of the fourth network. We found IL11 to be down-regulated. In contrast, CXCL5, a gene associated with neutrophil activation and also present in network 4, was up-regulated following virus infection. The remaining four (LASP1, HDAC7, SLC44A2 HSPG2) out of 6 down- regulated genes and eight (ID2, HMGN3, TMEM190, TSC22D1, PRKAR2B, LOC100847759, LOC100847143, LOC100174924) out of 19 up-regulated genes include signaling, transcriptional regulator, developmental genes.

Table 1 Up- and downregulated bovine genes in response of viral infection.

Full size table

In this part of the study, we validated the differential expression of 10 host genes as a result of virus infection using real-time RT-PCR. The expression of the given genes was measured in two different cell populations: mock-infected and infected cells which produced larger up- or downregulation for the particular gene. The differences in transcripts levels between the infected and mock-infected samples are shown in Supplementary Fig. S2. The qPCR data validate our findings that we reached using an LRS approach in each examined gene. We found the expression of the genes ARC, DDIT3, FAM102A, GADD45G, MYLIP and TSC22D1 to be increased in virus-infected cells, while expression levels of the genes ANXA1, C15H11orf74, LOC112446408 and SLC44A2 were decreased compared to mock-infected cells. R values shown in Table 2 indicate the means of the three replicates.

Table 2 Real-time RT-PCR analysis of 10 Bos taurus transcripts.

Full size table

The results obtained in the analysis of host gene expression were also validated using an in silico approach. We selected 30 housekeeping genes exhibiting highly stable expression in different tissues²⁶. Although the abundance of these genes varies, it is expected that, in the normalized data, their expression levels will have very high and significant correlation between biological replicates and the samples at different time-points. We calculated the correlation coefficient (R) and the significance between each biological replicate and measurement point (Supplementary Table S3; Supplementary Fig. S3). The fact that the transcript ratios of housekeeping genes are correlated with a high significance represents an internal validation of the reliability of our sequencing technique and data normalization protocol.

Discussion

High-throughput long-read sequencing approaches are able to read full-length transcripts, and therefore allow a more comprehensive annotation of RNA molecules. LRS-based studies led to the discovery that the transcriptomes are much more complex than previously thought.

In this study, we annotated a large number of bovine transcripts and analyzed the effect of viral infection upon host gene expression. We found no significant change in the usage of promoters or PASs of the host genes. However, we observed an altered usage of transcript length and splice isoforms of the host RNA molecules. This indicates a modulation of cellular mRNA turnover. The analysis of TSS isoforms suggests that viral infection may have an effect on host mRNA translation, potentially through uORFs²⁷, or through other cis-acting elements, such as miRNA binding sites of 5’-UTRs. However, downstream TSSs can also result in truncated in-frame ORFs, which might code for N-terminally truncated polypeptides^28,29. Unlike in HSV-1-infected cells²², we found no increase in the extent of transcriptional readthroughs in BoHV-1-infected-cells.

Based on the alteration of expression kinetics, we detected six distinct gene clusters that had significantly changed expression during the course of virus infection. Based on the overrepresentation analysis of these clusters we distinguished three functional groups. Genes involved in basic cell functions, including morphogenesis, cell cycle regulation, signaling, catabolic pathways and aerobe respiration, are generally downregulated during viral infection. On the other hand, we observed a considerable upregulation of genes involved in antiviral response. Additionally, genes playing a role in transcription, RNA decay, translation and protein folding were also upregulated. Our analysis shows that most of these genes are associated to distinct molecular functions and biological processes indicating general response to virus infection. However, the rest of the unassociated genes could also be associated with either susceptibility to or defense against viral infection. We also identified a small set of immediate response genes that exhibited significantly altered expression 1 h after viral infection.

Altogether, our data provides valuable resources for future functional studies and for understanding how the virus can overcome host defense mechanisms. Furthermore, these results may be helpful for the development of novel antiviral therapies.

Materials and methods

Cells and viruses

Madin Darby Bovine Kidney (MDBK, purchased from American Type Culture Collection) cells were infected with the Cooper isolate of Bovine Herpesvirus 1.1. (GenBank Accession JX898220.1). Cells were incubated at 37 °C in a humidified incubator with 5% CO₂, and were cultured with Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 5% (v/v) fetal bovine serum, 100 µg/mL streptomycin and 100 U/mL penicillin. Cells were either mock-infected or infected with Cooper isolate of BoHV-1 at a multiplicity of infection (MOI) of 5 plaque-forming units (PFU)/cell, incubated at 4 °C for one hour for synchronization of infection, and then placed in an incubator at 37 °C and 5% CO₂. Infected cells were collected at 1, 2, 4, 6, 8, and 12 h post infection (HPI). Each time-point and mock infection consisted of three replicates (n = 3). Cells were washed with phosphate buffered saline (PBS), scraped from the culture plate and centrifuged at 300 RPM for 5 min at 4 °C.

RNA isolation

RNA from infected and uninfected cells (MDBK cells) was extracted using the NucleoSpin RNA kit (Machery-Nagel, Bethlehem, PA, USA), with the lysis step augmented by the addition of proteinase K (final concentration 0.37 mg/mL).

Poly(A) RNA selection and rRNA depletion

For the analysis of the polyadenylated RNAs, the RNA fraction was enriched using Oligotex mRNA Mini Kit (Qiagen). To obtain potential non-polyadenylated transcripts, rRNA depletion was performed using Ribo-Zero Magnetic Kit H/M/R (Epicentre/Illumina).

ONT non-amplified cDNA sequencing

Direct cDNA libraries were prepared from the mock and six BoHV-1 p.i samples in three replicates using the ONT’s Direct cDNA Sequencing Kit (SQK-DCS109) according to the manufacturer’s instructions. The first cDNA strand synthesis was performed using Maxima H Minus Reverse Transcriptase (Thermo Fisher Scientific) with SSP and VN primers (supplied in the kit) and 100 ng of poly(A) + RNA for each sample. This was followed by the removal of potential RNA contamination using RNase Cocktail Enzyme Mix (Thermo Fisher Scientific), and second strand synthesis using LongAmp Taq Master Mix (New England Biolabs). Double stranded cDNA ends were repaired using NEBNext End repair /dA-tailing Module (New England Biolabs). This was followed by ligation of sequencing adapter employing the NEB Blunt /TA Ligase Master Mix (New England Biolabs). Libraries were barcoded using Native Barcoding (12) Kit (ONT) according to the manufacturer’s instructions.

RT with oligo(dT) primers

50 ng of poly(A) + RNA was reverse transcribed using SuperScript IV Reverse Transcriptase and oligo(dT) primers (supplied in the kit). The cDNA samples were subjected to PCR using KAPA HiFi DNA Polymerase (Kapa Biosystems) and Ligation Sequencing Kit Primer Mix. End repair and sequencing adapter ligation were carried out as described for the dcDNA-Seq library preparation.

RT with random oligonucleotide primers

Fifty ng of ribodepleted RNA was reverse transcribed using SuperScript IV Reverse Transcriptase. Additionally, we used a custom-made primer mix composed of a random hexamer sequence and another which is complementary to the Ligation Sequencing Kit Primer (supplied in the kit). PCR, end repair and sequencing adapter ligation were identical to the oligo(dT)-primed RT library. The obtained libraries were barcoded using the 1D PCR Barcoding (96) Kit from Oxford Nanopore Technologies, according to the manufacturer’s instructions. The random oligonucleotide-based cDNA sequencing was primarily used for the identification TESs of the host transcripts.

LoopSeq single-molecule synthetic long-read sequencing

LoopSeq libraries were prepared from multiplexed 2 h and 12 h post-infection samples in three replicates using the LoopSeqTM Transcriptome 3 × 8-plex Kit. Phasing mRNA protocol was performed as recommended by the manufacturer.

Purification of libraries

Libraries were purified after each enzymatic step using Agencourt AMPure XP magnetic beads or in the case of dRNA-Seq, RNAClean XP beads (both from Beckman Coulter). Qubit RNA BR and HS Assay Kits and Qubit DNA HS Assay Kit (Thermo Fisher Scientific) were used to measure the total RNA, poly(A) + RNA, and cDNA concentrations, respectively.

Sequencers

Sequencing of the ONT dcDNA libraries was performed on R9.4.1 SpotON Flow Cells (ONT). To avoid barcode cross-talk from later time points, mock-infected, 1 h and 2 h p.i. samples were sequenced separately from other samples. The LoopSeq library was sequenced on a v2 300 flow cell on the Illumina MiSeq system.

Pre-processing and data analysis

The MinION data was basecalled using the Guppy basecaller v. 3.4.1. with --qscore_filtering. Reads with a Q-score larger than 7 were mapped to the Bos taurus GCF_002263795.1_ARS-UCD1.2 reference genomes using the “-ax splice -Y -C5” options in the minimap2 software³⁰ (detailed mapping statistics: Supplementary Table S4). LoopSeq short read data were assembled into long reads using the manufacturer’s web service (https://analysis.loopgenomics.com/accounts/login/), and mapped to the host genome using Minimap2. TSSs, TESs and introns were annotated using our LoRTIA software suite (https://github.com/zsolt-balazs/LoRTIA). Transcripts represented by less than 10 reads were excluded from further analysis. We compared the genomic position of TSSs, TESs and splice junctions of transcripts annotated from the infected samples, to those of the mock control samples. A TSS or a TES was considered “downstream” or “upstream” of the control TSS or TES, if it was located more than 10 bp from the control position, whereas “unaffected”, if it was within 10 bp from the control. If a transcript had splice junctions located at different position than the splice junction in the control, the transcript was considered “alternatively spliced”. If the given transcript was not present in the mock infected samples, but was found in the infected samples the transcript was named “not in the control”.

The sequences of the TSSs and TESs and their neighboring sequences were extracted with our in-house script (https://github.com/moldovannorbert/seqtools), using the previously annotated positions and the reference genome as source. To assess the change in transcript lengths, we first log₁₀-transformed the data and tested for variation using the Kruskal–Wallis test. To detect those time points where transcript lengths significantly differed from the those in the mock sample, pairwise two-sided Mann–Whitney U tests were performed between the mock and the infected samples followed by multiple test correction using Bonferroni’s method.

Analysis of host cell gene expression

In order to assess the effect of the infection on host gene expression, we used the cfDNA-seq results. We excluded MAPQ = 0, secondary and supplementary alignments from all downstream analysis. The reads aligned to the host genome were associated to host genes according to the GCF_002263795.1_ARS-UCD1.2_genomic.gff genome coordinates. Only reads matching the exon structure of the host reference genes (using a + /− 5 base pair window for matching exon start and end positions) were counted. The three biological replicates were grouped in EdgeR³¹ (DGEList function) by the corresponding time points. We normalized the date (calcNormFactors function) with the method = ”TMM” options and we used the robust = True option in the downstream analysis (estimateDisp, glmQLFit, and glmQLFTest functions). Since we had mock, 1 h, 2 h, 4 h, 6 h, 8 h, 12 h measurements, in our model, we tested for DE against mock expression for each time point using data from three biological replicates. To detect genes with significantly changed expression levels (decideTests function), we applied a 0.01 false discovery rate (FDR) threshold, with p-values adjusted by the Benjamini & Hochberg procedure (p.adjust function with method = ”BH” option).

Medians of normalized pseudo-counts of DE genes were exported from edgeR³¹ (Table 1). Gene expression levels were normalized to maximal expression levels and were then compared to each other by cluster analysis to reveal which genes had similar expression kinetics during viral infection. Genes were clustered using the amap_0.8-16 R package Kmeans function with the Euclidean distance method. Based on the Calinski criteria, our dataset had an optimal cluster number of 6. The six clusters of genes were visualized via ggplot2 with the geom_smooth function using the median of the relative gene expressions of genes for each time points in each of the identified gene clusters. Using the identified subset of genes, we also performed overrepresentation analysis for each cluster using the number of expressed genes as reference via the PANTHER (version 14.1 using the 2018_04 dataset release)³² software tool. We summarized the results of our over-representation analysis (FDR < 0.05) using the Gene Ontology (GO) biological processes and GO molecular functions annotation datasets.

Schematic representation of the workflow is shown in Supplementary Fig. S1.

Primer design

To validate the effect of viral infection on host gene expression, reverse transcription-based quantitative real-time PCR (qRT-PCR) was used. The gene specific primers of the 10 Bos taurus genes which had altered expression levels due to BoHV-1 infection were designed using the PrimerQuest™ Tool (Integrated DNA Technologies, IDT) (Supplementary Table S5). The latest GenBank assembly (GCA_002263795.2) was used as a reference sequence. Primers were purchased from IDT.

Reverse transcription

RNA samples were used to produce single stranded cDNAs with gene specific primers. Reverse transcription (RT) reactions were performed using 6 ng of RNA and gene-specific primers (0.1 µM final concentration) with SuperScript IV RT enzyme (Invitrogen). The RT reactions were carried out according to the SuperScript IV manual: briefly, the RNA, primer and dNTP containing mixtures were denatured at 65 °C for 5 min, cooled down to 4 °C, then the buffer, the DTT and the RT enzyme were added. RNaseOUT (Invitrogen) was used to avoid RNA degradation. The samples were incubated at 50 °C for 10 min. The reactions were stopped by raising the temperature to 80 °C for 10 min. First-strand cDNAs were diluted tenfold with UltraPure™ DNase/RNase-Free Distilled Water (Invitrogen), then subjected to real-time PCR analysis.

Quantitative PCR

qPCR experiments were performed using a Rotor-Gene Q cycler (Qiagen). Reactions were carried out in 20 μl reaction mixtures containing 7 μl of cDNAs, 10 μl of ABsolute QPCR SYBR Green Mix (Thermo Scientific) and 3 μl primer mixture (1 μl of forward and 1 μl of reverse primers, 50 nM final concentration, each), as was previously published³³. The running conditions were as follows: (1) 15 min at 95 °C, followed by 35 cycles of 94 °C for 25 s (denaturation), 60 °C for 25 s (annealing), and 72 °C for 6 s (extension). For those experiments where primer dimer formation was detected, we applied an extra extension step in every PCR cycle with an elevated temperature (between the Tm of the specific product and the Tm of the primer dimers) for 2 s for detection (details are summarized in Supplementary Table S6). The 28S rRNA was used as loading control (reference gene).

Calculation of relative expression ratios

Relative expression values were calculated according to the following equation, as published in our previous articles³³:

$$ R = \frac{{E^{{Ct}} gene_{i} /E^{{Ct}} gene_{m} }}{{E^{{Ct}} ref_{i} /E^{{Ct}} ref_{m} }}, $$

where R is the relative expression ratio; E is the efficiency of amplification; Ct is the cycle threshold value; gene refers to any particular gene at the most down- or upregulated time points in virus-infected cells (i) compared to mock-infected (m) samples; and ref is the 28S housekeeping gene, which was used as a reference gene. Average Ct values with their standard deviance (SD) values, amplification efficiencies with SDs and the examined time points are shown in Supplementary Table S6. The relative copy numbers of mRNAs were calculated by normalizing cDNAs to 28S rRNA gene using the Comparative Quantitation module of the Rotor-Gene Q software (Version 2.3.5, Qiagen), which automatically calculates the qPCR amplification efficiency and the take-off points sample-by-sample. Thresholds were set automatically by the Rotor-Gene software. For each gene, 3 replicates were carried out for statistical confidence, and the median of these values along with the standard deviances was calculated.

Data availability

The sequencing datasets generated during this study are available at the European Nucleotide Archive’s SRA database under the accession PRJEB33511 (https://www.ebi.ac.uk/ena/browser/view/PRJEB33511).

Code availability

The LoRTIA software suite is available on GitHub: https://github.com/zsolt-balazs/LoRTIA. Other scripts used: https://github.com/moldovannorbert/seqtools.

References

van Oirschot, J. T. Bovine herpesvirus 1 in semen of bulls and the risk of transmission: A brief review. Vet. Q. 17, 29–33 (1995).
Article PubMed Google Scholar
Jones, C. Alphaherpesvirus latency: Its role in disease and survival of the virus in nature. Adv. Virus Res. 51, 81–133 (1998).
Article CAS PubMed Google Scholar
Jones, C. Herpes simplex virus type 1 and bovine herpesvirus 1 latency. Clin. Microbiol. Rev. 16(1), 79–95. https://doi.org/10.1128/CMR.16.1.79-95.2003 (2003).
Article CAS PubMed PubMed Central Google Scholar
Byrne, A. et al. Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells. Nat. Commun. 8, 16027 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, S.-Y., Deng, F., Jia, X., Li, C. & Lai, S.-J. A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing. Sci. Rep. 7, 7648 (2017).
Article ADS PubMed PubMed Central Google Scholar
Tombácz, D. et al. Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. Gigascience 7, giy139 (2018).
Article PubMed Central Google Scholar
Zhao, L. et al. Analysis of transcriptome and epitranscriptome in plants using PacBio Iso-seq and nanopore-based direct RNA sequencing. Front. Genet. 10, 253 (2019).
Article CAS PubMed PubMed Central Google Scholar
Boldogkői, Z., Moldován, N., Balázs, Z., Snyder, M. & Tombácz, D. Long-read sequencing—A powerful tool in viral transcriptome research. Trends Microbiol. 27, 578–592 (2019).
Article PubMed Google Scholar
Moldován, N. et al. Multi-platform sequencing approach reveals a novel transcriptome profile in pseudorabies virus. Front. Microbiol. 8, 1–13 (2018).
Article Google Scholar
Balázs, Z. et al. Long-read sequencing of human cytomegalovirus transcriptome reveals RNA isoforms carrying distinct coding potentials. Sci. Rep. 7, 1–9 (2017).
Article Google Scholar
Tombácz, D. et al. Transcriptome-wide survey of pseudorabies virus using next- and third-generation sequencing platforms. Sci. Data 5, 180119 (2018).
Article PubMed PubMed Central Google Scholar
Prazsák, I. et al. Long-read sequencing uncovers a complex transcriptome topology in varicella zoster virus. BMC Genom. 19, 873 (2018).
Article Google Scholar
Moldován, N. et al. Time-course profiling of bovine alphaherpesvirus 1.1 transcriptome using multiplatform sequencing. Sci. Rep. 10, 1–14 (2020).
Article Google Scholar
Glazov, E. A. et al. Characterization of microRNAs encoded by the bovine herpesvirus 1 genome. J. Gen. Virol. 91, 32–41 (2010).
Article CAS PubMed Google Scholar
Hu, B. et al. Cellular responses to HSV-1 infection are linked to specific types of alterations in the host transcriptome. Sci. Rep. 6, 1–14 (2016).
Google Scholar
Jurado, J., Fuentes-Almagro, C. A., Prieto-Álamo, M. J. & Pueyo, C. Alternative splicing of c-fos pre-mRNA: Contribution of the rates of synthesis and degradation to the copy number of each transcript isoform and detection of a truncated c-Fos immunoreactive species. BMC Mol. Biol. 8, 1–13 (2007).
Article Google Scholar
Rubio, N. & Martin-Clemente, B. Theiler’s murine encephalomyelitis virus infection induces early expression of c-fos in astrocytes. Virology 258, 21–29 (1999).
Article CAS PubMed Google Scholar
Mayr, C. & Bartel, D. P. Widespread shortening of 3′UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell 138, 673–684 (2009).
Article CAS PubMed PubMed Central Google Scholar
Pereira, L. A., Munita, R., González, M. P. & Andrés, M. E. Long 3’UTR of Nurr1 mRNAs is targeted by miRNAs in mesencephalic dopamine neurons. PLoS ONE 12, 1–15 (2017).
Article Google Scholar
Miao, L. & Clair, D. K. S. Regulation of superoxide dismutase genes: Implications in disease. Free Radic. Biol. Med. 47, 344–356 (2009).
Article CAS PubMed PubMed Central Google Scholar
Bhattacharya, A. et al. Superoxide dismutase 1 protects hepatocytes from type i interferon-driven oxidative damage. Immunity 43, 974–986 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rutkowski, A. J. et al. Widespread disruption of host transcription termination in HSV-1 infection. Nat. Commun. 6, 1–15 (2015).
Article Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer-Verlag, 2016).
Book MATH Google Scholar
Smiley, J. R. Herpes simplex virus virion host shutoff protein: Immune evasion mediated by a viral RNase?. J. Virol. 78, 1063–1068 (2004).
Article CAS PubMed PubMed Central Google Scholar
Moon, S. L. & Wilusz, J. Cytoplasmic viruses: Rage against the (cellular RNA decay) machine. PLoS Pathog. 9, 1–3 (2013).
Article Google Scholar
Eisenberg, E. & Levanon, E. Y. Human housekeeping genes, revisited. Trends Genet. 29, 569–574 (2013).
Article CAS PubMed Google Scholar
Kronstad, L. M., Brulois, K. F., Jung, J. U. & Glaunsinger, B. A. Dual short upstream open reading frames control translation of a herpesviral polycistronic mRNA. PLoS Pathog. 9, e1003156 (2013).
Article CAS PubMed PubMed Central Google Scholar
Tombácz, D. et al. Multiple long-read sequencing survey of herpes simplex virus dynamic transcriptome. Front. Genet. 10, 1–20 (2019).
Article Google Scholar
Crofts, L. A., Hancock, M. S., Morrison, N. A. & Eisman, J. A. Multiple promoters direct the tissue-specific expression of novel N-terminal variant human vitamin D receptor gene transcripts. Proc. Natl. Acad. Sci. U. S. A. 95, 10529–10534 (1998).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
McCarthy, D. J., Chen, Y. & Smyth, G. K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucl. Acids Res. 40, 4288–4297 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mi, H., Muruganujan, A., Casagrande, J. T. & Thomas, P. D. Large-scale gene function analysis with the panther classification system. Nat. Protoc. 8, 1551–1566 (2013).
Article PubMed PubMed Central Google Scholar
Tombácz, D., Tóth, J. S., Petrovszki, P. & Boldogkői, Z. Whole-genome analysis of pseudorabies virus gene expression by real-time quantitative RT-PCR assay. BMC Genom. 10, 491 (2009).
Article Google Scholar

Download references

Funding

This study was supported by OTKA K 128247 Granted to Z.B., by the OTKA FK 128252 and by the Lendület (Momentum) I Program of the Hungarian Academy of Sciences (LP-2020/8) Granted to D.T. The Project was also supported by USDA National Institute of Food and Agriculture, Agriculture and Food Research Initiative Competitive Grant 2020-67016-31345 to F.M. The APC was covered by the University of Szeged Open Access fund 5240. The funding body had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Author information

These authors contributed equally: Zoltán Maróti, Dóra Tombácz and Norbert Moldován.

Authors and Affiliations

Department of Pediatrics, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., Szeged, 6720, Hungary
Zoltán Maróti & Tibor Kalmár
Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., Szeged, 6720, Hungary
Dóra Tombácz, Norbert Moldován, Gábor Torma, Zsolt Csabai, Gábor Gulyás, Ákos Dörmő, Miklós Boldogkői & Zsolt Boldogkői
MTA-SZTE Momentum GeMiNI Research Group, University of Szeged, Somogyi B. u. 4., Szeged, 6720, Hungary
Dóra Tombácz
Department of Biochemistry and Molecular Biology, Entomology and Plant Pathology, Mississippi State University, 408 Dorman Hall, 32 Creelman St., Box 9655, Starkville, MS, 39762, USA
Victoria A. Jefferson & Florencia Meyer

Authors

Zoltán Maróti
View author publications
You can also search for this author in PubMed Google Scholar
Dóra Tombácz
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Moldován
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Torma
View author publications
You can also search for this author in PubMed Google Scholar
Victoria A. Jefferson
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Csabai
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Gulyás
View author publications
You can also search for this author in PubMed Google Scholar
Ákos Dörmő
View author publications
You can also search for this author in PubMed Google Scholar
Miklós Boldogkői
View author publications
You can also search for this author in PubMed Google Scholar
Tibor Kalmár
View author publications
You can also search for this author in PubMed Google Scholar
Florencia Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Boldogkői
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.M., N.M., G.T., M.B., G.G., and T.K. carried out bioinformatic analysis of the viral transcripts. Z.C., D.T. and Á.D. prepared ONT MinION libraries, carried out ONT sequencing and participated in the analysis. D.T. generated Loop-seq libraries and qPCR experiments and analysis. Z.C. and V.A.J. isolated RNA, maintained cell cultures, propagated viruses and participated in nucleic acid isolation and library preparation. F.M. and D.T. drafted the manuscript. Z.B. integrated the data, performed the analysis, and wrote the final version of the manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Zsolt Boldogkői.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Supplementary Table S5.

Supplementary Table S6.

Supplementary Data Item.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Maróti, Z., Tombácz, D., Moldován, N. et al. Time course profiling of host cell response to herpesvirus infection using nanopore and synthetic long-read transcriptome sequencing. Sci Rep 11, 14219 (2021). https://doi.org/10.1038/s41598-021-93142-7

Download citation

Received: 16 March 2021
Accepted: 14 June 2021
Published: 09 July 2021
DOI: https://doi.org/10.1038/s41598-021-93142-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.