Competitive endogenous RNA network identifies four long non-coding RNA signature as a candidate prognostic biomarker for lung adenocarcinoma
Original Article

Competitive endogenous RNA network identifies four long non-coding RNA signature as a candidate prognostic biomarker for lung adenocarcinoma

Jing Hu1,2#, Tonglian Wang3#, Qiang Chen3

1Department of Medical Oncology, The Affiliated Hospital of Kunming University of Science and Technology, Kunming 650032, China; 2Department of Medical Oncology, The First People’s Hospital of Yunnan Province, Kunming 650032, China; 3Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming 650201, China

Contributions: (I) Conception and design: Q Chen; (II) Administrative support: Q Chen; (III) Provision of study materials for patients: J Hu, T Wang; (IV) Collection and assembly of data: J Hu, T Wang; (V) Data analysis and interpretation: All authors; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

#These authors contributed equally to this work.

Correspondence to: Qiang Chen. Faculty of Animal Science and Technology, Yunnan Agricultural University, No. 95 of Jinhei Road, Kunming 650201, China. Email: chq@sjtu.edu.cn.

Background: Lung adenocarcinoma (LUAD) is the most commonly histological subtype of lung cancer (LC) and the prognoses of the majority of LUAD patients are still very poor. The present study aimed at integrating long non-coding RNA (lncRNA), microRNA (miRNA) and messenger RNA (mRNA) expression data to construct lncRNA-miRNA-mRNA competitive endogenous RNA (ceRNA) network and identify importantly potential lncRNA signature in ceRNA network as a candidate prognostic biomarker for LUAD patients.

Methods: lncRNA, miRNA and mRNA expression data as well as clinical characteristics of LUAD patients were retrieved from The Cancer Genome Atlas (TCGA) database. Differentially expressed lncRNAs (DElncRNAs), differentially expressed mRNAs (DEmRNAs) and differentially expressed miRNA (DEmiRNA) between LUAD and normal lung tissues samples were analyzed. A lncRNA-miRNA-mRNA ceRNA network was constructed and the biological functions of DEmRNAs in ceRNA network were analyzed using gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses. Univariate and multivariate Cox regression analyses of DElncRNAs in ceRNA network were implemented to predict the overall survival (OS) in LUAD patients. The receiver operating characteristic (ROC) analysis was used to evaluate the performance of multivariate Cox regression model.

Results: A total of 1,664 DElncRNAs, 120 DEmiRNAs and 2,503 DEmRNAs was identified between LUAD and normal lung tissues samples. A lncRNA-miRNA-mRNA ceRNA network including 140 DElncRNAs, 33 DEmiRNAs and 57 DEmRNAs was established. Kaplan-Meier (KM) [Log-rank (LR) test] and univariate regression analysis of those 140 DElncRNAs revealed that 7 DElncRNAs (LINC00518, UCA1, NAV2-AS2, MED4-AS1, SYNPR-AS1, AC011483.1, AP002478.1) were simultaneously identified to be associated with OS of LUAD patients. A multivariate Cox regression analysis of those 7 DElncRNAs showed that a group of 4 DElncRNAs including AP002478.1 (Cox P=4.66E-03), LINC00518 (Cox P=2.34E-04), MED4-AS1 (Cox P=6.42E-03) and NAV2-AS2 (Cox P=6.66E-02) had significantly prognostic value in OS of LUAD patients. The cumulative risk score indicated that the 4-lncRNA signature was significantly associated with OS of LUAD patients (P=0). The area under the curve (AUC) of the 4-lncRNA signature related with 3-year survival was 0.669.

Conclusions: The present study provides novel insights into the lncRNA-related regulatory mechanisms in LUAD, and identifying 4-lncRNA signature may serve as a candidate prognostic biomarker in predicting the OS of LUAD patients.

Keywords: Lung adenocarcinoma (LUAD); competitive endogenous RNA (ceRNA) network; long non-coding RNA (lncRNAs); prognosis; biomarker; overall survival (OS)


Submitted Jan 07, 2019. Accepted for publication Jun 07, 2019.

doi: 10.21037/tcr.2019.06.09


Introduction

Lung cancer (LC) is among the most common types of malignancies and is the first leading cause of cancer-related death worldwide (1). In recent years, the morbidity and mortality of LC have been shown an increasing tendency, especially in some regions with higher tobacco consumption (2). According to pathological type, LC mainly includes small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC), and NSCLC is the major pathological type and accounts for approximately 85% of LC (3). Although some significant advances were obtained in new detecting and treating methods, the 5-year overall survival (OS) rate for all stages of NSCLC less than 20% (4). The main causes lie in that only approximately 15% NSCLC patients are timely determined at an early stage (I or II stage) and NSCLC tends to form metastasis at an early stage (5). If the majority of NSCLC patients are timely diagnosed at an early stage, active treatments of NSCLC are able to offer a favorable prognosis and the 5-year OS rate will raise to 70–90% (6).

Among NSCLC, lung adenocarcinoma (LUAD) is the major histological subtype and annually results in more than 600,000 deaths worldwide (7). Currently, the diagnosis and prognosis of LUAD is evaluated on the basis of histological grade, disease stage and the expression level of some key genes. However, due to the heterogeneity of LUAD, the clinical outcomes are highly variable and the predictive values are limited on the basis of clinical and pathological characteristics in detecting LUAD at an early stage (8). Even though some progresses have been obtained in pathological mechanisms involved in LUAD (8), molecular mechanisms underlying LUAD remain vague. Thus, it is vital to identify potentially molecular diagnostic and prognostic biomarkers and/or therapeutic targets for combating LUAD.

Non-coding RNAs (ncRNAs) are no-coding parts of genome at different cell status and account for more than 90% RNA transcripts (9). Recently, more and more studies have revealed the important regulatory roles of ncRNAs in tumor biology involving in formation, metastasis, prognosis and recurrence of tumor (10). Long non-coding RNAs (lncRNAs), an important class of ncRNA with over 200 nucleotides long (11), are widely reported to play as important regulatory components in tumor promoter and suppressor pathways (10,12). However, little knowledge is known about the roles of lncRNAs in tumor biology. Nevertheless, due to high tissue specificity of lncRNA expression, lncRNAs are still often considered as potentially valuable diagnostic biomarkers of various malignant tumors (13-16). Therefore, identifying LUAD-specific lncRNAs as biomarkers is of clinical value for diagnosis and prognosis of LUAD. Recently, several studies have reported that some lncRNAs were associated with formation and progression of LUAD (17-19), and some were predicted as biomarkers related with clinical outcomes (20,21). But, owing to limited tissue samples, the results from different studies are inconsistent, and lncRNA-related studies with large tissue samples are urgently needed to identify key lncRNAs associated with LUAD. The Cancer Genome Atlas (TCGA; https://cancergenome.nih.gov/) is a publicly available database which can provide large-scale multi-dimensional cancer-related molecular profile data. The current study used LUAD-related data with large sample size from TCGA database to identify LUAD-specific lncRNAs, and increase statistical reliability of this study.

The competitive endogenous RNA (ceRNA) hypothesis was proposed as a novel regulatory mechanism between coding messenger RNAs (mRNAs) and ncRNAs (22). Studies have demonstrated that lncRNAs as ceRNAs competitively combine with microRNAs (miRNAs) by miRNA-response-elements (MREs) to play the key roles in various biological processes (BP) such as tumor formation and metastasis (23). For instance, Liu et al. have proved that two lncRNAs DLEU2 and DDX11-AS1 in lncRNA-miRNA-mRNA ceRNA network were significantly up-regulated in gastric cancer (GC) tissues, and promoted GC cell proliferation by acting as has-miR-30-5p and has-miR-145-5p sponge (24). In serous ovarian cancer, lncRNA PTAR could promote epithelial-to-mesenchymal transition (EMT) and invasion-metastasis by competitively binding miR-101-3p to regulate ZEB1 expression (25). Fan et al. found that the aberrant expression of 4-lncRNA signature (ADAMTS9-AS1, LINC00536, AL391421.1 and LINC00491) in ceRNA network played the key roles in the progression and prognosis of breast cancer (26). In addition, a study has revealed that 7-lncRNA signature in ceRNA network was linked to clinical features in rectal adenocarcinoma (27). In summary, these studies manifest that the dysregulation of lncRNAs in ceRNA network can regulate lncRNA and mRNA interaction by miRNA mediating, and further contribute to the initiation and progression of cancer (28). However, although a few studies have paid attention to LC ceRNAs (29,30), little information is available for LC ceRNAs, especially LUAD ceRNAs.

In the present study, LUAD-related RNA sequencing (RNA-seq) data were retrieved from TCGA database. lncRNA-related ceRNA network was established and LUAD-specific lncRNAs were identified. Further, survival analysis of key lncRNAs was performed to predict important lncRNAs associated with OS of LUAD patients. The present study provides some novel insights into the regulatory mechanism of lncRNAs though ceRNA network in LUAD, and identifying lncRNA signature may serve as novel potentially diagnostic biomarkers and/or therapeutic targets.


Methods

RNA-seq data and patient information collection

RNA-seq data associated with LUAD were retrieved from TCGA database portal (https://cancergenome.nih.gov/). The exclusion criteria of LUAD-related tissues RNA-seq data were included as follows: (I) histological diagnosis for not LUAD; (II) except LUAD suffering from other malignancy; (III) data with incomplete clinical information. Finally, a total of 535 LUAD-related and 59 non-LUAD lung tissues RNA-seq data were retrieved. The clinical information of all LUAD samples patients were simultaneously downloaded from TCGA database. Similarly, LUAD-related miRNA RNA-seq data and clinical information in this study were downloaded from TCGA database. Finally, a total 521 LUAD-related and 46 non-LUAD lung tissues RNA-seq data was included. All RNA-seq data and clinical information were obtained from public TCGA database, and were approved by the Institutional Review Board of the relevant participating institutions and by the National Cancer Institute of NIH. No approval from the ethics committee was required. The present study meets the requirements of data usage and publishing from TCGA database.

RNA-seq data preprocessing and differentially expressed analysis

All raw RNA-seq data were subjected to normalization using the trimmed mean of M-values method.

EdgeR software package in Bioconductor project (version 3.8, http://www.bioconductor.org/) based on R (version 3.5.1, https://www.r-project.org/) language environment was used to screen the differentially expressed lncRNAs (DElncRNAs), differentially expressed mRNAs (DEmRNAs) and differentially expressed miRNAs (DEmiRNAs) between LUAD tissues and non-LUAD lung tissues samples (31). |Log fold change (logFC)| >2 and P<0.01 were set as cut-off criteria. The volcano plots of RNA expression were visualized using the ggplot2 (version 3.1.0, https://github.com/tidyverse/ggplot2) software package. The heat map of differentially expressed RNAs (DERNAs) was plotted using the pheatmap (version 1.0.10) software package.

Construction of lncRNA-miRNA-mRNA ceRNA network

The lncRNA-miRNA-mRNA ceRNA network was established based on ceRNA hypothesis that lncRNA directly combine miRNA by acting as miRNA sponge to indirectly regulate the function of mRNA (32). The ceRNA network was constructed according to the following steps: (I) three types of RNAs with |logFC| >2 and P<0.01 cut-off criteria were kept; (II) the potential target miRNAs of DElncRNAs, as well as the interaction relationships between DElncRNAs and target miRNAs were predicted using the online miRcode tool (miRcode 11, http://www.mircode.org/); (III) the potential target mRNAs of DEmiRNAs were predicted using the online tools including TargetScan (release 7.2, http://www.targetscan.org/), miRDB (version 5.0, http://mirdb.org/) and miRTarBase (release 7.0, http://mirtarbase.mbc.nctu.edu.tw/); (IV) based on ceRNA hypothesis, intersecting miRNAs negatively regulated by lncRNAs and mRNAs were selected to construct ceRNA network. The lncRNA-miRNA-mRNA ceRNA network was constructed and visualized using an open-source Cytoscape (version 3.7.0, https://cytoscape.org/) software (33).

Construction of protein-protein interaction network and analysis

The interactive relationships among DEmRNAs encoding proteins in ceRNA network was analyzed by constructing a protein and protein interaction (PPI) network. The interactive information among DEmRNAs encoding proteins was retrieved from online STRING database (version 11.0, https://string-db.org/) (34). The gene pairs with a combined score ≥0.7 were used to construct the PPI network. The Cytoscape software was used to construct and visualize the interactive relationships among genes within PPI network (version 3.7.0, http://www.cytoscape.org/) (33). The subnetwork was extracted from whole PPI network using Molecular COmplex DEtection (MCODE) algorithm based on topological properties of PPI network, and a plugin MCODE (version 1.5.1) in Cytoscape was used to perform MCODE analysis (35). The threshold parameters were set for degree cutoff =2, node score cutoff =0.2, K-core =2 and max-depth =100.

Identification of key genes

Key genes in PPI network were identified using Centrality analysis. Centrality analyses including subgraph centrality, degree centrality, eigenvector centrality, betweenness centrality, network centrality, information centrality and closeness centrality were performed using a plugin CytoNCA (version 2.1.6) in Cytoscape (36). The genes with higher centrality scores were identified as key gene, and essential genes were identified as intersecting genes of key genes obtained by seven centrality methods. GEPIA database (http://gepia.cancer-pku.cn) is an interactive web server for analyzing gene expression data of tumors and normal tissues from TCGA and genotype-tissue expression database (37), and was used to validate essential genes in PPI network.

LUAD-specific prognostic lncRNA signatures identification

The associations between DElncRNAs in ceRNA network and OS in LUAD patients were evaluated using Kaplan-Meier (KM) estimate and log-rank (LR) test in survival (version 2.43-3) package based on R. The statistical P<0.01 was considered as the significant association between lncRNA and OS. Univariate Cox proportional hazards regression model was performed to evaluate the association between DElncRNAs in ceRNA network and OS. Subsequently, multivariate Cox hazards regression model was implemented to assess the prognostic factors for LUAD. The hazards model was established as follows:

Risk socre = ExplncRNA1 × CoelncRNA1 + ExplncRNA2 × CoelncRNA2 + ExplncRNA3 × CoelncRNA3 + ExplncRNAn × CoelncRNAn

Where “Exp” denotes the expression level of lncRNA, and “Coe” is the regression coefficient from the multivariate Cox regression model (38). According to the median of above risk scores, LUAD patients were divided into high-risk and low-risk groups. Receiver operating characteristic (ROC) curve was constructed using survival ROC (version 1.0.3) package based on R, and was used to measure the risk prediction rate of lncRNAs between high- and low-risk groups.

All statistical analyses were performed using R and SPSS softwares.


Results

Identification of DElncRNA, DEmRNA and DEmiRNA

All raw data were normalized using edgeR package, and 8,896 lncRNAs, 18,123 mRNAs, and 672 miRNAs were obtained. Using differentially expressed analysis based on edgeR package, we identified DElncRNAs, DEmRNAs and DEmiRNAs between LUAD tissues samples and non-LUAD lung tissues samples. According to the |logFC| >2 and P<0.01 cut-off criteria, a total of 1,664 DElncRNAs (1,447 up- and 187 down-regulated; http://fp.amegroups.cn/cms/tcr.2019.06.09-1.pdf), 2,503 DEmRNAs (1,975 up- and 528 down-regulated; http://fp.amegroups.cn/cms/tcr.2019.06.09-2.pdf), and 120 DEmiRNAs (104 up- and 16 down-regulated; http://fp.amegroups.cn/cms/tcr.2019.06.09-3.pdf) was identified. The distributions of DElncRNAs, DEmRNAs and DEmiRNAs were displayed using volcano plots in Figure 1A. The expression levels of DElncRNAs, DEmRNAs and DEmiRNAs between LUAD tissues and non-LUAD lung tissues were displayed using heat map in Figure 1B.

Figure 1 Differentially expressed RNAs in lung adenocarcinoma. (A) Volcano plot of expressions of three types of RNAs. Horizontal axis represented the mean expression differences of lncRNAs, miRNAs and mRNAs between LUAD and normal lung tissues samples, and vertical axis represented log transformed FDR; (B) heatmap of expressions of three types of differentially expressed RNAs between LUAD and normal lung tissues samples. Horizontal axis represented the samples, and blue represented normal lung tissues samples and red represented LUAD tissues samples. Vertical axis represented differentially expressed lncRNAs, miRNAs and mRNAs, and blue represented down-regulated lncRNAs, miRNAs and mRNAs, and red represented up-regulated lncRNAs, miRNAs and mRNAs. The color scale showed the expression values. lncRNAs, long non-coding RNAs; miRNAs, microRNAs; mRNAs, messenger RNAs; FDR, false discovery rate; LUAD, lung adenocarcinoma.

Construction of ceRNA network

The relationships among DElncRNAs, DEmiRNAs and DEmRNAs needed to be obtained before lncRNA-miRNA-mRNA ceRNA network was constructed. Using online miRcode tool, the relationships among 1,664 DElncRNAs and 120 DEmiRNAs were evaluated, and finally 140 LUAD-specific DElncRNAs (123 up- and 17 down-regulated) (Table 1) and 33 LUAD-specific DEmiRNAs (28 up- and 5 down-regulated) (Table 2) were putatively identified to interact. Using online tools including TargetScan, miRDB and miRTarBase, we predicted the relationships among DEmiRNAs and their target DEmRNAs. Finally, 57 LUAD-specific target DEmRNAs (41 up- and 16 down-regulated) (Table 3) were putatively identified to interact with 33 LUAD-specific DEmiRNAs.

Table 1
Table 1 lncRNAs in ceRNA network of LUAD
Full table
Table 2
Table 2 miRNAs in ceRNA network of LUAD
Full table
Table 3
Table 3 mRNAs in ceRNA network of LUAD
Full table

On the basis of the above data, lncRNA-miRNA-mRNA ceRNA network was constructed and visualized using Cytoscape software. On the basis of ceRNA hypothesis and the expression levels of DElncRNAs, DEmiRNAs and DEmRNAs, two lncRNA-miRNA-mRNA ceRNA networks including over-expressed (Figure 2A) and under-expressed (Figure 2B) networks were constructed and visualized using Cytoscape. In over-expressed ceRNA network, 65 up-regulated DElncRNAs, 5 down-regulated DEmiRNAs and 17 up-regulated DEmRNAs were included and 114 edges were contained. In under-expressed ceRNA network, 17 down-regulated DElncRNAs, 24 up-regulated DEmiRNAs and 13 down-regulated DEmRNAs were included and 78 edges were contained.

Figure 2 lncRNA-miRNA-mRNA ceRNA network. (A) Over-expressed network was constructed with up-regulated lncRNAs, down-regulated miRNAs and up-regulated mRNAs on the basis of ceRNA theory. Red circles represented up-regulated mRNAs, green rectangles represented up-regulated lncRNAs, and blue diamonds represented down-regulated miRNAs. Larger nodes meant more edges; (B) under-expressed network was constructed with down-regulated lncRNAs, up-regulated miRNAs and down-regulated mRNAs on the basis of ceRNA theory. Blue circles represented down-regulated mRNAs, yellow rectangles represent down-regulated lncRNAs, and red diamonds represented up-regulated miRNAs. Larger nodes mean more edges. lncRNAs, long non-coding RNAs; miRNAs, microRNAs; mRNAs, messenger RNAs; ceRNA, competitive endogenous RNA.

Construction of PPI network and key gene validation

The interactive relationships of 57 DEmRNAs encoding proteins in ceRNA network were elucidated using PPI network. At minimum required interaction score = the high confidence 0.7, a total of 25 among 57 genes was filtered into PPI network, and a PPI network with 25 nodes and 38 edges was established (Figure 3A). Highly correlated module analysis showed that 2 modules were identified in whole PPI network, and the most significant module (subnetwok1) included 8 nodes and 26 edges (Figure 3B). Centrality analysis showed that centrality score of CCNB1 gene obtained from each centrality method ranked first (degree =7, betweenness centrality =0.01904762, closeness centrality =1, eigenvector =0.407759339, subgraph =73.9836731, information =2.274614334, network =6.647619048), and CCNB1 gene was identified as essential gene in subnetwork1. The expression analysis based on GEPIA database showed CCNB1 gene was significantly up-regulated in LUAD (Figure 3C), and survival analysis revealed that low mRNA expression of CCNB1 resulted in a higher OS rate than high mRNA expression in LUAD patients (Figure 3D). CCNB1 is the Protein Coding gene of Cyclin B1, and play role in mitosis. Cyclin B1 is necessary for proper control of the G2/M transition phase of the cell cycle. Some studies have reported that the dysregulated expression of CCNB1 resulted in many diseases including cancer (39,40). Recently, several studies showed that overexpression of CCNB1 was correlated with poor survival in most solid tumors including LUAD (41,42), which suggested that the expression status of CCNB1 was able to serve as a significant prognostic predictor in solid tumors (42). Our result further validated the feasibility of mRNA CCNB1 as a prognosticator in LUAD.

Figure 3 DEmRNA analysis in ceRNA network (A) PPI network of DEmRNAs encoding proteins in ceRNA network was constructed on the basis of interaction information from online STRING database; (B) highly correlated module was identified in whole PPI network by MCODE algorithm; (C) CCNB1 was identified as key gene in PPI network, and was significantly highly expressed in LUAD tissue; (D) survival analysis showed that low mRNA expression of CCNB1 resulted in a higher overall survival rate in LUAD patients. LUAD, lung adenocarcinoma; PPI, protein and protein interaction; DEmRNAs, differentially expressed mRNAs; ceRNA, competitive endogenous RNA; MCODE, Molecular Complex Detection.

Gene function analysis of DEmRNAs

For the purpose of better understanding the roles of the DElncRNAs in ceRNA network in LUAD, gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses for 57 DEmRNAs in ceRNA network were performed using online STRING database, and GO terms and KEGG pathways regulated by DElncRNAs were identified. Biological process (BP), cellular component (CC) and molecular function (MF) of the most significant association to DEmRNAs were separately positive regulation of biological process [GO: 0048518, false discovery rate (FDR) =5.12E-05], chromatin (GO: 0000785, FDR =0.0137) and sequence-specific DNA binding (GO: 0043565, FDR =0.00101). The most significant enrichment KEGG pathway was cell cycle (has04110, FDR =0.000308). As we all know, cell cycle is one of the most important pathways belonging to cell growth and death, and lots of studies have demonstrated that cell cycle was dysregulated in various cancers (43,44).

OS analysis of LUAD-specific lncRNA signature

The relationships of OS of LUAD patients and DElncRNAs in ceRNA network were determined using KM and LR test. According to P<0.01 cut-off threshold, 9 DElncRNAs including LINC00518 (P=8.375E-06), UCA1 (P=4.709E-03), NAV2-AS2 (P=5.768E-03), CHODL-AS1 (P=6.035E-03), MED4-AS1 (P=6.565E-03), SYNPR-AS1 (P=9.765E-04), AC011483.1 (P=4.288E-03), POU6F2-AS1 (P=7.019E-03) and AP002478.1 (P=9.339E-03) were found to be related to OS of LUAD-patients (Figure S1).

Figure S1 The relationships of lncRNAs and overall survival of LUAD patients. lncRNAs, long non-coding RNAs; LUAD, lung adenocarcinoma.

Establishment of lncRNA signature prognostic model

We used univariate regression analysis to identify DElncRNAs associated with the OS of LUAD patients. With P<0.01 cut-off threshold, a group of 18 DElncRNAs including LINC00518 (P=5.11E-06), C20orf197 (P=8.68E-05), AP002478.1 (P=0.000365572), C11orf44 (P=0.000933011), LINC00460 (P=0.001760147), UCA1 (P=0.001836663), SYNPR-AS1 (P=0.002361183), LINC00319 (P=0.003555803), LINC00337 (P=0.00423919), AC004832.1 (P=0.004503163), LNX1-AS2 (P=0.004812281), NAV2-AS2 (P=0.004889676), C5orf64 (P=0.00513461), AC080129.1 (P=0.006289307), MED4-AS1 (P=0.006517782), AP000438.1 (P=0.008252988), AC011483.1 (P=0.008263272) and ABCA9-AS1 (P=0.009646077) was detected to have significantly prognostic value (http://fp.amegroups.cn/cms/tcr.2019.06.09-4.pdf). Further, we found that 7 DElncRNAs (LINC00518, UCA1, NAV2-AS2, MED4-AS1, SYNPR-AS1, AC011483.1, AP002478.1) were simultaneously identified to be related to OS in KM (LR test) and univariate Cox regression analysis. The 7 DElncRNAs were fitted into the multivariate Cox regression model, and the result showed that a group of 4 DElncRNAs including AP002478.1 (Cox P=4.66E-03), LINC00518 (Cox P=2.34E-04), MED4-AS1 (Cox P=6.42E-03) and NAV2-AS2 (Cox P=6.66E-02) had significantly prognostic value with OS of LUAD patients (Figure 4), and 4-lncRNA prognostic model was established. With the risk score of each patient (http://fp.amegroups.cn/cms/tcr.2019.06.09-5.pdf), the patients were divided into high- and low- risk groups (Figure 5A,B). Comparing to low risk group, the mortality rate of patients in high-risk group was significantly higher, and high-risk group was related with worse prognosis (Figure 5C). The ROC of 3-year survival correlation of the 4-lncRNA signature was analyzed, and area under the curve (AUC) was computed. The AUC of 4-lncRNA signature was 0.669, which indicated its effectiveness as a prognostic biomarker in evaluating OS of LUAD-patients (Figure 5D).

Figure 4 Kaplan-Meier survival cures for 4 lncRNAs associated with overall survival of lung adenocarcinoma patients. Log-rank test was used to evaluate the survival differences between low- and high- expression groups. Horizontal axis represented overall survival time (years) and vertical axis represented survival function. lncRNAs, long non-coding RNAs.
Figure 5 The 4-lncRNAs signature for the outcome. The 4 lncRNAs included AP002478.1, LINC00518, MED4-AS1 and NAV2-AS2. ((A) The LUAD patients were divided into high- and low- risk groups according to risk scores of LUAD patients; (B) the survival information between high- and low- risk groups was plotted using ggplot2 package; (C) log-rank test was used to determine the survival differences between the low- and high- risk groups (p=0); (D) ROC curve showed that AUC of 4-lncRNA prognostic model was 0.669. lncRNAs, long non-coding RNAs; LUAD, lung adenocarcinoma; ROC, receiver operating characteristic; AUC, area under the curve.

The expressions of the 4 lncRNAs were analyzed in LUAD tissues and normal lung tissues (Figure 6A), in high-risk and low-risk patient groups (Figure 6B), and in alive and dead groups (Figure 6C). AP002478.1 and LINC00518 were expressed at high level in LUAD patients with low risk scores. Whereas, MED4-AS1 and NAV2-AS2 were expressed at low level in LUAD patients with high risk scores. The expressions of 4 lncRNAs in alive and dead patients showed that AP002478.1 and LINC00518 were expressed at high level, and MED4-AS1 and NAV2-AS2 were expressed at low level in dead patients. The expressions [log2 (expression value +1)] of the 4 lncRNAs were showed in Figure 6D.

Figure 6 Expression pattern of the 4-lncRNA. The 4 lncRNAs included AP002478.1, LINC00518, MED4-AS1 and NAV2-AS2. (A) The expression pattern of the 4-lncRNA was in lung adenocarcinoma and normal tissues samples; (B) the expression pattern of 4-lncRNA was in high- and low- risk groups; (C) the expression pattern of 4-lncRNA was in alive and dead patients; (D) the expression level [log2 (expression value +1)] of 4-lncRNA was in experimental tissues samples. lncRNAs, long non-coding RNAs.

Correlation of four-lncRNA signature and clinical factor

The expression analysis of 4-lncRNA signature showed that AP002478.1 and MED-AS1 were highly expressed in male LUAD patients (P=0.002, 0.006, respectively; Figure 7A), and expression of AP002478.1 was significantly correlated to pathological stage and was increased with pathological stage (P=0.013; Figure 7B). Kaplan-Meier analysis showed that pathological stage was significantly correlated to OS of LUAD patients (LR P=0.009; Figure 7C), and the correlation of gender and OS had no statistical significance (LR P=0.215; Figure 7D). The results indicated that pathological stage was independent prognostic factor of OS in LUAD patients.

Figure 7 Correlation of 4-lncRNA signature and clinical factor. (A) The expression analysis of 4-lncRNA signature showed that AP002478.1 and MED-AS1 were highly expressed in male LUAD patients; (B) the expression analysis of 4-lncRNA signature showed that the expression of AP002478.1 was significantly correlated to pathological stage; (C) Kaplan-Meier curve showed that pathological stage was significantly correlated to OS of LUAD patients; (D) Kaplan-Meier curve showed that gender had no significant correlation to OS of LUAD patients. lncRNAs, long non-coding RNAs; OS, overall survival; LUAD, lung adenocarcinoma.

Discussion

LC is among the most common malignant cancer, and has jumped to the first cause of cancer-related malignant in recent years. LUAD is the major pathological subtype of LC (7), but the majority of the patients with LUAD remain short survival times, and the overall 5-year survival rate for all stages of LUAD less than 20% (4). The identifying specific diagnostic and prognostic biomarkers may contribute to improve the OS rate of LUAD patients. For this, it is necessary to further understand the regulatory mechanisms of LUAD initiation and progression, and to identify potentially prognostic signatures related to LUAD. Increasing evidence shows that lncRNAs acting as ceRNAs play key roles in cancer biology (10).

In recent years, several studies have explored the roles of RNAs in LUAD-specific ceRNA network (20,45,46). For example, Li et al. investigated the LUAD-related ceRNA network and identified some potential LUAD-related prognostic lncRNAs using univariate Cox regression (46), but did not consider multivariate Cox regression. Similarly, another study also analyzed the LUAD-related ceRNA network and validated some lncRNAs (45), but still did not take into account the prognostic signature using multivariate Cox regression. In this study, we integrated LUAD-related lncRNA, miRNA and mRNA expression data from TCGA database and constructed LUAD-specific lncRNA-miRNA-mRNA ceRNA network based on the ceRNA theory. Further, we investigated the association of lncRNAs in ceRNA network and OS of LUAD patients. KM (LR test) and univariate regression analysis showed that 7 lncRNAs in ceRNA network were simultaneously associated with OS of LUAD patients. Multivariate regression analysis showed that prognostic value of 4 lncRNAs (AP002478.1, LINC00518, MED4-AS1 and NAV2-AS2) in those 7 lncRNAs had statistical significance in the OS of LUAD patients. A cumulative risk score of the 4 lncRNAs was calculated, and 0.669 AUC indicated that the 4-lncRNA signature might predict and evaluate the OS of LUAD patients.

In the current study, lncRNAs AP002478.1, LINC00518, MED4-AS1 and NAV2-AS2 were identified to play important roles in the prognosis of LUAD patients. A recent study showed that the expression of AP002478.1 was associated with GC with Helicobacter pylori infection, and might serve as a prognostic biomarker of OS in GC patients with positive Helicobacter pylori (47). In this study, we noticed that the expression of AP002478.1 was significantly up-regulated in LUAD tissues, and up-regulated AP002478.1 could compete with down-regulated miRNAs (has-mir-195, has-mir-144, has-mir-184) to regulate the expression of the target genes such as CHEK1, KIF23, CCNE1 and HOXA10 involved in over-expressed ceRNA network (Figure 2A). Previous studies have reported that those miRNAs and their targets were associated with cancers (48-53). For example, Zhao et al. found that down-regulated has-mir-195 was associated with poor differentiation of NSCLC (50), and Tan et al. found down-regulated has-mir-195 was related to the onset, erosion and transfer of colorectal cancer (54). A lot of studies reported that has-mir-144 were aberrantly expressed in some cancers such as osteosarcoma (55), rectal cancer (56), and cervical cancer (57). In addition, several studies showed has-mir-184 was down-regulated in epithelial ovarian cancer and could act as a potentially diagnostic and prognostic marker (58). So far, no study has reported that LINC00518, MED4-AS1 and NAV2-AS2 were associated with cancers. The present study is the first to identify the aberrant expression of AP002478.1, LINC00518, MED4-AS1 and NAV2-AS2 in LUAD, and indicated the potential roles of the 4-lncRNA signature as prognostic biomarker in LUAD. This study will contribute to further understand the lncRNA-mediated ceRNA regulatory mechanisms in LUAD, and identify novel lncRNAs as potentially prognostic biomarkers and/or therapeutic targets.

Although the findings of this study provided novel potential important lncRNA biomarkers for clinic in predicting and evaluating OS of LUAD patients, the limitations were that the results were not verified by other experimental methods. Further, the biological roles of the 4 lncRNAs in LUAD need to be investigated.


Conclusions

In the current study, 4-lncRNA signature was identified as a potentially prognostic predictor of OS for LUAD patients based on ceRNA network. The current findings provide some novel insights into the regulatory mechanisms related to lncRNA-related ceRNA network in LUAD and identify 4 lncRNAs to serve as potentially prognostic biomarkers. Further, functional studies underlying lncRNAs function in LUAD-specific ceRNA network are required to elucidate.


Acknowledgments

Funding: This work was supported by National Natural Science Foundation of China (grant no 31660655), Yunnan Province Applied Basic Research Projects (grant nos 2016FB146 and 2018FE001) and Fund from Health and Family Planning Commission of Yunnan Province (grant no 2017NS261).


Footnote

Conflicts of Interest: The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.


References

  1. Torre LA, Siegel RL, Jemal A. Lung Cancer Statistics. Adv Exp Med Biol 2016;893:1-19. [Crossref] [PubMed]
  2. Yao T, Sung HY, Mao Z, et al. The healthcare costs of secondhand smoke exposure in rural China. Tob Control 2015;24:e221-6. [Crossref] [PubMed]
  3. Blandin Knight S, Crosbie PA, Balata H, et al. Progress and prospects of early detection in lung cancer. Open Biol 2017. [Crossref] [PubMed]
  4. Wu K, House L, Liu W, et al. Personalized targeted therapy for lung cancer. Int J Mol Sci 2012;13:11471-96. [Crossref] [PubMed]
  5. Walters S, Maringe C, Coleman MP, et al. Lung cancer survival and stage at diagnosis in Australia, Canada, Denmark, Norway, Sweden and the UK: a population-based study, 2004-2007. Thorax 2013;68:551-64. [Crossref] [PubMed]
  6. Nesbitt JC, Putnam JB Jr, Walsh GL, et al. Survival in early-stage non-small cell lung cancer. Ann Thorac Surg 1995;60:466-72. [Crossref] [PubMed]
  7. Zagryazhskaya A, Gyuraszova K, Zhivotovsky B. Cell death in cancer therapy of lung adenocarcinoma. Int J Dev Biol 2015;59:119-29. [Crossref] [PubMed]
  8. Ohba T, Toyokawa G, Osoegawa A, et al. Mutations of the EGFR, K-ras, EML4-ALK, and BRAF genes in resected pathological stage I lung adenocarcinoma. Surg Today 2016;46:1091-8. [Crossref] [PubMed]
  9. ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 2012;489:57-74. [Crossref] [PubMed]
  10. Qiu MT, Hu JW, Yin R, et al. Long noncoding RNA: an emerging paradigm of cancer research. Tumour Biol 2013;34:613-20. [Crossref] [PubMed]
  11. Mercer TR, Dinger ME, Mattick JS. Long non-coding RNAs: insights into functions. Nat Rev Genet 2009;10:155-9. [Crossref] [PubMed]
  12. Huarte M, Rinn JL. Large non-coding RNAs: missing links in cancer? Hum Mol Genet 2010;19:R152-61. [Crossref] [PubMed]
  13. Hao F, Mou Y, Zhang L, et al. LncRNA AFAP1-AS1 is a prognostic biomarker and serves as oncogenic role in retinoblastoma. Biosci Rep 2018. [Crossref] [PubMed]
  14. Qin J, Ning H, Zhou Y, et al. LncRNA MIR31HG overexpression serves as poor prognostic biomarker and promotes cells proliferation in lung adenocarcinoma. Biomed Pharmacother 2018;99:363-8. [Crossref] [PubMed]
  15. Xiao Z, Qu Z, Chen Z, et al. LncRNA HOTAIR is a Prognostic Biomarker for the Proliferation and Chemoresistance of Colorectal Cancer via MiR-203a-3p-Mediated Wnt/ss-Catenin Signaling Pathway. Cell Physiol Biochem 2018;46:1275-85. [Crossref] [PubMed]
  16. Zeng Y, Wang T, Liu Y, et al. LncRNA PVT1 as an effective biomarker for cancer diagnosis and detection based on transcriptome data and meta-analysis. Oncotarget 2017;8:75455-66. [Crossref] [PubMed]
  17. Liu C, Ren L, Deng J, et al. LncRNA TP73-AS1 promoted the progression of lung adenocarcinoma via PI3K/AKT pathway. Biosci Rep 2019.39. [PubMed]
  18. Xiong DD, Li ZY, Liang L, et al. The LncRNA NEAT1 Accelerates Lung Adenocarcinoma Deterioration and Binds to Mir-193a-3p as a Competitive Endogenous RNA. Cell Physiol Biochem 2018;48:905-18. [Crossref] [PubMed]
  19. Ye JJ, Cheng YL, Deng JJ, et al. LncRNA LINC00460 promotes tumor growth of human lung adenocarcinoma by targeting miR-302c-5p/FOXA1 axis. Gene 2019;685:76-84. [Crossref] [PubMed]
  20. Li L, Feng T, Qu J, et al. LncRNA Expression Signature in Prediction of the Prognosis of Lung Adenocarcinoma. Genet Test Mol Biomarkers 2018;22:20-8. [Crossref] [PubMed]
  21. Li X, Li B, Ran P, et al. Identification of ceRNA network based on a RNA-seq shows prognostic lncRNA biomarkers in human lung adenocarcinoma. Oncol Lett 2018;16:5697-708. [PubMed]
  22. Salmena L, Poliseno L, Tay Y, et al. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language? Cell 2011;146:353-8. [Crossref] [PubMed]
  23. Conte F, Fiscon G, Chiara M, et al. Role of the long non-coding RNA PVT1 in the dysregulation of the ceRNA-ceRNA network in human breast cancer. PLoS One 2017;12:e0171661. [Crossref] [PubMed]
  24. Liu H, Zhang Z, Wu N, et al. Integrative Analysis of Dysregulated lncRNA-Associated ceRNA Network Reveals Functional lncRNAs in Gastric Cancer. Genes (Basel) 2018;9:303. [Crossref] [PubMed]
  25. Liang H, Yu T, Han Y, et al. LncRNA PTAR promotes EMT and invasion-metastasis in serous ovarian cancer by competitively binding miR-101-3p to regulate ZEB1 expression. Mol Cancer 2018;17:119. [Crossref] [PubMed]
  26. Fan CN, Ma L, Liu N. Systematic analysis of lncRNA-miRNA-mRNA competing endogenous RNA network identifies four-lncRNA signature as a prognostic biomarker for breast cancer. J Transl Med 2018;16:264. [Crossref] [PubMed]
  27. Zhang Z, Wang S, Ji D, et al. Construction of a ceRNA network reveals potential lncRNA biomarkers in rectal adenocarcinoma. Oncol Rep 2018;39:2101-13. [PubMed]
  28. Karreth FA, Pandolfi PP. ceRNA cross-talk in cancer: when ce-bling rivalries go awry. Cancer Discov 2013;3:1113-21. [Crossref] [PubMed]
  29. Jin X, Guan Y, Sheng H, et al. Crosstalk in competing endogenous RNA network reveals the complex molecular mechanism underlying lung cancer. Oncotarget 2017;8:91270-80. [Crossref] [PubMed]
  30. Wei Y, Chang Z, Wu C, et al. Identification of potential cancer-related pseudogenes in lung adenocarcinoma based on ceRNA hypothesis. Oncotarget 2017;8:59036-47. [Crossref] [PubMed]
  31. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010;26:139-40. [Crossref] [PubMed]
  32. Guo LL, Song CH, Wang P, et al. Competing endogenous RNA networks and gastric cancer. World J Gastroenterol 2015;21:11680-7. [Crossref] [PubMed]
  33. Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003;13:2498-504. [Crossref] [PubMed]
  34. Szklarczyk D, Franceschini A, Wyder S, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 2015;43:D447-52. [Crossref] [PubMed]
  35. Bader GD, Hogue CW. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 2003;4:2. [Crossref] [PubMed]
  36. Tang Y, Li M, Wang J, et al. CytoNCA: a cytoscape plugin for centrality analysis and evaluation of protein interaction networks. Biosystems 2015;127:67-72. [Crossref] [PubMed]
  37. Tang Z, Li C, Kang B, et al. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res 2017;45:W98-W102. [Crossref] [PubMed]
  38. Zeng JH, Liang L, He RQ, et al. Comprehensive investigation of a novel differentially expressed lncRNA expression profile signature to assess the survival of patients with colorectal adenocarcinoma. Oncotarget 2017;8:16811-28. [PubMed]
  39. Chai N, Xie HH, Yin JP, et al. FOXM1 promotes proliferation in human hepatocellular carcinoma cells by transcriptional activation of CCNB1. Biochem Biophys Res Commun 2018;500:924-9. [Crossref] [PubMed]
  40. Liu D, Xu W, Ding X, et al. Polymorphisms of CCNB1 Associated With the Clinical Outcomes of Platinum-Based Chemotherapy in Chinese NSCLC Patients. J Cancer 2017;8:3785-94. [Crossref] [PubMed]
  41. Ding K, Li W, Zou Z, et al. CCNB1 is a prognostic biomarker for ER+ breast cancer. Med Hypotheses 2014;83:359-64. [Crossref] [PubMed]
  42. Ye C, Wang J, Wu P, et al. Prognostic role of cyclin B1 in solid tumors: a meta-analysis. Oncotarget 2017;8:2224-32. [PubMed]
  43. Fan Y, Zhu X, Xu Y, et al. Cell-Cycle and DNA-Damage Response Pathway Is Involved in Leptomeningeal Metastasis of Non-Small Cell Lung Cancer. Clin Cancer Res 2018;24:209-16. [Crossref] [PubMed]
  44. Sherr CJ. Cell cycle control and cancer. Harvey Lect 2000-2001;96:73-92. [PubMed]
  45. Sui J, Li YH, Zhang YQ, et al. Integrated analysis of long non-coding RNAassociated ceRNA network reveals potential lncRNA biomarkers in human lung adenocarcinoma. Int J Oncol 2016;49:2023-36. [Crossref] [PubMed]
  46. Li DS, Ainiwaer JL, Sheyhiding I, et al. Identification of key long non-coding RNAs as competing endogenous RNAs for miRNA-mRNA in lung adenocarcinoma. Eur Rev Med Pharmacol Sci 2016;20:2285-95. [PubMed]
  47. Liu Y, Zhu J, Ma X, et al. ceRNA network construction and comparison of gastric cancer with or without Helicobacter pylori infection. J Cell Physiol 2019;234:7128-40. [Crossref] [PubMed]
  48. Wang YB, Zhao XH, Li G, et al. MicroRNA-184 inhibits proliferation and promotes apoptosis of human colon cancer SW480 and HCT116 cells by downregulating C-MYC and BCL-2. J Cell Biochem 2018;119:1702-15. [Crossref] [PubMed]
  49. Yao Q, Gu A, Wang Z, et al. MicroRNA-144 functions as a tumor suppressor in gastric cancer by targeting cyclooxygenase-2. Exp Ther Med 2018;15:3088-95. [PubMed]
  50. Zhao Q, Cao J, Wu YC, et al. Circulating miRNAs is a potential marker for gefitinib sensitivity and correlation with EGFR mutational status in human lung cancers. Am J Cancer Res 2015;5:1692-705. [PubMed]
  51. Kato T, Wada H, Patel P, et al. Overexpression of KIF23 predicts clinical outcome in primary lung cancer patients. Lung Cancer 2016;92:53-61. [Crossref] [PubMed]
  52. Noack S, Raab M, Matthess Y, et al. Synthetic lethality in CCNE1-amplified high grade serous ovarian cancer through combined inhibition of Polo-like kinase 1 and microtubule dynamics. Oncotarget 2018;9:25842-59. [Crossref] [PubMed]
  53. Yuan Y, Sun S, Jiao N, et al. Upregulation of HOXA10 Protein Expression Predicts Poor Prognosis for Colorectal Cancer. Genet Test Mol Biomarkers 2018;22:390-7. [Crossref] [PubMed]
  54. Tan YG, Zhang YF, Guo CJ, et al. Screening of differentially expressed microRNA in ulcerative colitis related colorectal cancer. Asian Pac J Trop Med 2013;6:972-6. [Crossref] [PubMed]
  55. Green D, Mohorianu I, Piec I, et al. MicroRNA expression in a phosphaturic mesenchymal tumour. Bone Rep 2017;7:63-9. [Crossref] [PubMed]
  56. Cai SD, Chen JS, Xi ZW, et al. MicroRNA144 inhibits migration and proliferation in rectal cancer by downregulating ROCK1. Mol Med Rep 2015;12:7396-402. [Crossref] [PubMed]
  57. Tao P, Wen H, Yang B, et al. miR-144 inhibits growth and metastasis of cervical cancer cells by targeting VEGFA and VEGFC. Exp Ther Med 2018;15:562-8. [PubMed]
  58. Qin CZ, Lou XY, Lv QL, et al. MicroRNA-184 acts as a potential diagnostic and prognostic marker in epithelial ovarian cancer and regulates cell proliferation, apoptosis and inflammation. Pharmazie 2015;70:668-73. [PubMed]
Cite this article as: Hu J, Wang T, Chen Q. Competitive endogenous RNA network identifies four long non-coding RNA signature as a candidate prognostic biomarker for lung adenocarcinoma. Transl Cancer Res 2019;8(4):1046-1064. doi: 10.21037/tcr.2019.06.09