Triple-negative breast cancer (TNBC) is a subtype of breast cancer (BC) with the most aggressive phenotype and poor overall survival. Using bioinformatics tools, we identified LINC00908 encoding a 60–aa polypeptide and differentially expressed in TNBC tissues. We named this endogenously expressed polypeptide ASRPS (a small regulatory peptide of STAT3). ASRPS expression was down-regulated in TNBCs and associated with poor overall survival. We showed that LINC00908 was directly regulated by ERα, which was responsible for the differential down-regulation of LINC00908 in TNBCs. ASRPS directly bound to STAT3 through the coiled coil domain (CCD) and down-regulated STAT3 phosphorylation, which led to reduced expression of VEGF. In human endothelial cells, a mouse xenograft breast cancer model, and a mouse spontaneous BC model, ASRPS expression reduced angiogenesis. In a mouse xenograft breast cancer model, down-regulation of ASRPS promoted tumor growth, and ASRPS acted as an antitumor peptide. We presented strong evidence that LINC00908-encoded polypeptide ASRPS represented a TNBC-specific target for treatment.
Triple-negative breast cancer (TNBC) does not express the three most common breast cancer (BC) markers, estrogen receptor (ER), progesterone receptor, and HER2/neu oncogene (HER2), and belongs to the basal-like BC type by gene expression profile (Elias, 2010). In the United States, TNBC accounts for 10–15% of all BCs, and is more common in women of African descent and in women <50 yr old (Brewster et al., 2014). The incidence of BC in China has been increasing at a rate of 3.5% per year since 2000. It was estimated that there were 268,600 new BC cases and 69,500 deaths from BC in China in 2015 (Chen et al., 2016). With current annual rate of increase, it is projected that by 2021, there would be 2.5 million cases of BC in China. About 20% of BCs in China are TNBC subtype (Peng et al., 2016). TNBC is the most aggressive subtype of BC and is associated with worst prognosis and overall survival (OS; Son et al., 2019). It is characterized by high cell proliferation, poor cellular differentiation, and more frequent disease recurrence (Chang et al., 2018). Compared with non-TNBC, TNBC patients have poor prognosis, including higher nuclear grade, increased incidence of lung and brain metastases, and shorter recurrence-free interval. Further, TNBC patients do not benefit from endocrine therapy or HER2-targeted therapy (Foulkes et al., 2010), and TNBC patients demonstrate enhanced angiogenesis when compared with non-TNBC patients (Ribatti et al., 2016). Despite the fact that significant improvement has been made for BC treatment, treatment options for TNBC are limited, including surgery, radiotherapy, and chemotherapy. No targeted therapies are available for TNBC. Elucidating the tumorigenesis in TNBC is key to developing TNBC-specific therapies and improving patients’ OS.
Long noncoding RNAs (lncRNAs) are nonprotein coding transcripts longer than 200 nucleotides. Recent studies have shown that lncRNAs participate in cancer initiation and progression (Kong et al., 2018). Aberrant expression of lncRNAs has been observed in various types of cancer, including BC. LncRNAs can act as either tumor-suppressor genes or oncogenes (Li et al., 2019; Wu et al., 2015), and regulate cell proliferation and invasion (Li et al., 2017). Mechanistically, lncRNAs serve as transcription regulators through different modes. They can act as signal RNAs to regulate transcription in response to various stimuli; they can act as decoy RNAs to limit the availability of regulatory factors; they can act as scaffold RNAs to provide platforms for assembling protein complexes for transcription; they can act as guide RNAs for the proper localization of ribonucleoproteins; and they can act as enhancer RNAs for chromatin interactions (Fang and Fullwood, 2016). More recently, it has also been shown that some noncoding RNAs function through their encoding polypeptides (Ingolia et al., 2011). Notably, an 87-aa tumor suppressive peptide encoded by the circular form of long intergenic nonprotein-coding RNA could directly interact with polymerase associated factor complex and inhibit the transcriptional elongation in glioblastoma (Zhang et al., 2018); Circ-AKT3 encodes a 174-aa novel protein that is a negative regulator of the receptor tyrosine kinase/PI3K pathway (Xia et al., 2019). A polypeptide named SPAR (small regulatory polypeptide of amino acid response), encoded by LINC00961, can regulate mTORC1 and muscle regeneration (Matsumoto et al., 2017); lncRNA HOXB-AS3 encodes a peptide suppressing colon cancer growth (Huang et al., 2017); myoregulin, encoded by a putative lncRNA, regulates muscle performance (Anderson et al., 2015); a recent study identified CASIMO1, a novel small protein encoding by a small open reading frame (ORF) from lncRNA that interacted with squalene epoxidase and was able to influence lipid droplet formation (Polycarpou-Schwarz et al., 2018).
Recent studies have identified a series of dysregulated lncRNAs in TNBC when compared with adjacent normal tissues (Kong et al., 2018; Rodríguez Bautista et al., 2018). These differentially expressed lncRNAs can be used to classify TNBCs into different subgroups and facilitate the development of targeted therapies (Zhang et al., 2019); they can also be used as prognostic markers for survival and potential therapeutic targets. However, little is known about how these lncRNAs are regulated, and no reports have identified lncRNAs with coding potentials.
In the present study, we identified TNBC-specific lncRNAs from The Cancer Genome Atlas (TCGA) database and predicted the lncRNAs with coding potential using the GWIPS-viz dataset. Subsequently, we showed that one of these TNBC-specific lncRNAs, LINC00908, is regulated by ERα, and it contains small ORFs encoding a 60-aa functional polypeptide, a small regulatory peptide of STAT3 (ASRPS), that is involved in TNBC progression. Finally, we determined the signaling pathways regulating the expression of ASRPS as well as downstream STAT3/vascular endothelial growth factor (VEGF) pathways regulated by ASRPS in TNBC.
Identification of TNBC-specific LINC00908
To identify TNBC-specific lncRNAs, we first searched for lncRNAs with coding ability, using the following criteria: (1) encoding ORFs by ORFFinder; (2) the ORFs overlapping with the peaks of ribosome profiling data from the GWIPS-viz database. A totally of 583 lncRNAs were identified. Then we looked for lncRNAs that were differentially expressed in TNBC tissues when compared with normal tissues (set A), but not differentially expressed in non-TNBC tissues when compared with normal tissues (set B). A total of 26 candidate lncRNAs were identified (Fig. 1 A). Next, we measured the expression of these 26 lncRNAs by quantitative RT-PCR (qRT-PCR) in 53 pairs of TNBC and adjacent normal tissue samples, and 18 lncRNAs showed differential expression (Fig. 1 B). Of these 18 lncRNAs, only 10 of them did not show differential expression in 53 pairs of non-TNBC and normal tissue samples (Fig. 1 C). Finally, these 10 lncRNAs were analyzed by polysome profiling and qPCR in MDA-MB-231 and Hs578T cell lines, and only four of them were detected (Fig. 1 D). Using ORFFinder, all putative ORFs longer than 30 aa’s in the sense (+) orientation with an initiating ATG condon among these four lncRNAs were predicted (Fig. 1 E and Fig. S1, A–C), and constructed in pcDNA3.1 tagged with FLAG at the C terminal. The plasmids were transfected into TNBC cell line MDA-MB-231, and Western blot analysis indicated that only LINC00908-ORF3 translated into a polypeptide (Fig. 1 F and Fig. S1, A–C).
LINC00908 was annotated as an lncRNA in the human genome (NR_015417.1). We analyzed the expression of LINC00908 in the TCGA database and Cancer Cell Line Encyclopedia project and found that LINC00908 expression was lower in TNBC tissues or cells compared with non-TNBC tissues or cells (Fig. S1, D and E). LINC00908 transcript was detected in two pairs of human TNBC tissue samples by Northern blot analysis (Fig. S1 F). Both confocal microscopy analysis of fluorescent in situ hybridization (FISH) and nuclear/cytoplasm fractionation experiment showed that LINC00908 is a cytoplasmic RNA (Fig. S1, G and H).
LINC00908 encoded a polypeptide
To confirm that LINC00908 encoded a polypeptide, we inserted a FLAG-tag at the C terminal of the ORF (FLAG-KI), and the polypeptide was detected by immunofluorescence staining using anti-FLAG antibody (Fig. 1 G).
The ORF of LINC00908 is located on chromosome 18, covering three exons of LINC00908, and encodes a 60-aa peptide (6.62KD). The sequence information of the peptide and the region of LINC00908 encoding peptide are shown in Fig. S1 I. This was consistent with ribosome occupancy data from the GWIPS-viz database (Fig. S1 J). We named this polypeptide ASRPS.
To determine whether the in-frame ATG codon of the ASRPS was functional, we generated a plasmid (ASRPS-GFPmut) by fusing GFPmut ORF (in which the start codon ATGGTG has mutated to ATTGTT) to the C terminus of the ASRPS and transfected it into the MDA-MB-231 cell line (Fig. 1 H). 24 h after transfection, the expression of the GFP fusion protein was observed in MDA-MB-231 cells, suggesting that the ASRPS ATG codon was functional (Fig. 1 I). GFP fusion protein was also detected by Western blot analysis using anti-GFP antibody (Fig. 1 J). On the contrary, GFP fusion protein was not detected when the ATG codon in ASRPS was mutated (ASRPSmut-GFPmut). To rule out effect of the relative large size of GFP on the fusion protein, we also confirmed the functionality of ASRPS start codon by generating FLAG-tag fusion protein in MDA-MB-231 cells. The FLAG fusion protein was detected by both Western blot and immunofluorescence staining using anti-FLAG antibody (Fig. 1, K–M).
ASRPS was expressed endogenously and down-regulated in TNBC
To determine whether ASRPS was endogenously expressed, we performed polysome profiling in MDA-MB-231 cells. The mRNA-protein particles (mRNPs) were separated into three fractions: nonribosome (mRNPs without any ribosome), 40S-80S (mRNPs associated with ribosome but not being translated), and polysome (mRNPs being actively translated). The quantification of ASRPS RNA was determined via qRT-PCR in each fraction (Fig. S1 K). We showed that ASRPS RNA was enriched in the polysome fraction in the MDA-MB-231 cell. Using a Cy3-labeled in situ hybridization probe for LINC00908 and a FITC-labeled anti-ASRPS antibody, we detected both LINC00908 and ASRPS in the same tumor (Fig. S1 L) or adjacent normal tissues (Fig. S1 M), and both were highly expressed in the adjacent normal tissue. Furthermore, we generated rabbit polyclonal antibody against ASRPS, and both ASRPS-GFP and ASRPS-FLAG fusion proteins were detected (Fig. 1, J, L, and N). When LINC00908 was silenced by shRNA in MDA-MB-231 and Hs578T cells (Fig. S2 A), the amount of ASRPS detected by ASRPS antibody was significantly lower (Fig. S2, B and C), which further confirmed the specificity of ASRPS antibody.
The presence of ASRPS protein was detected in MCF7 and MDA-MB-231 cell lines by immunofluorescence assay (Fig. 2 A). ASRPS protein was also detected in TNBC and non-TNBC cell lines by Western blot analysis, and ASRPS expression was significantly lower in TNBC than in non-TNBC cell lines (Fig. 2 B). To confirm that ASRPS was translated from LINC00908 but not processed from other longer peptides, we showed that ASRPS protein was not detected when anti-LINC00908-ORF translation-blocking antisense oligo was transfected (Fig. 2 C).
Down-regulation of ASRPS promoted tumor growth in TNBC
To determine whether ASRPS had a function in TNBC progression, we knocked out ASRPS in a mouse xenograft TNBC model. The tumor volume was significantly higher when ASRPS was knocked out in TNBC cells (Fig. 2 D). To determine whether LINC00908 had a function in TNBC progression besides ASRPS, we knocked down LINC000908 alone or in combination with ASRPS. We showed that LINC00908 knockdown alone significantly promoted tumor growth, but did not further augment tumor growth in ASRPS KO TNBC cells (Fig. 2, E and F). Finally, we showed that reintroducing either full-length LINC00908 or ASRPS into ASRPS-KO TNBC cells could reverse tumor growth (Fig. 2 G). These results suggested that LINC00908 inhibited TNBC tumor growth mainly through ASRPS.
Low expression of ASRPS was associated with poor survival in TNBC patients
We next determined the expression of ASRPS in matched fresh primary BC and nontumorous tissues by Western blot analysis. ASRPS was down-regulated in TNBC tissues compared with matched nontumorous tissues; however, such down-regulation was not observed in non-TNBC tissues (Fig. 2 H).
To determine whether down-regulation of ASRPS expression was associated with OS among TNBC patients, we first stratified TNBC patients into two different groups: patients with high ASRPS expression (relative expression level greater than median expression level); and patients with low ASRPS expression (relative expression level less than or equal to median expression level). Using the log-rank test and Kaplan–Meier survival curves, we showed that patients with low ASRPS expression had significantly lower OS than patients with high ASRPS expression in both the discovery set (Suzhou cohort, 112 patients; log rank P = 0.018, hazard rate = 2.579) and the validation set (Guangzhou cohort, 105 patients; log rank P = 0.025, hazard rate = 2.465; Fig. 2, I and J).
ERα directly regulated LINC00908 transcription
To investigate the underlying mechanism of differential expression of LINC00908 in TNBC, we constructed luciferase reporter (pGL3-LINC00908) using the promoter region of LINC00908 (C1–C4; Fig. 3 A). Because ERα was down-regulated in TNBC cells compared with non-TNBC cells, we tested the hypothesis that ERα could directly regulate LINC00908 transcription. We showed that LINC00908 was significantly reduced in ER− BC tissues when compared with ER+ BC tissues in TCGA database (Fig. 3 B). Further, we showed that ASRPS polypeptide was significantly down-regulated in ERα− BC tumor tissues (Fig. 3 C). Next, we knocked down ERα in non-TNBC cell lines and overexpressed ERα in TNBC cell lines (Fig. S3, A and C). Luciferase assay indicated that ERα knockdown in non-TNBC cell lines significantly reduced transcription of LINC00908, whereas ERα overexpression (OE) increased transcription of LINC00908 in TNBC cell lines (Fig. 3, D and E). We also detected the expression level of ASRPS by Western blot (Fig. S3, B and D). This was consistent with the positive correlation between LINC00908 and ERα expression in BC tissues (Fig. 3, F and G).
Next, we performed the serial deletion experiment to narrow down the promoter region responsible for ERα-regulated LINC00908 transcription. Luciferase activity was significantly higher in MCF7 cells than in MDA-MB-231 cells as long as the C2 region of the promoter was present (Fig. 3 H). Three binding elements of ERα were identified by PROMO in the C2 promoter region of LINC00908 (Fig. 3 I). To determine which binding element was required for ERα-mediated LINC00908 expression, the three predicted ERα binding elements were individually deleted. The deletion experiment indicated that the regulation of ERα is lost in MCF7 when all three elements were deleted together (Fig. 3 J). To further confirm that C2 region was necessary for ERα-mediated LINC00908 expression, we performed chromatin immunoprecipitation (ChIP) using anti-ERα antibody followed by analysis of the C1–C4 region by qPCR in TNBC and non-TNBC cell lines. We showed that ERα was bound to the −1,500 to −1,000 region of LINC00908 promoter in the MCF7 cell line (Fig. 3 K). Luciferase analysis showed that deletion of the three ERα binding elements had no effect on LINC00908 expression in TNBC cell lines, but significantly reduced LINC00908 expression in non-TNBC cell lines (Fig. 3, L and M). This was consistent with the observation that the expression of LINC00908 was significantly higher in non-TNBC cell lines than in TNBC cell lines (Fig. 3 N). Moreover, we showed that 17 β-estradiol (E2) treatment promoted ASRPS in MCF7 cells, but not in MDA-MB-231 cells (Fig. S3 E).
ASRPS interacted with STAT3
Next, we performed an immunoprecipitation experiment in order to identify ASRPS-interacting proteins. We first transfected the ASRPS-FLAG construct into MDA-MB-231 and Hs578T cell lines. 48 h after transfection, the cell lysates were used to perform protein immunoprecipitation using the anti-FLAG antibody. Subsequently, mass spectrometry was used to identify coprecipitated proteins. Potential contaminated proteins were filtered out using the CRAPome database. We identified three candidate proteins that might bind to ASRPS, and we showed that only STAT3 was precipitated by anti-FLAG antibody (Fig. 4, A and B). We showed that the presence of RNase A did not abolish the interaction between ASRPS and STAT3, suggesting that the interaction between ASRPS and STAT3 was not mediated by RNA (Fig. 4 C). We next cotransfected ASRPS-FLAG and STAT3-HA into 293T, MCF7, and T47D cell lines. Both coimmunoprecipitation (coIP) and Western blot analysis showed that ASRPS directly interacted with STAT3 (Fig. 4 D). We further showed that endogenous ASRPS and STAT3 were present in the same protein complex (Fig. 4 E). We used mass spectrometry from coIP with STAT3 antibody in MCF7 to verify the presence of ASRPS (Fig. S3 F). We next determined which conserved domain of STAT3 was responsible for the interaction with ASRPS (Fig. 4 F). Serial deletion analysis of STAT3 indicated that the coiled-coil domain (CCD) of STAT3 was essential for the interaction with ASRPS (Fig. 4, G and H).
ASRPS inhibited STAT3 phosphorylation
Because phosphorylation at tyrosine 705 was essential for STAT3 activation (Yan et al., 2015), we determined whether ASRPS affected STAT3 phosphorylation. Because ASRPS expression was lower in TNBC cells than in non-TNBC cells, we overexpressed ASRPS in the TNBC cell line (MDA-MB-231) and knocked out ASRPS in the non-TNBC cell line (MCF7; Fig. S3 G). We showed that knocking out ASRPS in MCF7 cells increased IL-6–induced STAT3 phosphorylation. In MDA-MB-231 cells, STAT3 was phosphorylated even in the absence of IL-6 due to low expression of ASRPS, and ASRPS OE in MDA-MB-231 cells reduced STAT3 phosphorylation in the absence of IL-6 (Fig. 4 I). To investigate how ASRPS influenced STAT3 phosphorylation, we determined whether ASRPS directly interfered the interaction of JAK2 (the canonical kinase of STAT3) and STAT3 (Banerjee and Resat, 2016). coIP analysis indicated that both ASRPS and JAK2 coprecipitated with STAT3, and the amount of JAK2 associated with STAT3 was inversely correlated with the amount of ASRPS present in the complex (Fig. 4, J and K). To determine whether ASRPS also affected STAT3 transcriptional activity, we constructed a luciferase reporter using the STAT3-specific binding acute phase response element (APRE). We showed that IL-6–mediated STAT3 transcriptional activity was enhanced by down-regulation of ASRPS in MCF7 cells (Fig. 4 L). In MDA-MB-231 cells, both IL-6–dependent and IL-6–independent STAT3 transcriptional activities were reduced by OE of ASRPS (Fig. 4 M). Immunohistochemistry analysis showed that ASRPS expression level was negatively correlated with p-STAT3 in TNBC tissue samples (Fig. 4 N). Our data indicated that ASRPS inhibited STAT3 phosphorylation as well as its transcriptional activity.
Generation of ASRPS KO BC cell lines using CRISPR/Cas9 system
To further explore the role of ASRPS polypeptide in TNBC progression, we deleted ASRPS in two TNBC cell lines (Hs578T and MDA-MB-231) using the CRISPR/Cas9-mediated homologous recombination technique (Fig. S3, G–J). Two independent KO cell lines (Hs578TKO and MDA-MB-231KO) were generated. Sequence analysis revealed that both cell lines contained mutations at the ATG codon that prevented the translation of ASRPS (Fig. S3 G). We showed that in ASRPS KO cell lines, translation of ASRPS was decreased by polysome profiling and Western blot analysis, but transcription of LINC00908 was not affected by qRT-PCR analysis (Fig. S3, H–J).
ASRPS regulated the STAT3/VEGF signaling pathway
Previous reports indicated that the VEGF promoter contained the STAT3 binding site and STAT3 activation directly regulated VEGF expression (Niu et al., 2002; Wei et al., 2003). We determined whether ASRPS could affect VEGF expression through regulating STAT3 activation. We detected the canonical STAT3-binding motif in the JASPAR Database (Fig. 5 A). We used ChIP assay to measure STAT3 binding to the VEGF promoter and the VEGF promoter luciferase assay to measure its expression in TNBC cell lines (Fig. 5, B and C). We showed that ASRPS OE led to reduced binding of STAT3 to the VEGF promoter and reduced VEGF promoter activity, while ASRPS KO enhanced binding of STAT3 to the VEGF promoter and increased VEGF promoter activity. In vivo, we showed that ASRPS expression level in TNBC cell lines negatively correlated with VEGF expression (Fig. 5 D), and secreted VEGF level in the culture medium measured by ELISA was also negatively correlated with ASRPS expression level in TNBC cell lines (Fig. 5 E). However, when cells were treated with STAT3 inhibitor Stattic, VEGF expression was reduced regardless of ASRPS expression level, which correlated with a reduced level of phosphorylated STAT3 (Fig. 5, F and G). These results suggested that ASRPS regulated VEGF expression via STAT3 phosphorylation.
ASRPS repressed tumor angiogenesis
Because VEGF is a key player in angiogenesis and it also regulates proliferation and migration of vascular endothelial cells and tumor cells, we determined the effect of ASRPS expression on angiogenesis and migration of vascular endothelial cells. HUVECs were incubated with conditional medium collected from TNBC cells expressing variable amount of ASRPS, and the capillary tube formation, migration, and wound healing of the HUVECs were measured. ASRPS OE in TNBC significantly reduced the amount of capillary tubes formation by HUVECs, as well as migration and wound-healing ability. On the contrary, ASRPS down-regulation in TNBC significantly increased the tube formation, migration, and wound-healing ability (Fig. 6, A–C; and Fig. S4, A–C). Using an in vitro three-dimensional spheroid-based angiogenesis assay, ASRPS OE in TNBC reduced sprout generation in HUVECs, while ASRPS down-regulation in TNBC significantly induced sprout growth (Fig. 6 D and Fig. S4D). The F-actin staining indicated that the cytoskeleton arranged regularly in ASRPS knock-down cells while the cytoskeleton was in a nest shape and arranged irregularly in ASRPS OE cells, so we counted the percentage of the cells whose cytoskeleton were neatly arranged across the cell’s length (Fig. 6 E and Fig. S4 E). Electron microscopy analysis indicated that HUVEC junctions were tighter when incubated with medium collected from MDA-MB-231–overexpressing ASRPS (Fig. 6 F). In vivo Matrigel plug angiogenesis assay indicated that ASRPS expression level in TNBC cells inversely correlated with blood vessel formation (Fig. 6 G). Using a mouse xenograft model, we showed that ASRPS expression level negatively correlated with vascular endothelial cell marker CD31 expression and generation of microvessels (Fig. 6 H and Fig. S4 F). Similarly, ASRPS expression level in human TNBC tissues negatively correlated with CD31 expression level (Fig. 6 I).
We also determined the effect of LINC00908 on angiogenesis. Using a pull-down assay, we showed that LINC00908 did not directly interact with ASRPS (Fig. S5 A). When we overexpressed LINC00908-mut (in which the start codon ATG was mutated to ATT so no ASRPS was translated) in WT MDA-MB-231 and ASRPS OE MDA-MB-231 cells (Fig. S5 B), LINC00908-mut did not improve the level of phosphorylated STAT3 (Fig. S5 C). When these cells were injected subcutaneously into the hind flanks of the mice, there was no significant difference in angiogenesis in the tumor. Finally, LINC00908-mut OE did not reverse the effect of ASRPS OE (Fig. S5, D and E).
STAT3C expression fully reversed the effect of ASRPS OE
Since STAT3C has been reported to spontaneously dimerize through disulfide bonds (Bromberg et al., 1999), we determined whether overexpressed STAT3C in TNBC cells could reverse tumor angiogenesis suppressed by ASRPS OE. STAT3C indeed inversely promotes the expression of VEGF suppressed by ASRPS (Fig. 7, A–C). We showed that OE STAT3C could increase tube formation, migration, and wound-healing ability (Fig. 7, D–F; and Fig. S4, G–I). STAT3C expression also promoted sprout generation (Fig. 7 G and Fig. S4 J) and cytoskeleton organization (Fig. 7 H and Fig. S4 K) in HUVECs. STAT3C expression also promoted angiogenesis in the Matrigel plug angiogenesis assay (Fig. 7 I). In the mouse xenograft model, the CD31 immunofluorescence staining of tumor indicated that STAT3C expression increased generation of microvessels (Fig. 7 J and Fig. S4 L). Based on the above results, we concluded that STAT3C OE fully reversed the effect of ASRPS OE.
ASRPS inhibited BC angiogenesis in MMTV-PyMT mouse mammary tumor model
To further confirm the role of ASRPS in BC angiogenesis, we generated a mouse model by introducing ASRPS into MMTV-PyMT mice to generate MMTV-PyMT; ASRPS+/+ mice. Immunostaining with anti-CD31 indicated that the recruitment of blood vessels was significantly decreased in MMTV-PyMT; ASRPS+/+ mouse tumor (Fig. 8 A). We also showed that direct injection of exogenous ASRPS into the mammary pad of MMTV-PyMT mice significantly reduced angiogenesis in the primary tumor (Fig. 8 B). We concluded that ASRPS inhibited tumor angiogenesis in the MMTV-PyMT mouse mammary tumor model.
ASRPS is a potential antitumor peptide
Because ASRPS could inhibit angiogenesis in mouse mammary tumor models, we determined whether ASRPS had antitumor activity. In the TNBC mouse xenograft model, intratumoral injection of ASRPS significantly improved survival (Fig. 8 C). Furthermore, the immunostaining with anti-CD31 of mouse xenograft tumor showed that ASRPS significantly reduced the angiogenesis (Fig. 8 D). These results indicated that ASRPS could improve OS by effectively suppressing tumor angiogenesis.
Using the GWIPS-viz database, we identified 26 lncRNAs that were differentially expressed in TNBCs when compared with adjacent normal tissues, as well as when compared with non-TNBC tissues. TNBC-specific expression of LINC00908 was regulated by ERα. Both in silico and in vitro analyses indicated that LINC00908 encoded a polypeptide (ASRPS). ASRPS was down-regulated in TNBC, associated with increased tumor growth and poor OS. We showed that ASRPS was directly bound to STAT3, inhibited phosphorylation of STAT3, down-regulated VEGF expression, and therefore reduced tumor angiogenesis in both BC cell lines and mouse BC models. Finally, we showed that ASRPS is a potent antitumor peptide.
Several studies analyzed lncRNA expression profiles in TNBC. Yang et al. (2018) analyzed lncRNA expression profiles between 111 TNBC and 104 noncancerous tissues using RNA sequencing data from TCGA, and identified 1,441 lncRNAs differentially expressed in TNBC, and 109 of them were associated with survival. However, it was unclear whether these differentially expressed lncRNAs were specific to TNBC. Lv et al. (2016) analyzed lncRNA expression levels in three TNBC and three age-matched non-TNBC samples, and identified 880 up-regulated and 784 down-regulated lncRNAs in TNBC. 70 of these lncRNAs were further analyzed in 46 BC tissues and BC cell lines. 38 of them were up-regulated and 32 were down-regulated in TNBC. A panel of four lncRNAs could be used as biomarkers to diagnose TNBC (Lv et al., 2016). Liu et al. (2016) performed transcriptome analysis on 165 TNBC samples and identified various lncRNAs associated with TNBC subtypes. Koduru et al. (2017) analyzed publicly available small RNA sequencing data derived from 24 TNBC samples and identified 61 differentially expressed lncRNA. 28 were up-regulated and 33 were down-regulated. In the current study, we focused on identifying TNBC-specific lncRNAs with coding potential. Using publicly available sequencing data on TNBC tissues, we first identified lncRNAs that were differentially expressed in TNBC tissues when compared with adjacent normal tissues, then selected those lncRNAs that were not differentially expressed in non-TNBC tissues when compared with normal tissues. We identified one lncRNA, LINC00908, which was differentially expressed in TNBC and encoded a polypeptide, ASRPS. Although polypeptide-encoding lncRNAs have been reported before (Huang et al., 2017), to our knowledge, LINC00908 was the first polypeptide-encoding lncRNA differentially expressed in TNBC.
To elucidate the mechanism of TNBC-specific expression of LINC00908, we analyzed the LINC00908 promoter. Using the luciferase reporter assay, we showed that LINC00908 expression as well as ASRPS was directly regulated by ERα expression. Promoter deletion and ChIP analyses indicated that ERα was directly bound to the −1,500 to −1,000 region in the LINC00908 promoter. This was consistent with our clinical data that LINC00908 expression was significantly higher in ER+ BC than ER− BC. Previous studies have identified ER-regulated lncRNAs in BC. For example, Zhang et al. (2017b) identified 114 ER-regulated lncRNAs in BC by a genome-wide analysis; Li et al. (2018) identified lncRNA MIAT overexpressed in ER+ BC; Wang et al. (2019) showed that LINC00472 was up-regulated by ERα in BC; and Jonsson et al. (2015) showed that LINC01016 and LIN00160 were direct transcription targets of ERα. Our study was the first that reported the identification of ERα regulation of lncRNA expression as well as lncRNA-encoded polypeptide.
Our clinical data indicated that ASRPS expression was down-regulated in TNBC, associated with increased tumor growth and poor OS. To elucidate the functional mechanism of ASRPS, we showed that ASRPS was directly bound to STAT3. All STAT proteins have similar structures, consisting of six conserved domains: a N terminal domain followed by a CCD, a DNA-binding domain, a linker domain, a SH2 domain, and a C terminal transactivation domain. Our data indicated that the CCD of STAT3 was essential for the interaction with ASRPS. Previous studies indicate that the CCD of STAT3 is essential for auto-phosphorylation (Ma et al., 2003; Zhang et al., 2000), and certain proteins could inhibit STAT3 activation by binding to STAT3 CCD domain (Chang et al., 2018; Mattagajasingh et al., 2012). Even K116, a STAT3 allosteric inhibitor, inhibited STAT3 phosphorylation at Tyr705 by binding to the CCD of STAT3 (Huang et al., 2018a). Using molecular docking, we showed that the ASRPS binding region in the CCD domain overlaps with K116 binding (Fig. S5 F). This suggests that ASRPS could also affect STAT3 phosphorylation through binding to the STAT3 CCD domain. Further, we showed that ASRPS binding to STAT3 inhibited STAT3 phosphorylation, but also inhibited IL-6–induced STAT3 activation. This is consistent with current knowledge on the STAT3 CCD domain: it is not only essential for its activation via SH2 domain-mediated receptor binding and IL-6 induction, but also essential for its activation via phosphorylation (Zhang et al., 2000).
Using ASRPS KO BC cell lines as well as mouse BC models, we showed that ASRPS down-regulated VEGF expression and angiogenesis through STAT3 phosphorylation and angiogenesis. In the TNBC mouse model, direct intravenous injection of ASRPS reduced angiogenesis and tumor growth. Only a few studies have identified lncRNAs regulating angiogenesis in BC (Huang et al., 2018b; Zhang et al., 2017a), and no studies have investigated angiogenesis-regulating polypeptide encoded by lncRNA in BC. Our data suggested that ASRPS is a potent antitumor peptide through down-regulating tumor angiogenesis.
It is well known that TNBC is characterized by high cellular proliferation. Consequently, TNBC is associated with increased angiogenesis compared with non-TNBC. Microvascular density is significantly higher in TNBC than in non-TNBC (Ribatti et al., 2016), and VEGF levels are significantly higher in TNBC as well (Bahhnassy et al., 2015). Targeting angiogenesis has been proposed to treat TNBC patients. Our data suggest that ASRPS down-regulated angiogenesis in TNBC and ASRPS is a potent antitumor peptide; therefore, targeting tumor angiogenesis through ASRPS might represent a novel targeted therapy in TNBC.
In summary, we identified an ERα-regulated lncRNA LINC00908, which was down-regulated in TNBC, associated with increased tumor growth and poor OS. LINC00908 encoded a polypeptide, ASRPS, that regulated angiogenesis in TNBC via the STAT3/VEGF pathway. ASRPS is a potent antitumor peptide. Our results suggest that LINC00908 as well as its encoded ASRPS are potential prognostic markers and therapeutic targets for TNBC.
Materials and methods
Human study subjects
112 pairs of fresh TNBC tissues and their adjacent noncancerous tissues were obtained from patients in eastern China who underwent tylectomies at the Affiliate Hospitals of Soochow University (Suzhou, China). Another 105 pairs of fresh TNBC tissues were collected from patients in southern China at the Cancer Hospitals affiliated with Guangzhou Medical University. These cases were selected based on a clear pathological diagnosis, and the patients had not received preoperative anticancer treatment. This study was approved by the Medical Ethics Committees of Soochow University and the Ethical Committee of Guangzhou Medical University, and written informed consent was obtained from all participants or their appropriate surrogates. The clinical characteristics of the patients are listed in Table S1.
All cell lines were purchased from Procell Life Science and Technology. These cell lines were all authenticated using short tandem repeat analysis as described in 2012 in the American National Standards Institute standard (ANSI/ATCCASN-0002-2011 Authentication of Human Cell Lines: Standardization of STR Profiling) by the American Type Culture Collection Standards Development Organization and passaged less than 6 mo in this study. DMEM, RPMI-1640, and FBS were purchased from Invitrogen. BT549 and BT474 cells were grown in RPMI-1640 medium supplemented with 10% FBS plus 0.01 mg/ml bovine insulin; MCF7 and T47D cells were grown in MEM supplemented with 10% FBS plus 0.01 mg/ml bovine insulin. MDA-MB-468, MDA-MB-231, and 293T cells were grown in DMEM supplemented with 10% FBS; Hs578T cells were grown in DMEM supplemented with 10% FBS plus 0.01 mg/ml bovine insulin; SKBR3 cells were grown in McCoy's 5A medium supplemented with 10% FBS; and HUVECs were grown in endothelial cell medium (Sciencell, 1001). All cell lines were grown in penicillin/streptomycin-containing medium at 37°C in a humidified atmosphere with 5% CO2.
Animal breeding and treatments
Nude mice with a BALB/C background (female, 6–8 wk of age) and C57BL/6 WT mice were purchased from Shanghai Laboratory Animal Center at the Chinese Academy of Sciences, Shanghai, China. MMTV-PyMT mice with a C57BL/6 background were obtained from Z. Dong’s laboratory at the Institute for Immunology and School of Medicine, Tsinghua University, Bejing, China. The congenic CMV-Cre recombinase mice were constructed from C57BL/6 mice. All animal studies were conducted with the approval of the Soochow University Institutional Animal Care and Use Committee and were performed in accordance with established guidelines.
Hs578T and MDA-MB-231 cell lines were used for various manipulations: (1) ASRPS KO, (2) LINC00908 knockdown, (3) ASRPS KO/reexpression of ASRPS, (4) ASRPS KO/reexpression of LINC00908, and so on. A total of 0.1 ml of cell suspension (1 × 106/ml) was injected subcutaneously into the hind flanks of the nude mice.
For tumor angiogenesis experiments, PBS or synthetic ASRPS peptide (15 mg/kg) was injected intratumorally in the nude mice when the tumor volume reached ∼100 mm3. Mice were scheduled for euthanasia when tumor volumes were ≥1,000 mm3. The CD31 immunofluorescence staining of tumors were directly visualized in randomly selected fields. ASRPS (15 mg/kg) was injected into the mammary fat pad of 8-wk-old MMTV-PyMT mice. The CD31 immunofluorescence staining of tumors was directly visualized in randomly selected fields 12 wk after implantation. All WT mice or control mice used in the study are littermates.
Generation of MMTV-PyMT;ASRPS+/+ mice
The MMTV-PyMT;ASRPS+/+ mice were generated using the gene-targeting construct. A single copy of the ASRPS cDNA was inserted downstream to the STOP cassette (the cassette containing elements designed to terminate both transcription and translation) through homologous recombination into the ROSA26 (R26) locus, so the exogenous ASRPS cDNA is transcribed under the control of the R26 promoter, allowing a reproducible and stable OE of ASRPS. This gene-targeting construct was electroporated into C57BL/6 mice embryonic stem cells; subsequently the recombinant embryonic stem clones were microinjected to generate chimeric C57BL/6 mice.
To analyze the in vivo consequences of ASRPS OE, conditional knockin animals were crossed with the CMV-Cre strain in order to obtain ASRPS+/− heterozygous mice. ASRPS+/+ homozygous animals were generated by intercrossing heterozygous mice. Finally, C57BL/6 ASRPS+/+ mice were crossed with C57BL/6 MMTV-PyMT mice to generate MMTV-PyMT;ASRPS+/+ on the C57BL/6 background. All mice we used were on a C57BL/6 background.
Data processing and differential expression analysis
The gene expression profile of BC RNA sequencing data and matched clinicopathological information BC patients in this study were downloaded from Genomic Data Commons (https://gdc.cancer.gov/). We inspected the gene expression status of ESR1, progesterone receptor, and HER2 of each patient in the clinical data. TNBC patients were defined as patients with negative ESR1 and progesterone receptor status, as well as zero HER2 immunohistochemistry level. In total, 113 normal samples, 18 TNBC samples, and 347 non-TNBC samples were selected for the differential expression analysis. DESeq2 was applied to identify the differentially expressed genes between TNBCs and normal samples, and between TNBCs and non-TNBCs. Genes were considered as statistically significantly differentially expressed with a false discovery rate < 0.01 and absolute log2 fold change > 2.
Prediction of lncRNAs with coding ability
The human lncRNAs sequences were obtained from GENCODE. Small ORFs of lncRNAs were predicted by ORFFinder using the parameters with “-strand plus -ml 30 -s 0.” The predicted ORFs were aligned to the human reference genome version hg38 using Bowtie2 and transformed to BED format by Bedtools. Then the predicted ORFs overlapping with the GWIPS-viz dataset that filtered with data value ≥100 were extracted by UCSC Table Browser. Finally, these ORFs were annotated using R package ChIPseeker.
All experiments were repeated at least three times unless stated in the figure legends. Two-tailed paired Student’s t tests were applied for comparison between two groups. Survival curves were obtained using the Kaplan–Meier method and compared using the log-rank test. Statistical analyses were performed using Prism 7 software. The data were presented as the means ± SD except where stated otherwise. The differences with *, P < 0.05, **, P < 0.01, or ***, P < 0.001 were considered statistically significant.
Trizol total RNA isolation reagent (Invitrogen) was using for extracting total RNA from cells and tissues. cDNA was then generated by reverse transcription of aliquots of RNA using M-MLV RNase H-Reverse transcription (Comiike) according to the manufacturer’s instructions, and the cDNA was used for real-time quantitative PCR using Novostart SYBR qPCR Super Mix Plus (Novoprotein). Primers used for real-time RT-PCR are shown in Table S4.
Northern blot analysis
Northern blot was performed to analyze LINC00908 transcript using NorthernMax Kit (Invitrogen) according to the manufacturer’s instructions. About 10 µg of RNA from each sample was subjected to formaldehyde gel electrophoresis and transferred to a Biodyne Nylon membrane (Pall). The PCR primers used to generate the Northern blot probe are listed in Table S4.
A series of ORF-GFP fusion plasmids were transfected into MDA-MB-231 cells, and GFP fluorescence was directly visualized by Nikon Eclipse Ti-U fluorescence microscope. MDA-MB-231 and MCF7 cells were plated on glass coverslips, fixed with 4% paraformaldehyde at room temperature for 15 min, rinsed three times with phosphate-buffered saline Triton (PBSTx; 0.3% Triton X-100), incubated with anti-ASRPS antibody and anti-FLAG antibody overnight at 4°C, washed twice with Tris-buffered saline Triton (TBSTx), and incubated with FITC secondary IgG antibodies (Beyotime) at room temperature for 1 h in the dark. Cell nuclei were stained with DAPI. Fluorescence images were analyzed using Leica SP5 Confocal Microscopes.
Tumor tissues were fixed and frozen in optimum cutting temperature compound (Sakura). Sections of 10-µm thickness were cut at −20°C. The sections were incubated with CD31 overnight at 4°C. After three rinses with PBST (0.3% Triton X-100), sections were incubated for 1 h at room temperature with Cy3 secondary IgG antibodies in the dark (Beyotime). Cell nuclei were stained with DAPI. Fluorescence images were analyzed using Leica SP5 Confocal Microscopes.
RNA FISH experiments were performed to detect LINC00908 RNA in BC cells using the FISH Detection Kit (Ribo) according to the protocol.
Cell nucleus/cytoplasm fraction isolation
Cell nucleus/cytoplasm fraction isolation was performed as previous described (Li et al., 2017). To determine the cellular localization of LINC00908, cytosolic and nuclear fractions were collected according to the manufacturer’s instructions for the nuclear/cytoplasmic Isolation Kit (Biovision).
ASRPS knockin and KO by CRISPR/Cas9
The guide RNA (gRNA) designed specifically for the ASRPS start codon was cloned into the Cas9/gRNA (puro-GFP) vector (VK001-02, ViewSolid BioTech), and donor oligo was designed and synthesized. All sequence information is shown in Table S4. The gRNA construct and donor oligo were transfected into cells using Lipofectamine 3000 (Invitrogen), following the manufacturer’s instructions. Single-cell clones were selected and evaluated using the T7 endonuclease assay to detect mutants. Next, the constructed clones were selected and sequenced using a 3100 Genetic Analyzer (ABI) to confirm the presence of mutation in the target sequence.
Anti-ASRPS antibody preparation
Peptide synthesis and anti-ASRPS antibody generation were performed as previously described with some modifications (Yu et al., 2017). Briefly, a BSA and OVA-coupled peptide RGGRIWSSDPRSDG-Cys was synthesized, and polyclonal antibodies against the LINC00908 peptide were obtained from inoculated rabbits. Antibodies were purified using affinity chromatography on columns containing the corresponding peptides.
ChIP in BC cells was performed using ERα and STAT3 antibodies and the ChIP Assay Kit (Millipore) according to the manufacturer’s protocol.
Dual-luciferase reporter assay
The corresponding DNA sequence of LINC00908 promoter was cloned into the pGL3-basic vector (Promega). The promoter with the ERα element deleted was also cloned into the pGL3-basic vector. The luciferase activities of the reporters were measured using the Dual Luciferase Reporter Assay System (Promega) according to the manufacturer’s protocol.
Western blot analysis and coIP
Tissues or cells were lysed with radioimmunoprecipitation assay lysis buffer (Solarbio) with protease inhibitor and phosphatase inhibitor. Western blot analysis was performed as in our previous study (Wu et al., 2015), and the Western results were quantified by ImageJ software (Wu et al., 2015).
For immunoprecipitation, cell lysate (∼1.5 mg total protein) was harvested using ice-cold nondenaturing lysis buffer (Thermo Fisher Scientific). coIP assay was performed using the Pierce Co-Immunoprecipitation Kit (Thermo Fisher Scientific) according to the manufacturer’s protocol. Briefly, the corresponding antibody was first immobilized for 2 h using AminoLink Plus coupling resin. The resin was then washed and incubated with cell lysate overnight. After incubation, the resin was again washed and protein eluted using elution buffer. All antibodies used are listed in Table S3.
Lentiviral production and transduction
The full length of ASRPS-FLAG was cloned into the lentiviral expression vector pLVX-IRES-neo (Clontech Laboratories). The shRNAs of LINC00908 lentivirus vector were obtained from ABM. Using a three-plasmid transient cotransfection method (Lenti-T HT packaging mix, Clontech Laboratories), 293T cells were cotransfected with three vectors. Lentiviruses were harvested and concentrated. For transduction, MCF7, T47D, MDA-MB-231, and Hs578T cells were infected with the concentrated virus in the presence of polybrene (Sigma-Aldrich). After 48 h of transduction, the cells were stably selected with G418 at 500 µg/ml (GIBCO BRL), and the drug-resistant cell populations were selected for subsequent studies. The lentiviruses and plasmids used in this study are listed in Table S2.
siRNAs targeting ERα were synthesized by Genepharma. A nontargeting scrambled siRNA (Dharmacon) was used as a control. The knockdown efficiency and specificity of all siRNAs were validated by qRT-PCR. The siRNAs used in this study are listed in Table S5.
Tube formation assay
Matrigel (10 µl, Corning) was first thawed on ice at 4°C overnight, then loaded into each well of a precooled µ-Slide Angiogenesis plate (Ibidi). The plate was incubated at 37°C for 30 min for hardening. 15,000 HUVECs in 100 µl conditioned cell culture medium were plated onto the precoated Matrigel for 6 h. The resulting capillary-like structures were then photographed with a microscope and counted with ImageJ software.
Migration and wound healing assay
For the cell migration assay, HUVECs (10,000 cells) in serum-free media (200 µl) were plated in the upper chamber (Corning), and the conditioned cell culture medium (prepared from ASRPS OE or KO MDA-MB-231/Hs578T cells) was placed in the lower chamber, and incubated for 48 h at 37°C to allow the cells to migrate to the lower chamber. The cells that had migrated through the membrane were fixed with 4% paraformaldehyde for 30 min, then stained with crystal violet (Invitrogen) for 10 min, imaged, and counted using a microscope.
For the wound healing assay, HUVECs were seeded on cell culture inserts (Ibidi) in 35-mm dishes and coincubated with mitomycin C at 37°C in 5% CO2 and grown to confluence as a monolayer. Then inserts were removed and the conditioned cell culture medium (prepared from ASRPS OE or KO MDA-MB-231/Hs578T cells) was added. After 12 h, cells were imaged. The gap was analyzed using ImageJ software. Each experiment was repeated in triplicate.
Spheroids were generated as described before (Korff and Augustin, 1998). Briefly, spheroids embedded into the collagen gel containing the conditional medium were applied on top of the gel. After culturing for 24 h, spheroid sprouts were evaluated by measuring the cumulative length of all capillary-like sprouts using a microscope. At least 10 randomly selected spheroids per experimental group were analyzed. Sprout length was measured with ImageJ software.
ELISA analysis of VEGF expression
VEGF secreted into the culture medium was measured using a Human VEGF ELISA kit (Bio-Swamp) following the manufacturer’s protocol. VEGF concentration was calculated using a standard curve.
Matrigel plug assay
The Matrigel plug assay was performed using an in vivo angiogenesis assay. In brief, the conditioned cell culture medium (prepared from ASRPS OE or KO MDA-MB-231/Hs578T cells) was collected. 400 μl of Matrigel mixed with 300 µl of conditioned medium and 5.0 × 106 HUVECs were injected into the ventral area of 6-wk-old nude mice. After 10 d, the skin of mice was pulled back with scissors to expose intact Matrigel plugs, and plug images were taken.
Online supplemental material
Fig. S1 shows screening of candidate lncRNAs and biological characterization of LINC00908. Fig. S2 shows that anti-ASRPS antibody specifically detects the polypeptide ASRPS. Fig. S3 shows silencing and OE of ERα, and mass spectrometry results of ASRPS and ASRPS KO in BC cell lines. Fig. S4 shows ASRPS affected migration, invasion, and angiogenesis in BC cell lines, but STAT3C expression fully reversed the effect of ASRPS OE. Fig. S5 shows LINC00908 cannot have an effect on angiogenesis or reverse the effect of tumor angiogenesis caused by ASRPS. Table S1 shows primary clinical and histological characteristics of 217 study patients. Table S2 shows virus strains and recombinant DNA used in this study. Table S3 shows antibodies used in this study. Table S4 shows primers and gRNA used in this study. Table S5 shows sequences of siRNAs and shRNAs used in this study.
This work was supported by the National Scientific Foundation of China (grants 81772544 and 81972649); the Science Foundation for Distinguished Young Scholars in Jiangsu (BK20160008); a project funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions; National Key R&D Program of China (2016YFC1302100); and the Program for Guangdong Introducing Innovative and Entrepreneurial Teams (2017ZT07S096).
The authors declare no competing financial interests.
Author contributions: Y. Zhou designed the study. Y. Wang, S. Wu, L. Zhang, and X. Zhu performed the experiments. F. Li, J. Deng, B. Guo, S. Zhang, R. Wu, Z. Zhang, K. Wang, and J. Lu helped with investigation and formal analysis.
Y. Wang, S. Wu, X. Zhu, and L. Zhang contributed equally to this paper.