Research Paper Volume 13, Issue 21 pp 24360—24378

Polymorphisms in lncRNA MIR2052HG and susceptibility to breast cancer in Chinese population

Hui Yang1, , Qiuyu Sun2,3,4, , Feifei Chong2,3,4, , Xiaoru Jiang2,3,4, , Yanli Wang2,3,4, , Kedi Xu2,3,4, , Yuanlin Zou2,3,4, , Linping Xu5, , Chunhua Song2,3,4, ,

  • 1 Department of Gastroenterology and Hepatology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Jinshui, Zhengzhou 450003, Henan, China
  • 2 Department of Epidemiology and Statistics, College of Public Health, Zhengzhou University, Zhengzhou 450001, Henan Province, China
  • 3 Henan Key Laboratory of Tumor Epidemiology, Zhengzhou University, Zhengzhou 450052, Henan Province, China
  • 4 State Key Laboratory of Esophageal Cancer Prevention and Treatment, Zhengzhou University, Zhengzhou 450052, Henan Province, China
  • 5 Medical Research Office, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou 450003, Henan Province, China

Received: June 11, 2021       Accepted: October 25, 2021
How to Cite

Copyright: © 2021 Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.


Background: Published studies based on pharmacokinetics have explored the relationship between the lncRNA MIR2052HG and the prognosis of breast cancer (BC) resistance and recurrence. However, the underlying association of MIR2052HG SNPs with BC development remains unclear.

Methods: Combining bioinformatics and databases, SNPs (Single Nucleotide Polymorphisms) in the MIR2052HG gene were screened, and SNPs in the lncRNA MIR2052HG were selected for genotyping among 504 Chinese Han patients and 505 healthy controls, which were frequency-matched for age (±2 years). Logistic regression analysis was used to explore the association between MIR2052HG SNPs and the BC risk. Interactions between the MIR2052HG SNPs and reproductive factors were further evaluated using the multifactor dimensionality reduction (MDR) method. qRT–PCR was performed to detect MIR2052HG expression in individuals with different genotypes of rs34841297. The target miRNA, miR-4456 of MIR2052HG rs34841297 was predicted by websites and confirmed by performing dual luciferase gene reporter assays. CCK-8 and Transwell experiments were designed to explore the effects of miR-4456 on the proliferation, invasion and migration of BC cells.

Results: In this study, nine SNPs were screened. After adjusting for age, menarche age, menopausal status, number of pregnancies, history of abortions, breast feeding history and family history of BC, the results of the logistic regression analysis showed the rs34841297 A/- gene polymorphism was positively correlated with the incidence of BC. Compared with the AA genotype, patients with the A-+-- genotype of rs34841297 at age<50 years, and menarche age<14 years, Premenopausal status, history of abortion, no history of breastfeeding and no family history of tumors in first-degree relatives had an increased risk of BC. MDR results revealed that individuals with rs34841297 - (homozygous deletion) of the A allele who were not menopausal and had no history of breastfeeding had a higher risk of BC. qRT–PCR results revealed that homozygous deletion (1.68±1.37) of the rs34841297 A- genotype resulted in higher MIR2052HG expression than the heterozygous deletion genotype (0.95±0.94) and wild AA genotype (0.26±0.12). Binding between MIR2052HG and miR-4456 was occurred when rs34841297 carried the AA genotype. Moreover, preliminary functional studies indicated that the overexpression of miR-4456 increased the proliferation, invasion and migration of BC cells.

Conclusion: Our study showed that the MIR2052HG gene polymorphism may be related to BC susceptibility, and the MIR2052HG rs34841297 A/- genotype may probably affect the proliferation, invasion and migration of BC cells by modulating the interactions with of miR-4456.


According to the latest global cancer burden data released by the International Agency for Research on Cancer (IARC) of the World Health Organization in 2020 [1], the incidence of breast cancer (BC) in women exceeds that of lung cancer, accounting for 11.7% of all new tumors and becoming the most common cancer in the worldwide. Approximately 685,000 deaths from BC were reported, making it the fifth leading cause of cancer-related death in the world [1], 90% of which were caused by distant metastasis of primary tumor cells [2, 3]. Among women, the BC incidence and mortality rank first among 159 countries in the world, and the relative increase in cancer risk was the largest in low or medium countries/regions (95% increase and 64% increase from 2020, respectively). Although the mortality rate of BC had decreased in 1991-2017 [4], in the past ten years (2008-2017), the rate of decrease in mortality of females with BC has gradually slowed. The aforementioned survey data showed that the previous preventive or treatment measures were not effective, and the high morbidity and mortality rates have become a public health problem that seriously threatens women’s health. Therefore, effective and cost-effective early detection, early diagnosis and individualized treatment of BC have become urgent problems that remain to be solved worldwide. Current studies have reported many reproductive and environmental factors related to BC [57]. Genetic factors are also indispensable for increasing the risk of BC. BRCA1 and BRCA2 are currently recognized BC susceptibility genes and are widely measured as predictors of BC risk [810]. Recently, with the development of high-throughput sequencing technology, noncoding RNAs have been extensively studied, and lncRNAs, new BC biomarkers, are involved in some biological processes, such as cell proliferation, cell cycle, apoptosis, pluripotency differentiation and maintenance [11], resulting in the occurrence and development of some cancers including BC, liver cancer, and lung cancer, by promoting tumor proliferation, invasion, and metastasis [1215]. The LncRNA MIR2052HG, also known as FLJ39080 and LOC441355, is a long non-coding RNA located on chromosome 8. James N et al. [16] found that the risk of BC recurrence in individuals with homozygous mutant and heterozygous genotypes of the lncRNA MIR2052HG was lower than that in the wild type homozygous individuals. In addition, MIR2052HG overexpression increases BC cell proliferation and promotes colony formation. Pharmacogenomics studies [17] have shown that as a functional polymorphic gene, MIR2052HG might affect the risk of BC recurrence in women treated with aromatase inhibitors. Single Nucleotide Polymorphisms (SNPs) are polymorphisms in DNA sequences caused by variations in single nucleotides at the genome level. According to genome-wide association studies (GWAS) [18], SNPs in lncRNAs are related to susceptibility to many diseases, and SNPs at the key regulatory position of lncRNAs may substantially disrupt their function. Wang L et al. [19] also found that the SNP rs3802201 in MIR2052HG is closely related to the recurrence. However, this study was mainly based on pharmacogenomics to explore the relationship between the lncRNA MIR2052HG and BC resistance and recurrence. Researchers have not clearly determined whether there is an association between the genetic variants of MIR2052HG and BC susceptibility exists. Therefore, relying on the Han population in Henan, this project screened SNPs in the lncRNA MIR2052HG that affect the occurrence of BC and studied the possible molecular mechanism to discover new risk markers for BC. The results might facilitate the early identification and diagnosis of BC in high-risk populations to achieve the purpose of early prevention of BC.


Basic characteristics of study subjects

Based on a case-control study, the basic information of 504 patients and 505 healthy controls was presented in Table 1. The results showed that the age at menarche of the case group (14.21±1.70) was higher than that of the control group (13.97±1.75) (P=0.030), and ≥2 pregnancies (OR:2.049, 95% CI: 1.592-2.637, P<0.001) and the family history of BC (OR:1.869 95% CI: 1.116-3.130, P=0.017) were risk factors for BC. However, a breastfeeding history (OR:0.724, 95% CI: 0.535-0.980, P=0.037) protected against BC.

Table 1. Basic characteristics of 504 breast cancer cases and 505 healthy controls.

VariableCase (%)Control (%)Pb
Age (x¯±s)48.00 ± 9.8548.15 ± 9.610.806a
Age at menarche (x¯±s)14.21 ± 1.7013.97 ± 1.750.030a
Age at menopause (x¯±s)48.60± 3.9048.72 ± 3.700.760a
Menopause statue
Un-menopause320 (63.5)294 (58.2)
Post-menopause184 (36.5)211(41.8)0.086
Number of pregnancies
≥2451 (89.5)411 (81.4)<0.001
Number of abortions
<2340 (67.5)345(68.3)
≥2164 (32.5)160 (31.7)0.771
Breast-feeding history
No121 (24.0)94 (18.6)
Yes383 (76.0)411(81.4)0.037
Family history
No461 (91.5)481(95.2)
Yes43 (8.5)24 (4.8)0.017
ER receptor
PR receptor
HER-2 receptor
at test.
bχ2 test, Two-sided P<0.05 is statistically significant.

Association of MIR2052HG SNPs with BC susceptibility

Different genotype models of nine MIR2052HG SNPs with BC susceptibility were presented in Table 2. The MIR2052HG rs3802201 CG+GG genotype, rs2553716 AC genotype and AC+CC genotype, rs2588297 GT genotype and GT+TT type, rs4259395 AG+GG genotype, rs10957736 TT genotype and CT+TT genotype and rs12546233 AC+CC genotype might reduce the risk of BC, while the rs34841297 deletion mutation increases the risk of breast cancer. Compared with the wild homozygous AA genotype, the SNP rs34841297 --(homozygous deletion) genotype (OR: 1.936, 95% CI: 1.208-3.123) and A-+--genotype (OR: 1.704, 95% CI: 1.087-2.672) resulted in a higher risk in BC.

Table 2. Association between nine SNPs and breast cancer susceptibility.

SNPsGenetic modelGenotypeCase (n=504, %)Control (n=505, %)PaAdjusted OR (95%CI)Pb
aP value of Hardy-Weinberg equilibrium in controls.
bP value of logistic regression analysis with adjusted for age, age at menarche, menopausal status, number of pregnancies and abortions, breast-feeding status, and family history of BC in first degree relative.

Stratified analysis of the association between MIR2052HG SNPs and breast cancer susceptibility

A stratified analysis was conducted to further explore the relationship between the nine SNPs in the MIR2052HG gene and BC susceptibility. As shown in Table 3, compared with the rs34841297 AA genotype, the A-+-- genotype of rs34841297 in patients age<50 years (OR: 1.998, 95% CI: 1.072-3.723) with an age at menarche<14 years (OR: 2.823, 95% CI: 1.316-6.005) who were non-menopausal (OR: 2.490, 95% CI: 1.288-4.815), had a history of abortion (OR: 2.045, 95% CI: 1.117-3.744), no history of breastfeeding (OR: 3.290, 95% CI: 1.183-3.070) and had family history of tumors in first-degree relatives (OR: 1.905, 95% CI: 1.183-3.070) had an increased risk of BC. However, the CG+GG genotype of rs3802201, the AC+CC genotype of rs2553716, the AG+GG genotype of rs4259395, the rs2588297 GT+TT genotype, the CT+TT genotype of rs10957736 and the AC+CC genotype of rs12546233 were all found to exert a protective effect on the BC risk.

Table 3. Stratification analysis of the nine SNPs and BC susceptibility.

rs3802201(CG+GG/CC) OR(95%CI)Pars2553716(AC+CC/AA) OR(95%CI)Pars4259395(AG+GG/AA) OR(95%CI)Pa
Age at menarche
Menopausal state
Age at menopause
No. of pregnancies
<30.854(0.572,1.273)0.4380.861 (0.577,1.283)0.4620.882(0.583,1.333)0.551
History of abortion
Family history
rs2588297(GT+TT/GG) OR(95%CI)Pars10957736(CT+TT/CC) OR(95%CI)Pars269183(CT+CC/TT) OR(95%CI)Pa
Age at menarche
Menopausal state
Age at menopause
No. of pregnancies
Family history
rs269198(CA+AA/CC) OR(95%CI)Pars34841297(A-+--/AA) OR(95%CI)Pars12546233(CC+AC/AA) OR(95%CI)Pa
Age at menarche
Menopausal state
Age at menopause
No. of pregnancies
Family history
aP values adjusted for age, menarche age, menopausal status, number of pregnancies, number of abortions, history of breast feeding, and family history of breast cancer in first-degree relatives in logistic regression analysis (except for stratification factors).
The distributions of hormone receptor status (ER, PR and HER-2) and molecular subtypes (triple-negative, Her-2 overexpression and luminal) of patients with BC presenting different SNP genotypes are shown in Supplementary Tables 4 and 5, respectively. The rs269183 T>C variant was associated with Her-2 receptor positive and Her-2 overexpressing BC. The rs269198 C>A variant reduces the risk of Her-2-overexpressing BC. The GG genotype of rs4259395 was related to the PR receptor-positive BC, and the TT genotype of rs2588297 was related to the Luminal-type BC. SNPs rs3802201 C>G and rs2553716 A>C were both associated with PR positivity and luminal BC. In addition, the rs12546233 AC+CC genotype was associated with triple-negative BC and the AA genotype was associated with Her-2-overexpression BC.

False positive report probability (FPRP)

FPRP analysis [20] was used to evaluate the reliability of the positive results for MIR2052HG SNPs associated with BC susceptibility. As presented in Supplementary Table 6, when the critical value of FPRP was set to 0.5 and prior probability was 0.25, the FPRP values of all positive results for SNPs rs2553716, rs269183, rs269198, and rs12546233 were lower than the critical value. A possible association between the SNPs and the risk of BC was observed, which was worthy of further research and verification.

Haplotype analysis

Haplotype analysis was used to test the combined effect of MIR2052HG SNPs (Supplementary Table 7). Haplotype Crs3802201Ars2553716Ars4259395Grs2588297Crs10957736Trs269183Crs269198 Wrs34841297Ars12546233 was the highest frequency haplotype, and individuals with this haplotype have an increased risk of BC (OR: 1.203, 95% CI: 1.001-1.445), while Haplotype Grs3802201Crs2553716Grs4259395Trs2588297 Trs10957736 Trs269183Crs269198Mrs34841297 Ars12546233 and haplotype Grs3802201Crs2553716Grs4259395Trs2588297Trs10957736Trs269183Crs269198Mrs34841297Crs12546233 were associated with a low risk of BC (OR: 0.258, 95 %CI: 0.073-0.908; OR: 0.698, 95% CI: 0.549-0.887, respectively).

Multifactor dimensionality reduction

MDR software (multifactor dimensionality reduction 3.0.2) was used to analyze the interaction of genes and reproductive factors. As shown in Table 4, there was an interaction with the rs34841297 - (homozygous deletion) of A genotype, non-menopausal status, and no history of breastfeeding was observed, furthermore, the interaction model revealed a higher risk of BC (OR: 1.771, 95% CI: 1.367-2.941, P<0.001).

Table 4. Interaction results between MIR2052HG SNPs and reproductive factors.

Pre-menopausal, no breast-feeding0.49663/1010.0610.0021.543(1.180,2.018)
Pre-menopausal, no breast-feeding, rs348412970.53137/1018.883<0.0011.771(1.367,2.941)
aTesting balance accuracy.
bCross-validation consistency.

Real-time PCR results

The MIR2052HG expression levels in individuals with the rs34841297 --, A- and AA genotypes were shown in Figure 1A. The relative expression in individuals with the homozygous deletion genotype (1.68±1.37) was significantly higher than that in individuals with the heterozygous deletion genotype (0.94±0.95) (P=0.011) and AA genotype (0.26±0.12) (P<0.001). In addition, the relative expression of MIR2052HG in individuals with a homozygous deletion of rs34841297 was significantly higher than that of individuals with the AA genotype (P=0.001).

Figure 1. (A) Relative expression of LncRNA MIR2052HG in different genotypes of rs34841297. (B) The relative fluorescence value of different experimental groups in the dual luciferase report experiment. WT-NC: rs34841297 wild plasmids-Normal Control mimics, WT-MIMIC: rs34841297 wild plasmids-miR-4456 mimics, MUT-NC: rs34841297 mutant plasmids-Normal Control mimics, MUT-MIMIC: rs34841297 mutant plasmids-miR-4456 mimics. (C) The effect of miR-4456 combined with MIR2052HG on the proliferation of breast cancer cells (0-96h). *represents P<0.05; **represents P<0.01; ***represents P<0.001. (D) Transwell experiment explores the effect of miR-4456 combined with MIR2052HG on the invasion ability of MDA-MB-231 cells. (E) Transwell experiment explores the effect of miR-4456 combined with MIR2052HG on the migration ability of MDA-MB-231 cells.

Dual-luciferase reporter assays

A dual-luciferase reporter assay was performed in 293T cells to determine the biological association between rs34841297 and miR-4456. As shown in Figure 1B, the relative luciferase activity of the rs34841297 W-NC group was significantly higher than rs34841297 W-miR-4456 group (P<0.001), suggesting an interaction between the rs34841297 wild genotype plasmid vector and miR-4456. Meanwhile, the relative luciferase activity of the rs34841297 W-miR-4456 group was lower than the rs34841297 MUT-miR-4456 group (P<0.001), showing that the interaction between the plasmid vector of MIR2052HG and miR-4456 disappeared due to the deletion of the rs34841297 A allele.

The effect of miR-4456 combined with MIR2052HG on cell proliferation, invasion and migration

The relative expression of MIR2052HG and miR-4456 in MDA-MB-231 cells (2.02±0.34, 3.95±0.65) were both higher than that in MCF10A cells (0.92±0.08, 1.17±0.79), and the differences were statistically significant (Supplementary Figure 1). qRT–PCR was performed to detect the relative expression of miR-4456 in BC cells (MDA-MB-231) stably transfected with three lentiviruses. The results of the CCK-8 test are shown in Figure 1C. In the MDA-MB-231 cells, the OD values for the miR-4456 low-expression group at 24 h (P<0.001), 48 h (P=0.001), 72 h (P<0.001) and 96 h (P=0.002) were lower than those of the NC group, and the OD values for the miR-4456 overexpression group at 24 h (P<0.001), 48 h (P=0.005), 72 h (P<0.001) and 96 h (P=0.001) were higher than those of the NC group. Then, the number of cells passing through the Matrigel-covered Transwell chamber was observed to determine the ability of miR-4456 to modulate the invasion of MDA-MB-231 cells. The results presented in Figure 1D showed that the number of invading MDA-MB-231 cells in the miR-4456 low-expression group was lower than that in the NC group (P<0.001), and the number of invading MDA-MB-231 cells in the miR-4456 overexpression group was higher than that in the NC group (P=0.002). Finally, Transwell experiments were performed to explore the migration ability of BC cells with different miR-4456 expression levels. As presented in Figure 1E, the number of migrating MDA-MB-231 cells in the miR-4456 low-expression group MDA-MB-231 was lower than that in the NC group (P=0.004), and the number of migrating MDA-MB-231 cells in the miR-4456 overexpression group MDA-MB-231 was higher than that in the NC group (P=0.022).


In the present study, six SNPs rs3802201 (C>G), rs2553716 (A>C), rs4259395 (A>G), rs2588297 (G>T), rs10957736 (C>T) and rs12546233 (A>C), reduced the risk of BC. Among them, the rs3802201 (C>G) mutation reduces the risk of breast cancer, which corresponds with the study by Wang L et al. [19]. However, the rs34841297 gene polymorphism was positively correlated with the incidence of BC, and rs34841297 A gene deletion might increase the risk of BC. Furthermore, the associations between nine MIR2052HG SNPs and the BC receptor status (ER, PR, Her-2) were analyzed. Estrogen receptor (ER) can regulate normal breast epithelial cells and breast gland proliferation of cancer cells [21, 22]. Progesterone receptor (PR) was a member of the nuclear receptor superfamily of transcription factors that has the biological function of promoting functional recovery and reducing the volume of BC lesions [23]. Human epidermal growth factor receptor 2 (Her-2), as a marker for predicting the prognosis of BC, was regarded as the key to evaluating the efficacy of targeted drugs [16]. Our results indicated that rs3802201, rs2553716 and rs4259395 may exert a protective effect on BC by affecting PR receptor status. The CT+CC genotype of rs269183 was related to Her-2 receptor status and may affect the prognosis of BC. MDR model results showed that with rs34841297, homozygous deletion of the A gene, a non-menopausal status and no history of breastfeeding resulted in a higher risk of BC, which might lead to an increased risk of BC. MIR2052HG downregulation expression was reported to reduce ERα-positive BC cell growth [24]. Genetic variations in MIR2052HG were associated with the BC-free interval in the MA.27 trial ( number NCT00066573), and the variant SNPs were associated with increased MIR2052HG expression due to increased ERα binding to EREs [17, 25]. What’s more, researchers have discovered that SNPs could mediate the occurrence of cancer by affecting the secondary structure and expression of lncRNAs and the biological effects mediated by the interactions between lncRNAs and miRNAs [2628]. Here, we performed qRT–PCR to explore the total expression of MIR2052HG expression in individuals with different genotypes of rs34841297. A dual-luciferase reporter gene experiment was conducted to identify whether the rs34841297 A/- deletion mutation affects the binding ability of MIR2052HG to miR-4456. An interaction was observed between MIR2052HG and miR-4456 mimics when rs34841297 carried wild-type A allele, and the interaction disappeared with the deletion of the A allele, consistent with the results predicted by the LncRNASNP2 and DINAN websites. This result suggested that the rs34841297 polymorphism might affect the MIR2052HG and miR-4456 interaction. MiRNAs are small noncoding RNAs of approximately 22 nucleotides in length that perform posttranscriptional regulatory functions by binding to specific sites on the target transcript [29]. miR-4456 is a tiny noncoding RNA located on chromosome 5 that putatively influences oxytocin signaling [30]. To date, no studies have reported the relationship between miR-4456 and cancer. Our study proposed the hypothesis that the effect of the lncRNA MIR2052HG on BC occurrence and development is based on the interaction of MIR2052HG and miR-4456 mediated by rs34841297. CCK-8, Transwell and scratch wound experiments were performed using MDA-MB-231 cells. Compared with the normal control group, the proliferation ability of MDA-MB-231 cells with high miR-4456 expression was significantly increased. The Transwell assays using MDA-MB-231 cells found that compared with the normal control group, the ability of cells with high miR-4456 expression to migrate through the chamber and invade the Matrigel-coated Transwell chamber was significantly increased. Based on these experiments, we preliminarily suggested that the lncRNA MIR2052HG might mediate the binding of miR-4456 through rs34841297 to affect the proliferation, invasion and metastasis of BC. This study is the first to explore the association between genetic variants in the lncRNA MIR2052HG and BC susceptibility. This study has several advantages. First, the patients included in our study were all newly diagnosed and the controls were selected according to the frequency matching, which might reduce the selection bias in the study. Second, SNPscan high-throughput typing technology was used to SNP typing, making the results more accurate and credible than traditional restriction fragment length polymorphism (PCR-RFLP) typing technology. Finally, genotyping of all SNPs was performed on 10% of randomly selected samples for sequencing verification. Additionally, all cell function experiments were repeated more than three times, which improved the authenticity and reliability of the study results. Nevertheless, this study still has some limitations. All the subjects included in this study were of the Chinese Han population, and further studies of other populations should be performed to verify our results. In addition, the mechanism by which MIR2052HG SNPs modulate BC must be further explored in vivo.


In conclusion, the study reveals the association between the MIR2052HG gene polymorphism and the occurrence of BC. The MIR2052HG rs34841297 A/-- variant may affect the binding of miR-4456 to MIR2052HG and subsequently alter proliferation, invasion and migration of BC cells by regulating the expression of miR-4456 expression, which provides a baseline information for screening high-risk population populations for BC and formulating individualized preventive measures.

Materials and Methods

Research sample

Based on a case-control study design, 504 new BC samples that were pathologically confirmed were collected from March 2016 to December 2018. All patients were recruited from the top three hospitals in Henan Province. Through frequency matching to patients by age (±2 years), 505 healthy control samples were obtained from the biological sample bank of Henan Key Laboratory of Oncology Epidemiology. The basic information and clinical characteristics of the patients included age, age at menarche, menopausal status, menopausal age, number of pregnancies, number of abortions, history of breastfeeding and family history of BC. Moreover, information on patients’ hormone receptor status including estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor-2 (HER-2) was also acquired. The study was approved by the Ethics Review Committee of the Ethics Committee of Medical and Health Research of Zhengzhou University, Zhengzhou, Henan, China.

DNA extraction, SNP selection and genotyping

Sample DNA was extracted using a DNA Extraction Kit (Shanghai Laifeng Biotechnology Co., Ltd.) according to the manufacturer’s instructions and stored at -80° C until use. The MIR2052HG SNPs were screened using Ensembl38, Ensembl37, NCBI 1000 Genomes and Haploview software, with a minor allele frequency (MAF)>0.05 in Chinese Han population. Finally, 9 SNPs in functional regulatory regions SNPs and tag SNPs were determined, and the basic information was presented in Supplementary Table 1. RNAfold was used to predict the secondary structure of the significant SNPs. LncRNASNP2 and DIANA were used to predict the binding capacity of miRNAs that SNPs might affect (Supplementary Table 2). Combined with the Δ Energy, correlation with BC, and site prediction intersection results, miR-4456 was selected for further functional research. All SNPs were genotyped with SNPscan™ multiple SNP typing kit.

Quantitative real-time PCR (qRT–PCR)

Plasma RNA was extracted from 72 randomly selected healthy controls with TRIzol reagent, DNA was removed and RNA was reverse transcribed into cDNA. The relative expression of MIR2052HG in individuals carrying the rs34841297 polymorphism was detected using qRT–PCR with SYBR-green among individuals with different genotypes of rs34841297, and GAPDH served as the endogenous control. All samples were analyzed in triplicate, and the relative expression was calculated using the method of 2-ΔCt method. The sequences of primers used in this study were listed in Supplementary Table 3.

Dual-luciferase report assay

According to LncRNASNP2, carried with wild-type MIR2052HG-rs34841297 might gain a binding site for miR-4456, and the dual-luciferase assay verified the biological association between rs34841297 and miR-4456. Following the construction of the MIR2052HG rs34841297 wild-type and mutant pmirGLO plasmids, cotransfected HEK 293T cells with miR-4456 mimic or normal control (NC) mimics by using the riboFECT™ CP kit. Firefly fluorescence activity and Renilla fluorescence activity in each group were detected 72 hours after transfection, and relative luciferase activity was calculated based on firefly/Renilla fluorescence.

Cytological function experiment

qRT–PCR experiments first detected the relative expression of MIR2052HG and miR-4456 in MDA-MB-231 cells and MCF10A cells. The relative expression was calculated using the method of 2-ΔΔCt method. The sequences of MIR2052HG primers used in this study were listed in Supplementary Table 3. And the sequences of miR-4456 and U6 internal reference primers are shown in Supplementary Table 8. BC cells (MDA-MB-231) stably transfected with low-expression, overexpression and negative control (NC) lentiviral vectors carrying miR-4456 were established. Then, the stably transfected cell line was used for CCK-8 experiments and Transwell experiments to detect the proliferation, invasion and migration ability of BC cells with different miR-4456 expression levels.

Statistical analysis

Unconditional logistic regression analysis was used to explore BC-related MIR2052HG SNPs and adjusted for age, menarche age, menopausal status, number of pregnancies, number of abortions, breastfeeding history, and family history. SHEsis online software was applied to conduct the haplotype analysis of MIR2052HG. Multifactor dimensionality reduction (MDR) was used to analyze the interaction between genes and the environment. Independent t tests were applied to compare the relative expression of the lncRNA MIR2052HG with different rs34841297 genotypes, and the false-positive report probability (FPRP) [20] analysis was conducted to verify the positive results and the cut-off value to ensure the reliability and accuracy of the positive results. A t test was used to compare the OD values from the CCK8 experiment among groups with low expression of miR-4456 and the high expression of miR-4456 and the NC group. Independent t tests were used to analyze the accurate counts of stained cells in the two groups obtained in Transwell experiments. Differences in invasion and migration capabilities were calculated using t tests. All data analysis and cell number statistical analyses in this study were performed using SPSS 21.0(t-test, χ2 test and unconditional logistic regression model analysis), ImageJ (calculate scratch healing area, Transwell migration and invasion cell count), and GraphPad Prism 7.0(plot the calculation results). Two-sided P<0.05 was statistically significant.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Ethical responsibilities of authors

The manuscript has not been submitted to more than one journal for simultaneous consideration. The manuscript has not been published previously (partly or in full).

Author Contributions

CH S and LP X conceived of the study, participated in its design and helped to draft the manuscript. H Y carried out the molecular genetic studies, performed the statistical analysis, participated in the cell experiment and drafted the manuscript. QY S, FF C, XR J, YL W, KD X and YL Z revised the manuscript. All authors read and approved the final manuscript.


The authors thank the participants, the coordinators, and administrators for their supports during the study.

Conflicts of Interest

The authors declare no conflicts of interest.


This work was supported by Henan university science and technology innovation talents support program (19HASTIT005); Medical Science and Technology key projects of Henan Province and Zhengzhou (192102310088, 19A32000820, SBGJ2018089). The National Natural Science Foundation of China (U1604168).


  • 1. Erratum: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2020; 70:313. [PubMed]
  • 2. Harbeck N, Penault-Llorca F, Cortes J, Gnant M, Houssami N, Poortmans P, Ruddy K, Tsang J, Cardoso F. Breast cancer. Nat Rev Dis Primers. 2019; 5:66. [PubMed]
  • 3. Amelio I, Bernassola F, Candi E. Emerging roles of long non-coding RNAs in breast cancer biology and management. Semin Cancer Biol. 2021; 72:36–45. [PubMed]
  • 4. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020; 70:7–30. [PubMed]
  • 5. Li N, Deng Y, Zhou L, Tian T, Yang S, Wu Y, Zheng Y, Zhai Z, Hao Q, Song D, Zhang D, Kang H, Dai Z. Global burden of breast cancer and attributable risk factors in 195 countries and territories, from 1990 to 2017: results from the Global Burden of Disease Study 2017. J Hematol Oncol. 2019; 12:140. [PubMed]
  • 6. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018; 68:394–424. [PubMed]
  • 7. Lambertini M, Santoro L, Del Mastro L, Nguyen B, Livraghi L, Ugolini D, Peccatori FA, Azim HA Jr. Reproductive behaviors and risk of developing breast cancer according to tumor subtype: A systematic review and meta-analysis of epidemiological studies. Cancer Treat Rev. 2016; 49:65–76. [PubMed]
  • 8. Abdel-Sater F, Najar M, Fayyad-Kazan H. Triple negative breast cancer: microRNA expression profile and novel discriminators according to BRCA1 status. J Cell Physiol. 2020; 235:5204–12. [PubMed]
  • 9. Song Y, Barry WT, Seah DS, Tung NM, Garber JE, Lin NU. Patterns of recurrence and metastasis in BRCA1/BRCA2-associated breast cancers. Cancer. 2020; 126:271–80. [PubMed]
  • 10. Wiggins J, McLoughlin A, George A, Ring A, Kemp Z. Germline BRCA1 and BRCA2 testing for breast cancer survivors. J Med Genet. 2020; 57:528–30. [PubMed]
  • 11. Maruyama R, Suzuki H. Long noncoding RNA involvement in cancer. BMB Rep. 2012; 45:604–11. [PubMed]
  • 12. Zhou W, Ye XL, Xu J, Cao MG, Fang ZY, Li LY, Guan GH, Liu Q, Qian YH, Xie D. The lncRNA H19 mediates breast cancer cell plasticity during EMT and MET plasticity by differentially sponging miR-200b/c and let-7b. Sci Signal. 2017; 10:eaak9557. [PubMed]
  • 13. Xu LJ, Yu XJ, Wei B, Hui HX, Sun Y, Dai J, Chen XF. LncRNA SNHG7 promotes the proliferation of esophageal cancer cells and inhibits its apoptosis. Eur Rev Med Pharmacol Sci. 2018; 22:2653–61. [PubMed]
  • 14. Wei GH, Wang X. lncRNA MEG3 inhibit proliferation and metastasis of gastric cancer via p53 signaling pathway. Eur Rev Med Pharmacol Sci. 2017; 21:3850–6. [PubMed]
  • 15. Loewen G, Jayawickramarajah J, Zhuo Y, Shan B. Functions of lncRNA HOTAIR in lung cancer. J Hematol Oncol. 2014; 7:90. [PubMed]
  • 16. Tomiguchi M, Yamamoto Y, Yamamoto-Ibusuki M, Goto-Yamaguchi L, Fujiki Y, Fujiwara S, Sueta A, Hayashi M, Takeshita T, Inao T, Iwase H. Fibroblast growth factor receptor-1 protein expression is associated with prognosis in estrogen receptor-positive/human epidermal growth factor receptor-2-negative primary breast cancer. Cancer Sci. 2016; 107:491–8. [PubMed]
  • 17. Ingle JN, Xie F, Ellis MJ, Goss PE, Shepherd LE, Chapman JW, Chen BE, Kubo M, Furukawa Y, Momozawa Y, Stearns V, Pritchard KI, Barman P, et al. Genetic Polymorphisms in the Long Noncoding RNA MIR2052HG Offer a Pharmacogenomic Basis for the Response of Breast Cancer Patients to Aromatase Inhibitor Therapy. Cancer Res. 2016; 76:7012–23. [PubMed]
  • 18. Wapinski O, Chang HY. Long noncoding RNAs and human disease. Trends Cell Biol. 2011; 21:354–61. [PubMed]
  • 19. Wang L, Ingle J, Weinshilboum R. Pharmacogenomic Discovery to Function and Mechanism: Breast Cancer as a Case Study. Clin Pharmacol Ther. 2018; 103:243–52. [PubMed]
  • 20. Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Natl Cancer Inst. 2004; 96:434–42. [PubMed]
  • 21. Marchese S, Silva E. Disruption of 3D MCF-12A breast cell cultures by estrogens--an in vitro model for ER-mediated changes indicative of hormonal carcinogenesis. PLoS One. 2012; 7:e45767. [PubMed]
  • 22. Zhang Y, Cheng JC, Huang HF, Leung PC. Homeobox A7 stimulates breast cancer cell proliferation by up-regulating estrogen receptor-alpha. Biochem Biophys Res Commun. 2013; 440:652–7. [PubMed]
  • 23. Gibson CL, Constantin D, Prior MJ, Bath PM, Murphy SP. Progesterone suppresses the inflammatory response and nitric oxide synthase-2 expression following cerebral ischemia. Exp Neurol. 2005; 193:522–30. [PubMed]
  • 24. Cairns J, Ingle JN, Kalari KR, Shepherd LE, Kubo M, Goetz MP, Weinshilboum RM, Wang L. The lncRNA MIR2052HG regulates ERα levels and aromatase inhibitor resistance through LMTK3 by recruiting EGR1. Breast Cancer Res. 2019; 21:47. [PubMed]
  • 25. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, Gibbs RA, Belmont JW, Boudreau A, Hardenbol P, Leal SM, Pasternak S, Wheeler DA, Willis TD, et al, and International HapMap Consortium. A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007; 449:851–61. [PubMed]
  • 26. Gao P, Wei GH. Genomic Insight into the Role of lncRNA in Cancer Susceptibility. Int J Mol Sci. 2017; 18:1239. [PubMed]
  • 27. Minotti L, Agnoletto C, Baldassari F, Corrà F, Volinia S. SNPs and Somatic Mutation on Long Non-Coding RNA: New Frontier in the Cancer Studies? High Throughput. 2018; 7:34. [PubMed]
  • 28. Fu Y, Zhang Y, Cui J, Yang G, Peng S, Mi W, Yin X, Yu Y, Jiang J, Liu Q, Qin Y, Xu W. SNP rs12982687 affects binding capacity of lncRNA UCA1 with miR-873-5p: involvement in smoking-triggered colorectal cancer progression. Cell Commun Signal. 2020; 18:37. [PubMed]
  • 29. Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004; 116:281–97. [PubMed]
  • 30. Boström AE, Chatzittofis A, Ciuculete DM, Flanagan JN, Krattinger R, Bandstein M, Mwinyi J, Kullak-Ublick GA, Öberg KG, Arver S, Schiöth HB, Jokinen J. Hypermethylation-associated downregulation of microRNA-4456 in hypersexual disorder with putative influence on oxytocin signalling: A DNA methylation analysis of miRNA genes. Epigenetics. 2020; 15:145–60. [PubMed]