Research Paper Volume 17, Issue 12 pp 2950—2988

Age-specific DNA methylation alterations in sperm at imprint control regions may contribute to the risk of autism spectrum disorder in offspring

Eugenia Casella1, *, , Jana Depovere1, *, , Chantal Delger1, , Mariia Butynets1, , Philipp Antczak2,3,4, , Thomas Price5, , Randy L. Jirtle6, , Susan K. Murphy7, , Cathrine Hoyo6, , Adelheid Soubry1, ,

  • 1 Department of Human Genetics, Epigenetic Epidemiology Lab, Faculty of Medicine, KU Leuven, University of Leuven, Leuven 3000, Belgium
  • 2 Department II of Internal Medicine, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne 50937, Germany
  • 3 Center for Molecular Medicine Cologne (CMMC), Cologne 50931, Germany
  • 4 Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne and University Hospital Cologne, Cologne 50931, Germany
  • 5 Division of Reproductive Endocrinology and Infertility, Department of Obstetrics and Gynecology, Duke University Medical Center, Durham, NC 27701, USA
  • 6 Department of Biological Sciences, Center for Human Health and the Environment, North Carolina State University, Raleigh, NC 27633, USA
  • 7 Department of Obstetrics and Gynecology, Division of Reproductive Sciences, Duke University Medical Center, Durham, NC 27701, USA
* Co-first authors

Received: July 25, 2025       Accepted: December 4, 2025       Published: December 29, 2025      

https://doi.org/10.18632/aging.206348
How to Cite

Copyright: © 2025 Casella et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Research findings suggest that advanced paternal age is associated with an increased risk of autism spectrum disorder (ASD) in children. The biological process behind this father-to-child inheritance of a disease may be driven by sperm epigenetic marks. This has been suggested earlier, but the identification of epigenomic regions responsible for these age-related responses have not been further elaborated. To identify sperm-specific marks, we conducted an epigenome-wide association study in sperm from 63 men, using the Illumina 450K array. Linear regression modeling was applied to identify differentially methylated CpGs (DMCs) by age; we controlled for body mass index, patient status, and multiple testing. We found 14,622 statistically significant age-related DMCs; most (69%) were inversely correlated. We identified 95 imprinted genes and emphasized 747 age-related DMCs adjacent to an imprint control region (ICR). Altered methylation patterns in ICRs may result in disturbed expression of imprinted genes and are suspected to be at the origin of several diseases in offspring, including neurodevelopmental disorders. Mapping our results to other databases revealed the following set of imprinted genes linked to ASD: OTX1, PRDM16, PTPRN2, B4GALNT4, KCNQ1, KCNQ1OT1, DLGAP2, PLAGL1, GNAS, GRB10, MAGEL2, CDH24, and FBRSL1. Further research on these genes could help understand the contribution of paternal age on the development of autism. A change in DNA methylation levels in ICRs before conception may contribute to the heterogeneity and complexity of ASD. Measured DNA methylation effect sizes were subtle, but small epigenetic disturbances in sperm may be important on a population level, especially if men continue delaying fatherhood. Public health would benefit from the development of preventive and educational programs.

Introduction

Animal models and human epidemiological data indicate that male ageing can interfere with multiple health outcomes, including fertility, embryo growth, and pregnancy success [13]. If pregnancy succeeds, older men have an increased risk to father a child with a psychiatric disorder, compared to younger fathers [4]. A worrisome observation is that these findings have been reported within “normal age-ranges” of fatherhood for the first time [5]. Exploring the role of preconceptional age in autism is a time-wise question. Especially because autism diagnosis has risen dramatically over the last decade. Between 2011 and 2022 rates of ASD diagnosis in the US increased by 175% in children between 5 and 8 years old [6]. Within the same period of time, the average age of American men fathering a first child shifted from 30.8 to 32.1 years [7]. A genome-wide DNA methylation study in sperm from mice indicated an age-related decrease in DNA methylation in regions associated with transcriptional regulation. Offspring of older males exhibited reduced exploratory behaviors and showed a transcriptional dysregulation at genes linked to schizophrenia and autism [3]. A link between advanced age of the father and autism in offspring has been demonstrated through multiple studies [810]. However, the underlying biological or epigenetic processes are not well understood. A few genome-wide association studies have provided evidence that sperm DNA methylation in human is sensitive to ageing [1114]. Some were designed to predict male age from a semen sample (e.g. in forensics). Limitations of these studies included the use of distinct populations, such as mixtures of fertility patients, sperm donors and men recruited from the general population. Other reports used limited numbers of young and healthy participants; and populations also included smokers [12]. Smoking is a known confounder in age-related epigenetic studies [14]. Other factors may also affect age-DNA methylation associations, such as body mass index (BMI), diet, and medical conditions [15, 16]. In brief, most published data from age-related epigenetic studies in sperm were not designed -or their analytic approach was not fully optimized- to identify genes involved in inheritance or offspring health.

In this publication we perform a genome-scale DNA methylation study and comprehensive analysis using the Infinium HumanMethylation450 array to identify age-associated alterations in sperm that may possibly be transmitted to offspring. Our male population largely consists of healthy and non-smoking volunteers. Considering current knowledge on imprint control regions (ICRs), whose parent-of-origin CpG methylation patterns are established during gametogenesis and resistant to postfertilization epigenetic reprogramming, we specifically explore these regions for age-related alterations [17]. ICRs regulate imprinted genes and potentially acquired imprint instability from preconceptional exposures (including ageing and age-related factors) may be carried on to the next generation, increasing the risk for development of a chronic disorder. We hypothesize that a better understanding of age-related DNA methylation patterns in the male germ line will provide valuable information about father-to-child inheritance of diseases.

Results

Characteristics of study participants and semen samples

Because of variability in human populations, we studied a subgroup of the general population. Non-smoking men with a healthy reproductive profile were recruited at the Duke Fertility Center. Study design and inclusion or exclusion criteria are illustrated in Supplementary Figure 1. Socio-demographic data of our study population is shown in Table 1. In brief, a total of 63 men between 18 to 35 years old was included in our statistical analysis. Men aged less than 25 years old represented 44% (n = 28), 27% (n = 17) were between 25 and 29 years old, and nearly 29% (n = 18) were between 30 and 35 years old. Most men lacked a graduate degree (n = 46, 74.1%) and had not fathered a child (n = 55, 87.3%). The majority were healthy volunteers from outside the clinic (n = 48, 76.2%). The mean age of those that were patients was 24 years, while that of the non-patient subgroup was 30 years; this difference was significant (p-value <0.001). Abnormal clinical sperm parameters were found in 16 men (25.4%); 5 in the patient subgroup and 11 in the non-patient subgroup. About one-third (n = 20) was categorized as being overweight or obese. Having a BMI of 25 or more was strongly associated with older age (p-value = 0.001).

Table 1. Socio-demographic data of TIEGER participants and semen characteristics.

VariablesCategorizationn%
Age (years) (median: 25.00, mean: 25.48)18–242844.4
25–291727.0
30–351828.6
BMINormal weight (18 ≤ BMI <25)4368.2
Overweight/obese (BMI ≥25)2031.8
Patient at fertility clinicYes1523.8
No4876.2
Highest degree of educationHigh school graduate/GED69.7
Some college1829.0
College graduate2235.5
Graduate degree1625.8
Biological childrenYes812.7
No5587.3
Total motile count (TMC)≤39 × 106 (abnormal)1016.1
>39 × 1065283.9
Concentration<15 × 106 (abnormal) 34.8
≥15 × 1065995.2
Motility<40% (abnormal)1117.7
≥40%5182.3
Sperm qualitynormal4774.6
abnormal1625.4
Legend: The study population includes 63 men between 18 and 35 years old. Variables included in our statistical analysis are shown. Sperm quality is categorized as “abnormal” if at least one of the sperm quality metrics (concentration, motility, TMC) is below the WHO recommended threshold. If the sum is not 63, due to missing data, the percentage was calculated on known data (missing data: TMC: 1; concentration: 1; motility: 1).

Distributions and categorization of DNA methylation outcomes in sperm

Primary analyses of DNA methylation percentages in sperm on the Illumina array, such as global means and individual means, were performed on BMIQ normalized β-values. An overview of our analytic approach is illustrated in Figure 1. In total, we evaluated 482,287 CpG sites. The mean DNA methylation at all CpG sites was 47.63% (β-value = 0.4763). We classified each CpG site by its mean β-value in the following subgroups: unmethylated (UM), hemi-methylated (HM) and fully methylated (FM), as described in the methods section. The number of CpG sites in each methylation subgroup is presented in Table 2. Most CpG sites in sperm were either unmethylated (UM) (46.95%) or fully methylated (FM) (43.02%); the remaining 10.03% was categorized as hemi-methylated (HM). Because β-values have severe heteroscedasticity at low and high methylation values, we used M-values in our validation of individual CpG outcomes. Distributions of the average M-values per site, before and after normalization, are shown as the average densities across 63 samples in Supplementary Figure 2. For the purpose of data interpretation, we also show (and discuss) β-value outcomes.

Workflow to assess age-related DNA methylation changes in sperm and biomarker selection for ASD. Data generated from the Illumina HumanMethylation450 BeadChip went through the following analytic procedures. (1) global DNA methylation (mean per subject) by age. (2) DNA methylation at individual CpGs by age. Note, additional tests were performed, as described in the methods section. (3) We subdivided our outcome data as follows: Unmethylated (UM, mean β-value 0.80), defined by the mean DNA methylation per CpG site (of all subjects). The top 30 of the most significant results (lowest adjusted p-value) were classified by: (4) significance (top 90), direction (positive, negative), and magnitude (absolute Delta-M >0.1). All age-related DMCs (n = 14,622) were analyzed in terms of the following approaches: (5) chromosome location, island content, functional genomic allocation, and (6) gene ontology (GO). We compared our results with: (7) similar published reports, and (8) listed data on imprinting. Finally, (9) a focus was applied on all DMCs using Simons Foundation Autism Reference Initiative database and other publicly available databases on ASD. (10) We selected a set of potential biomarkers for ASD within our set of imprinted genes.

Figure 1. Workflow to assess age-related DNA methylation changes in sperm and biomarker selection for ASD. Data generated from the Illumina HumanMethylation450 BeadChip went through the following analytic procedures. (1) global DNA methylation (mean per subject) by age. (2) DNA methylation at individual CpGs by age. Note, additional tests were performed, as described in the methods section. (3) We subdivided our outcome data as follows: Unmethylated (UM, mean β-value <0.2), Hemi-methylated (HM, 0.20≤ mean β-value ≤0.80), and Fully methylated (FM, mean β-value >0.80), defined by the mean DNA methylation per CpG site (of all subjects). The top 30 of the most significant results (lowest adjusted p-value) were classified by: (4) significance (top 90), direction (positive, negative), and magnitude (absolute Delta-M >0.1). All age-related DMCs (n = 14,622) were analyzed in terms of the following approaches: (5) chromosome location, island content, functional genomic allocation, and (6) gene ontology (GO). We compared our results with: (7) similar published reports, and (8) listed data on imprinting. Finally, (9) a focus was applied on all DMCs using Simons Foundation Autism Reference Initiative database and other publicly available databases on ASD. (10) We selected a set of potential biomarkers for ASD within our set of imprinted genes.

Table 2. Frequencies and distributions of CpG sites in sperm by DNA methylation subgroup and by age.

DNA Methylation subgroup (SG)β-valueM-valueAnalyzed CpG sitesSignificant DMCs by age
N%N%1%2
All sitesAllAll482,28710014,6221003.03
10,091; 4,531+69.01; 30.99+2.09; 0.94+
Unmethylated (UM)<0.2(inf, −2)226,43246.952,41416.511.07
2,217; 197+91.84; 8.16+0.98; 0.09+
Hemi-methylated (HM)(0.2–0.8)(−2, 2)48,39110.036,39043.7013.20
4,775; 1,615+74.73; 25.27+9.87; 3.33+
Fully methylated (FM)>0.8(2, +inf)207,46443.025,81839.792.80
3,099; 2,719+53.27; 46.73+1.49; 1.31+
According to the mean β-value all CpGs of the 450K are categorized by subgroup (SG): Unmethylated (UM), Hemi-methylated (HM), or Fully methylated (FM) (UM: β-value <0.2; HM: β-value (0.2–0.8); FM: β-value >0.8). Within each subgroup the number of analyzed CpG sites on the array are shown, as well as the number (and %) of significant age-related DMCs (FDR <0.05). (+) number of sites that are increased by age. (−) number of sites that are decreased by age. 1% relative to the total number of significant DMCs. 2% relative to the total number of CpGs assessed on the 450K within each subgroup.

Global DNA methylation outcomes in sperm by age

After calculating global DNA methylation at all sites, we searched for a potential correlation between age and DNA methylation. No significant relationship was found. Neither if age was used as a continuous variable in a linear regression model (p-value = 0.76) (Supplementary Figure 3), nor through stratifying by age (below and above median age) (p-value = 0.97) (Supplementary Figure 4). Next, we verified if age-associations could be found after stratification of our population by patient status or by obesity status. Although an opposite trend could be seen by these strata, these correlations were not significant (Supplementary Figures 5 and 6). We further verified if global DNA methylation differed by age within each CpG subgroup (UM, HM and FM). We found no significant correlations; p-values were 0.74 at UM sites, 0.92 at HM sites, and 0.96 at FM sites (Supplementary Figure 7).

Site specific DNA methylation outcomes in sperm by age: directions and magnitudes of change

At each CpG site of the 450K array a potential association between DNA methylation and age was evaluated using a linear regression model, adjusting for BMI, patient status and multiple testing. We found a small fraction of CpG sites (3.03%) that was significantly altered by age; 14,622 CpG sites out of 482,287 CpGs could be identified as differentially methylated sites (DMCs). Among those, most DMCs (69.01%; n = 10,091) were negatively associated with age (Table 2). A Volcano plot illustrates all DMCs that are significantly altered by age (Figure 2A). After subdividing our probes by subgroups, associations were as follows: in the UM subgroup, 91.84% of the DMCs were decreased by age (Table 2 and Figure 2B); in the HM subgroup, 74.73% DMCs were negatively associated with age (Table 2 and Figure 2C); and in the FM subgroup, significant DMCs were nearly equally distributed, with 53.3% (n = 2,719) being positively associated and 46.7% (n = 3,099) being negatively associated with age (Table 2 and Figure 2D). We present a short list of highly significant age-related DMCs (top 30 within each subgroup); with highest magnitude in change (absolute delta M-value higher than 0.1) underlined in Table 3. In the UM subgroup, 20 DMCs were highly susceptible to ageing with a high DNA methylation impact at the following genes: HOXA11; HOXA11AS, LRRC25, LHX1, LHX2, LHX5, ZIC4, GRASP, C16orf13, LOC283999, ZNF385A, LOXL3; DOK1, MAGEL2, GNA13, IRF9 and OSR2. In the HM subgroup we identified eight DMCs, annotated to the following genes: NOTCH4, RSPO1, LOXL3, HOXA5, LOC283999, HBA1 and ESAM. In the FM subgroup we found four DMCs, corresponding to LOC283999, FAM184A, as well as some unidentified (NA) genes. For instance, the gene LOC283999 showed an opposite direction in DNA methylation change by age at the FM subgroup; this may have biological consequences. Older men may have less sperm cells that are fully methylated at the corresponding site (cg09445803); a ten-year increase in age was associated with a 2.7% decrease in DNA methylation, or a drop from 82.56% to 79.86% (Table 3). As indicated in our methods section, we controlled for BMI, patient status and multiple testing. In our sensitivity analyses (excluding outliers or patients) and in our extended models (including additional co-factors) our results remained consistent with the main analysis (data not shown).

Volcano plots of 450K data by age in sperm. Volcano plots of our age-association study on the 450K array. X-axis: Delta β-values (Δβ) representing age-associated changes in DNA methylation (after linear regression, adjusted for BMI and patient status). Y-axis: logarithmic transformation of the adjusted p-value (BH-method). Dashed line: indicates where the adjusted p-value is 0.05; dots above the dashed line represent CpG sites where sperm DNA methylation is significantly associated with male age. (A) All sites of the 450K are included (n = 482,287) and dots above the dashed line are age-associated DMCs (n = 14,622). (B–D) Volcano plots by subgroup of DNA methylation. Colored dots: significant age-related DMCs mapped to genes selected by their potential role in inheritance of ASD from father to child; these include: OTX1, PRDM16, PTPRN2, B4GALNT4, KCNQ1, KCNQ1OT1, DLGAP2, PLAGL1, GNAS, GRB10, MAGEL2, CDH24 and FBRSL1. Each colored dot represents one DMC; multiple dots with same color are allocated to the same gene.

Figure 2. Volcano plots of 450K data by age in sperm. Volcano plots of our age-association study on the 450K array. X-axis: Delta β-values (Δβ) representing age-associated changes in DNA methylation (after linear regression, adjusted for BMI and patient status). Y-axis: logarithmic transformation of the adjusted p-value (BH-method). Dashed line: indicates where the adjusted p-value is 0.05; dots above the dashed line represent CpG sites where sperm DNA methylation is significantly associated with male age. (A) All sites of the 450K are included (n = 482,287) and dots above the dashed line are age-associated DMCs (n = 14,622). (BD) Volcano plots by subgroup of DNA methylation. Colored dots: significant age-related DMCs mapped to genes selected by their potential role in inheritance of ASD from father to child; these include: OTX1, PRDM16, PTPRN2, B4GALNT4, KCNQ1, KCNQ1OT1, DLGAP2, PLAGL1, GNAS, GRB10, MAGEL2, CDH24 and FBRSL1. Each colored dot represents one DMC; multiple dots with same color are allocated to the same gene.

Table 3. Alterations at the top 90 most significant DMCs by age in sperm.

SGProbe IDDelta MMean Mp-valueMean βDelta βGeneChrChr location
<0.2cg16038003−0.139−2.4342e-080.169−0.023HOXA11; HOXA11AS7chr7:27227520-27229043
<0.2cg07834682−0.099−2.2743.2e-070.179−0.017SIX22chr2:45231211-45231482
<0.2cg11207353−0.104−2.8343.8e-070.131−0.017LRRC2519NA
<0.2cg01298678−0.132−2.9944e-070.123−0.023LHX29chr9:126773246-126780953
<0.2cg06490225*−0.128−2.6934.2e-070.146−0.021LHX1*17chr17:35291899-35300875
<0.2cg03861097−0.138−3.8504.3e-070.073−0.023NA21chr21:34395128-34400245
<0.2cg08178072−0.111−2.1495e-070.193−0.019NA2chr2:45164561-45166567
<0.2cg00896370−0.106−2.6895.4e-070.143−0.018ZIC43chr3:147115764-147116421
<0.2cg01447831*−0.158−3.3736.3e-070.102−0.027NA*7chr7:155246390-155251955
<0.2cg22566906−0.103−2.2926.3e-070.177−0.018GRASP12chr12:52400467-52401696
<0.2cg25156915−0.129−4.2687.2e-070.055−0.018NA4chr4:174427891-174428192
<0.2cg16757281*−0.109−4.8218.5e-070.038−0.010C16orf13*16chr16:682634-687106
<0.2cg04856022−0.100−2.6038.7e-070.148−0.017PPT26chr6:32121829-32122529
<0.2cg16872071−0.094−2.5009.4e-070.158−0.016RALGDS9chr9:135995969-135996954
<0.2cg24764168s−0.143−2.1991.2e-060.196−0.019NA6chr6:1381743-1385211
<0.2cg14699734−0.085−2.4911.7e-060.157−0.014NA19chr19:5803734-5806023
<0.2cg02675859−0.102−3.7961.7e-060.072−0.013LOC28399917chr17:76228110-76228380
<0.2cg10073842−0.127−2.5731.8e-060.156−0.022MAGEL215NA
<0.2cg21619325−0.095−2.7252.4e-060.138−0.015OSR12chr2:19560963-19561650
<0.2cg00260116−0.079−2.4722.4e-060.158−0.013C7orf527chr7:100815484-100816995
<0.2cg08788246−0.097−3.3733.1e-060.094−0.015ZNF7422chr22:20759743-20760923
<0.2cg08940570−0.125−2.8523.3e-060.133−0.022LOXL3; DOK12chr2:74781494-74782685
<0.2cg17465423−0.105−2.5303.7e-060.157−0.018ZNF385A12chr12:54784900-54785238
<0.2cg09600715−0.086−2.1883.8e-060.186−0.015NA7chr7:27227520-27229043
<0.2cg19556208−0.110−3.3344e-060.098−0.017IRF914NA
<0.2cg18911395−0.193−3.3744.2e-060.112−0.028LHX512chr12:113908887-113910681
<0.2cg17413194−0.125−2.5934.2e-060.154−0.022GNA1317chr17:63051893-63053355
<0.2cg15287594−0.089−3.7584.3e-060.073−0.010FITM220chr20:42939450-42940043
<0.2cg03157027−0.107−3.8045e-060.073−0.017OSR28chr8:99960497-99961438
<0.2cg04479580−0.100−2.3155.2e-060.176−0.017NA12chr12:6438272-6438931
(0.2–0.8)cg02991082−0.0860.0094.7e-100.502−0.010TSC22D3XchrX:106959378-106959914
(0.2–0.8)cg02494945−0.098−1.7621.1e-080.234−0.016TBX417chr17:59531723-59535254
(0.2–0.8)cg26854298−0.1000.2502e-080.542−0.008NA4chr4:174443365-174443948
(0.2–0.8)cg11671335−0.1040.8197.1e-080.634−0.007NOTCH46chr6:32163292-32164383
(0.2–0.8)cg03767475−0.0901.7858.7e-080.770−0.004COMP19chr19:18899037-18902284
(0.2–0.8)cg01862311s−0.0991.1568.7e-080.685−0.006NA8chr8:26305853-26306825
(0.2–0.8)cg15032957−0.0611.9858.7e-080.796−0.004TLE219chr19:3035638-3035872
(0.2–0.8)cg24049629−0.0860.3049.9e-080.551−0.009RASSF13chr3:50377803-50378540
(0.2–0.8)cg04508739−0.0881.1422.3e-070.683−0.006NA14chr14:57264638-57265561
(0.2–0.8)cg24114899s−0.0810.9722.3e-070.659−0.007MGC12982; FOXD21chr1:47902793-47905518
(0.2–0.8)cg13054119−0.115−1.4592.6e-070.276−0.018RSPO11chr1:38099677-38100864
(0.2–0.8)cg02021127−0.1001.4042.6e-070.719−0.005PIP5KL19chr9:130692839-130693331
(0.2–0.8)cg04044203−0.1090.0592.7e-070.510−0.010ESAM11chr11:124632063-124633239
(0.2–0.8)cg10836509−0.083−0.8343.1e-070.363−0.012NA1chr1:226297287-226298586
(0.2–0.8)cg23674882−0.117−0.6023.2e-070.403−0.012LOXL32chr2:74781494-74782685
(0.2–0.8)cg05269451s−0.103−1.1573.2e-070.317−0.014NA2chr2:172963623-172964135
(0.2–0.8)cg03107888−0.084−0.6773.2e-070.388−0.012HOXA137chr7:27238690-27240311
(0.2–0.8)cg06097213s−0.080−1.0023.2e-070.337−0.013LOC2858306chr6:29716468-29717158
(0.2–0.8)cg20437892−0.091−0.0463.2e-070.492−0.011NA2NA
(0.2–0.8)cg00215611−0.0880.0633.2e-070.511−0.009NA2chr2:219860927-219861242
(0.2–0.8)cg19701577−0.131−0.8243.2e-070.371−0.012HOXA57chr7:27182613-27185562
(0.2–0.8)cg10919107cr−0.0940.2013.2e-070.534−0.009PMEPA120chr20:56227252-56227687
(0.2–0.8)cg04730768−0.0770.7103.2e-070.618−0.007CHAT10chr10:50817095-50817309
(0.2–0.8)cg23327992−0.079−0.4723.2e-070.421−0.012IRF914NA
(0.2–0.8)cg20774552−0.0570.7073.3e-070.619−0.006NA11chr11:46410921-46414687
(0.2–0.8)cg20005923−0.102−0.3653.6e-070.440−0.011LOC28399917chr17:76228110-76228380
(0.2–0.8)cg02825052−0.0761.1153.8e-070.681−0.006PICALM11NA
(0.2–0.8)cg01704105−0.114−1.2503.8e-070.305−0.016HBA116chr16:226173-227254
(0.2–0.8)cg00176496−0.0750.5793.8e-070.597−0.008PITPNM317NA
(0.2–0.8)cg01344797cr−0.0670.9774e-070.661−0.005PSD10chr10:104168558-104169153
>0.8cg02882504s−0.0852.7488.7e-080.866−0.002NA16chr16:3238805-3239492
>0.8cg02680566−0.0832.1002.3e-070.806−0.004OSR12chr2:19560963-19561650
>0.8cg08604523s−0.0892.1714e-070.812−0.003MDK11chr11:46406904-46407441
>0.8cg01943692−0.0772.4097.7e-070.837−0.003NA1chr1:26686516-26687281
>0.8cg00019678−0.0652.5641.1e-060.852−0.003C12orf3412chr12:110151327-110152758
>0.8cg09445803−0.1012.3251.2e-060.826−0.003LOC28399917chr17:76228110-76228380
>0.8cg120010780.0642.7081.4e-060.8640.010DAPP14NA
>0.8cg186727160.1443.5211.8e-060.9070.024NA2NA
>0.8cg24321297−0.0462.6272.8e-060.859−0.002ST6GALNAC217chr17:74580974-74582396
>0.8cg24888609s0.1523.3213.1e-060.8940.026FAM184A6NA
>0.8cg21483922−0.0672.2203.1e-060.820−0.003SAMD1417chr17:48206663-48207601
>0.8cg05525368−0.0622.9503.8e-060.883−0.002CDC201chr1:43824134-43825059
>0.8cg11433866s−0.0702.1223.8e-060.809−0.003NA12chr12:115124729-115125152
>0.8cg01405985−0.1072.4414.2e-060.835−0.002NA19chr19:36347044-36348101
>0.8cg24073074−0.0692.1754.9e-060.815−0.003WIPI117NA
>0.8cg04078221−0.0642.0425.2e-060.801−0.003NA1chr1:47899125-47899398
>0.8cg18231690−0.0752.0925.3e-060.805−0.004ZNF59215chr15:85291079-85291697
>0.8cg21690489−0.0632.6215.6e-060.857−0.003CREB3L111NA
>0.8cg08692104−0.0553.3656.3e-060.910−0.001ZNF82319chr19:11849359-11849796
>0.8cg11277190cr0.0932.9056.4e-060.8760.015FAM184A6NA
>0.8cg215741860.0793.5196.4e-060.9160.009UNC5C4NA
>0.8cg00471142−0.0593.0406.5e-060.889−0.002LRFN119chr19:39804621-39805954
>0.8cg01109643−0.0832.1526.6e-060.810−0.003NA13chr13:113548643-113549127
>0.8cg24842142−0.0692.1756.9e-060.815−0.003RNF396chr6:30042918-30043500
>0.8cg141019760.0814.0987.3e-060.9420.010UNC5C4NA
>0.8cg12391323s−0.0462.4357.5e-060.842−0.003NA5chr5:139027443-139030219
>0.8cg05878073−0.0673.1947.6e-060.899−0.002ABCD414chr14:74769366-74769815
>0.8cg11338156−0.0782.6577.9e-060.858−0.003NA10NA
>0.8cg13244241−0.0823.8338.2e-060.931−0.001NA10chr10:102489343-102491011
>0.8cg00730441−0.0612.1878.4e-060.817−0.003TBX217chr17:59485573-59485780
Probes related to DMCs by age were first sorted by subgroups (UM: β-value <0.2; HM: β-value (0.2–0.8); FM: β-value >0.8), then by adjusted p-values (Benjamini-Hochberg method; FDR <0.05). Only the top 30 most significant results by each subgroup (SG) are presented. Delta-M and Delta-β: estimated changes by age in M-value and in β-value, respectively. Annotated gene names and chromosome locations are shown according to the UCSC Genome Browser. *Probe related to CpG Islands; crcross-reactive probe; sSNP; (bold) direction is opposite (e.g., hypomethylated in FM subgroup); (underlined) highest magnitude in DNA methylation changes (Delta-M is higher than 0.1).

Genomic allocations, island content and other functional characteristics of age-associated DMCs or annotated genes

A Miami plot was created of all significant age-related DMCs across the chromosomes, separated by hypermethylated sites and hypomethylated sites (Figure 3). A remarkable observation is a general hypomethylation at sites annotated to imprinted genes. Next, we did a parallel analysis of all 14,622 DMCs in terms of their CpG content and neighborhood content (island, shore, shelf, open sea). CpG islands and open sea areas are the most frequent targets on the microarray, accounting for 30.91% and 36.28% of all analyzed CpG sites, respectively (Supplementary Table 1, Figure 4A). We detected limited age-related DMCs at islands; accounting for nearly 5% (n = 722) of the DMCs (Figure 4B), and only 0.48% of all analyzed sites (which is in large contrast to the 30.9% in the complete array) (Supplementary Tables 1 and 2). A comparison of increase (or hypermethylation) versus decrease (hypomethylation) in numbers and in ratios - by age, by island and by neighborhood content - are shown in Supplementary Table 2. Only a small difference was seen between the prevalence of negative and positive associations by age (55.7% versus 44.3 %, respectively); ratio is 1.26 if all DMCs are included, but it is 2.38 if only UM sites are considered (Supplementary Table 2). Still, an overall ratio of 1.26 at CpG islands is modest if compared to a high ratio of 24.8 at shores (hence, 24.8 times more negative associations than positive associations are measured at shores). An opposite but small ratio (0.40) is seen at open seas (or 2.5 times more positive associations than negative associations).

Chromosomal distribution of age-associated DMCs in sperm. Miami plot of age-associated differentially methylated CpG sites (DMCs), shown by direction of change (Delta β >0 relates to an increase in number of sperm cells that are methylated, Delta β Δβ=2(M0+delta M)1+2(M0+delta M)−2M01+2M0. Significant DMCs correspond to FDR

Figure 3. Chromosomal distribution of age-associated DMCs in sperm. Miami plot of age-associated differentially methylated CpG sites (DMCs), shown by direction of change (Delta β >0 relates to an increase in number of sperm cells that are methylated, Delta β <0 means a decrease in number of sperm cells that are methylated). Dashed line: no change (Δβ = 0). Delta β-values are calculated from M-values, as shown in the methods section Δβ=2(M0+delta M)1+2(M0+delta M)2M01+2M0. Significant DMCs correspond to FDR <0.05. Blue dots are sites allocated to imprinted genes, listed in the Geneimprint database.

Island content of CpGs within 450K array and age-associated DMCs. (A) Percentages of CpGs by island content are displayed with respect to the 450K (n = 482,287). (B) Percentages of CpGs by island content are displayed with respect to the number of significant age-related DMCs (n = 14,622). (C–E) By DNA methylation subgroup (Abbreviations: UM: unmethylated, mean β-value 0.80).

Figure 4. Island content of CpGs within 450K array and age-associated DMCs. (A) Percentages of CpGs by island content are displayed with respect to the 450K (n = 482,287). (B) Percentages of CpGs by island content are displayed with respect to the number of significant age-related DMCs (n = 14,622). (CE) By DNA methylation subgroup (Abbreviations: UM: unmethylated, mean β-value <0.2; HM: hemi-methylated, 0.20 ≤ mean β-value ≤0.80; FM: fully methylated, mean β-value>0.80).

We further examined variations in DNA methylation in relation to CpG island content by our three subgroups of DNA methylation (Supplementary Table 2 and Figure 4C4E). Island associated DMCs were predominantly at unmethylated sites (UM), at which 13% of the age-related DMCs could be detected. Calculated ratios indicate that the direction of age-related changes is mostly negative in UM and positive in FM. Out of the 325 UM DMCs at islands, age had a positive effect on a small subset of 96 sites. And, out of the 117 FM DMCs at islands, 36 sites showed a decrease in DNA methylation. These sites -opposite to the characteristics of these subgroups (being low or high in DNA methylation, respectively)- may be interesting in the context of potential imprinting by parent of origin (discussed below). Most FM sites at shores and at shelves showed a decrease in methylation by age (approximately 95%, 1,611 sites out of 1,700 at shore sites; 81%, 776 out of 961 at shelf sites, respectively). Next, we explored functional genomic allocations (e.g., in subgroups such as promoter, body, 3′UTR, and intergenic) (Supplementary Table 3 and Figure 5). Overall, distributions of age-associated DMCs by functional allocations did not differ substantially from what was expected from the 450K array; although a higher representation of promotors within our DMCs was seen at the UM subgroup (28%) versus the FM subgroup (13%). Gene Ontology (GO) term enrichment analysis using all DMCs showed 11 significant GO terms related to male age (Supplementary Figure 8). Specifically, the following terms were captured with the highest enrichment: cell morphogenesis (biological process), and plasma membrane protein complex (cellular component).

Functional genomic distribution of CpGs within 450K array and age-associated DMCs. (A) Percentages of CpGs by functional genomic region are displayed; by promoter, 1stExon, 5′UTR, 3′UTR, body, and intergenic regions. CpG sites allocated to multiple regions are reported as a separate category. (A) Percentages with respect to the 450K (n = 482,287). (B) Percentages are displayed with respect to the number of significant age-related DMCs (n = 14,622). (C–E) By DNA methylation subgroup (Abbreviations: UM: unmethylated, mean β-value 0.80).

Figure 5. Functional genomic distribution of CpGs within 450K array and age-associated DMCs. (A) Percentages of CpGs by functional genomic region are displayed; by promoter, 1stExon, 5′UTR, 3′UTR, body, and intergenic regions. CpG sites allocated to multiple regions are reported as a separate category. (A) Percentages with respect to the 450K (n = 482,287). (B) Percentages are displayed with respect to the number of significant age-related DMCs (n = 14,622). (CE) By DNA methylation subgroup (Abbreviations: UM: unmethylated, mean β-value <0.2; HM: hemi-methylated, 0.20 ≤ mean β-value ≤0.80; FM: fully methylated, mean β-value >0.80).

Identifying age-related DMCs linked to imprinted genes and candidate ICRs

We screened our findings on age-related DMCs for their potential role in imprinting (Figure 1). First, imprinted genes were defined using Geneimprint, with a list of 227 imprinted genes in human; including 104 predicted and 123 confirmed imprinted genes [18]. Out of the 14,622 age-related DMCs, we found 271 DMCs (1.85%) that could be mapped to 95 (predicted) imprinted genes [17]; these genes are listed by subgroup in Figure 6. Because our 450K array includes 215 (potentially) imprinted genes, this means that 44% of these imprinted genes theoretically present on the array are sensitive to influences from ageing. Out of the 95 age-related imprinted genes, 49 are maternally expressed (or paternally imprinted), 42 are paternally expressed (or maternally imprinted), and imprinting is isoform dependent in four of the identified genes (Supplementary Table 4). Second, we extended our approach using a database published by Jima et al. [17], which includes a genomic map of 1,488 putative Imprint Control Regions (ICRs) or so-called Imprintome. Of our 14,622 DMCs, 747 DMCs could be linked to 380 candidate ICRs (Supplementary Table 5). More precisely, out of these 747 ICR-associated DMCs, 152 DMCs were allocated to 94 ICRs with paternal origin of methylation and 74 genes were identified; 193 DMCs were linked to 107 maternally methylated ICRs and 86 genes were found; and, the remaining 402 DMCs were associated to regions with unknown parent-of-origin methylation; corresponding to 179 ICRs or 158 genes (Supplementary Table 5). After a stringent selection of age-related genes based on a combined use of our findings above by Geneimprint list (n = 95) and by the Imprintome (n = 318), we selected 22 imprinted genes with previously reported ICRs (Figure 6). This set of genes corresponds to 94 age-associated DMCs; 79 DMCs are hypomethylated and 15 DMCs are hypermethylated (Table 4 and Figure 6). For instance, the highest magnitude at hypomethylated DMCs was measured at the PTPRN2 gene, being −28% per 10 years of ageing (cg18285788; Delta β = −0.028, p = 5.64e-04); and, the highest magnitude at hypermethylated DMCs was found at the KCNQ1 gene, being +10% per 10 years of ageing (cg17416793; Delta β = +0.010, p = 0.0127). In our search to define a set of age-related biomarkers of inheritance we could classify these 22 imprinted genes into the following categories: seven genes are paternally imprinted (or maternally expressed) (H19, KCNQ1DN, PTPRN2, KCNQ1, SVOPL, B4GALNT4, FBRSL1; the last two are yet predicted imprinted genes); twelve genes are known to be maternally imprinted (or paternally expressed) (MAGEL2, DIRAS3, FAM50B, DLGAP2, PLAGL1, ZIM2, GLI3, DLK1, CDH24, RB1, SNRPN, KCNQ1OT1); and, in 3 genes imprinting is isoform dependent (GNAS, BLCAP, GRB10). Notably, we repeated our analyses after exclusion of reported SNPs and cross-reactive probes [19, 20]. This test did not significantly change our results, with two exceptions: ZIM2 and C6orf145 (indicated in Figure 6 and Tables 3 and 4).

Imprinted genes affected by age in sperm and our approach for biomarker selection to predict autism spectrum disorders in offspring. A summary of (predicted) imprinted genes linked to significant age-associated DMCs is ordered by expressed allele, and significance (results with smallest p-values are on top). Upper frame: 95 (predicted) imprinted genes linked to 271 DMCs within DNA methylation subgroups (Abbreviations: UM: unmethylated, mean β-value 0.80). Lower frame: out of the 95 (predicted) imprinted genes (upper frame) 22 genes have been mapped to ICRs. In bold, genes involved in ASD. Right frame: summary of 27 ASD-related imprinted genes (due to probe cross-reactivity, the gene C6orf145 with a single DMC was not withheld). The following superscripts are used if at least one DMC had the following characteristic: *, at CpG Islands; PR, at promotor region; ↓ or ↑, an opposite direction in DNA methylation change was measured (increase or decrease in DNA methylation, at UM and FM methylated DMCs, respectively); s, at SNPs; cr, at a cross-reactive probe. M, maternally expressed genes; P, paternally expressed genes; I, isoform dependent.

Figure 6. Imprinted genes affected by age in sperm and our approach for biomarker selection to predict autism spectrum disorders in offspring. A summary of (predicted) imprinted genes linked to significant age-associated DMCs is ordered by expressed allele, and significance (results with smallest p-values are on top). Upper frame: 95 (predicted) imprinted genes linked to 271 DMCs within DNA methylation subgroups (Abbreviations: UM: unmethylated, mean β-value <0.2; HM: hemi-methylated, 0.20≤ mean β-value ≤0.80; FM: fully methylated, mean β-value >0.80). Lower frame: out of the 95 (predicted) imprinted genes (upper frame) 22 genes have been mapped to ICRs. In bold, genes involved in ASD. Right frame: summary of 27 ASD-related imprinted genes (due to probe cross-reactivity, the gene C6orf145 with a single DMC was not withheld). The following superscripts are used if at least one DMC had the following characteristic: *, at CpG Islands; PR, at promotor region; ↓ or ↑, an opposite direction in DNA methylation change was measured (increase or decrease in DNA methylation, at UM and FM methylated DMCs, respectively); s, at SNPs; cr, at a cross-reactive probe. M, maternally expressed genes; P, paternally expressed genes; I, isoform dependent.

Table 4. Age-related DMCs and 22 allocated genes in sperm linked to ICRs and imprinting.

SGEAProbe IDDelta βMean βp-valueChrGene
<0.2Icg21809160−0.0270.1026.65e-0520GNAS
<0.2Icg24058407−0.0170.1902.61e-0420GNAS
<0.2Icg11399589−0.0170.1390.002120BLCAP
<0.2Icg17696847−0.0080.0790.010720GNAS
<0.2Icg27006764−0.0010.0350.03347GRB10
<0.2Icg13591710−0.0160.1680.045320BLCAP
<0.2Icg11948874−0.0100.1040.048220BLCAP
<0.2Mcg07439128−0.0170.0672.35e-0511KCNQ1DN
<0.2Mcg04937416−0.0150.0913.17e-057PTPRN2
<0.2Mcg10798664−0.0120.1791.40e-0411B4GALNT4*
<0.2Mcg18285788−0.0280.0545.64e-047PTPRN2
<0.2Mcg26461944−0.0070.1390.001611B4GALNT4*
<0.2Mcg01923099−0.0200.1220.001711KCNQ1DN
<0.2Mcg16083838−0.0200.0870.004011KCNQ1DN
<0.2Mcg03983213−0.0090.0740.00427PTPRN2
<0.2Mcg22675922−0.0050.0600.006811KCNQ1DN
<0.2Mcg10726517−0.0030.0500.010111B4GALNT4*
<0.2Mcg13538517−0.0170.1500.01197PTPRN2
<0.2Mcg03481077−0.0140.1930.012011B4GALNT4*
<0.2Mcg04876474−0.0020.0430.012811KCNQ1DN
<0.2Mcg17682432−0.0030.0730.022911B4GALNT4*
<0.2Mcg08242024−0.0070.1510.03307PTPRN2
<0.2Pcg10073842−0.0220.1561.84e-0615MAGEL2
<0.2Pcg22872376−0.0140.1001.20e-0415MAGEL2
<0.2Pcg23076194−0.0110.1610.00291DIRAS3
<0.2Pcg19447496−0.0110.1900.022513RB1
<0.2Pcg23985641−0.0100.1770.04736FAM50B
(0.2–0.8)Icg07964163−0.0040.7826.42e-0520GNAS
(0.2–0.8)Icg24214471−0.0060.6812.87e-0420BLCAP
(0.2–0.8)Icg21733794−0.0080.5556.94e-0420BLCAP
(0.2–0.8)Icg04820254−0.0050.7230.001120BLCAP
(0.2–0.8)Icg21045560−0.0110.3510.001620BLCAP
(0.2–0.8)Icg20569652−0.0090.2820.005320BLCAP
(0.2–0.8)Icg17658854−0.0110.2070.009420GNAS
(0.2–0.8)Icg01565918−0.0100.3400.012820GNAS
(0.2–0.8)Icg14235271−0.0110.2190.036520GNAS
(0.2–0.8)Mcg21996245−0.0160.2026.08e-0511B4GALNT4*
(0.2–0.8)Mcg20846508−0.0120.3901.29e-0411B4GALNT4*
(0.2–0.8)Mcg24221919−0.0060.5892.18e-047PTPRN2
(0.2–0.8)Mcg18628367−0.0120.2894.13e-047PTPRN2
(0.2–0.8)Mcg03371125−0.0100.3456.71e-0411KCNQ1
(0.2–0.8)Mcg04666029−0.0040.6410.002111KCNQ1
(0.2–0.8)Mcg20533553−0.0120.2210.006811KCNQ1
(0.2–0.8)Mcg19698309−0.0100.3610.009411KCNQ1
(0.2–0.8)Mcg20482223−0.0060.5130.01777SVOPL
(0.2–0.8)Mcg19713140−0.0100.3740.03787PTPRN2
(0.2–0.8)Mcg212311890.0060.7760.04037PTPRN2
(0.2–0.8)Mcg01100465cr−0.0050.6420.04087PTPRN2
(0.2–0.8)Mcg150941190.0040.7720.04637PTPRN2
(0.2–0.8)Mcg15971656−0.0060.3740.048111B4GALNT4*
(0.2–0.8)Pcg20808078−0.0040.7472.87e-061DIRAS3
(0.2–0.8)Pcg25135755−0.0070.6313.14e-0615MAGEL2
(0.2–0.8)Pcg09834049−0.0120.2915.29e-0614CDH24*
(0.2–0.8)Pcg01152488−0.0110.4409.06e-0515MAGEL2
(0.2–0.8)Pcg03439898cr0.0090.7700.00258DLGAP2
(0.2–0.8)Pcg20076070−0.0110.4130.00568DLGAP2
(0.2–0.8)Pcg21113768−0.0080.4640.00696PLAGL1
(0.2–0.8)Pcg08082351−0.0050.7080.01068DLGAP2
(0.2–0.8)Pcg269397210.0050.7020.014715SNRPN
(0.2–0.8)Pcg128181590.0060.7850.02728DLGAP2
(0.2–0.8)Pcg02566775−0.0050.5770.03526PLAGL1
(0.2–0.8)Pcg09212014−0.0070.3290.037514DLK1
(0.2–0.8)P; Mcg03654058−0.0120.3313.96e-0411KCNQ1OT1; KCNQ1
(0.2–0.8)P; Mcg04762676−0.0030.7640.027111KCNQ1OT1; KCNQ1
>0.8Icg09993814−0.0030.8305.19e-0420BLCAP
>0.8Icg10546626−0.0020.8910.003420GNAS
>0.8Icg17820025cr−0.0020.8320.045520BLCAP
>0.8Mcg05821571−0.0030.8255.53e-047PTPRN2
>0.8Mcg04799270−0.0010.9240.00197PTPRN2
>0.8Mcg15012939−0.0030.8550.00277PTPRN2
>0.8Mcg02855778−0.0020.9040.00347PTPRN2
>0.8Mcg02773779−0.0020.8660.00397PTPRN2
>0.8Mcg19100996−0.0020.8960.006112FBRSL1*
>0.8Mcg221724940.0050.8590.007811H19
>0.8Mcg276293840.0080.9520.01197PTPRN2
>0.8Mcg174167930.0100.9420.012711KCNQ1
>0.8Mcg059263140.0050.9370.01297PTPRN2
>0.8Mcg197644890.0070.9060.016411KCNQ1
>0.8Mcg09350411−0.0020.9100.02227PTPRN2
>0.8Mcg120014560.0050.9220.02347PTPRN2
>0.8Mcg083769240.0050.9130.02447PTPRN2
>0.8Mcg27050114cr0.0080.8840.026411KCNQ1
>0.8Mcg06423822−0.0020.8650.02747PTPRN2
>0.8Mcg03647659−0.0010.9550.032011B4GALNT4*
>0.8Mcg24652817−0.0020.8610.03637PTPRN2
>0.8Pcg07599819s−0.0020.8990.001219ZIM2
>0.8Pcg03652257−0.0030.8180.00337GLI3
>0.8Pcg242574950.0050.9300.00388DLGAP2
>0.8Pcg03156547−0.0020.8390.006514CDH24*
>0.8Pcg06450373cr0.0080.8890.008714CDH24*
>0.8Pcg01983373−0.0020.8900.017814DLK1
>0.8Pcg18100008−0.0010.9200.02287GLI3
>0.8Pcg03646329−0.0030.8220.024313RB1
>0.8Pcg11882053−0.0030.8270.025613RB1
Significant age-associated DMCs and allocated genes (n = 22) are shown based on Geneimprint and the Imprintome (see methods). Probe IDs, estimated changes of β-values by age, annotated imprinted genes, and expressed allele are indicated. *Predicted imprinted gene; (EA): expressed alleles: (P) Paternally, (M) Maternally, or (I) Isoform Dependent; crcross-reactive probe; sSNP. Results are ordered by subgroup (SG): unmethylated, UM (<0.2); hemi-methylated, HM ((0.2–0.8)); fully methylated, FM (>0.8); and, sorted by adjusted p-value (FDR <0.05). Note: If DMCs were mapped to multiple genes, only the ICR-related (predicted) imprinted gene is reported.

Comparison of the current findings on age-related DMCs with results from similar studies

In a recent review by the research group of Haaf, 2,355 genes have been reported in sperm with age affected DMRs [11]. Considering the current study results and those found by four other observational studies, we retrieved 2,098 age-associated genes in sperm. Although similar techniques (Illumina platforms) were used, study populations and study conditions differed [912]. A Venn diagram illustrates the numbers of overlapping age-affected genes identified by each report (Figure 7). We counted 929 genes that were found by at least two studies, 83 genes that were found by at least three studies, seven genes that were found by at least four studies, and one gene that was found by all five studies. (Supplementary Table 6). In brief, six genes found by four studies are SLC22A18AS, C7orf50, UTS2R, BEGAIN, GRIN1, and PCDH15; and, a single gene identified by all is DLGAP2. Apart from C7orf50, all genes recurrently found in age-association studies over the last couple of years have been linked to the development of autism (discussed below) [2126]. The role of UTS2R (or GPR14) in autism is unclear. It plays a role in various brain functions, but it has been considered as a candidate gene for autism [27, 28]. Overall, our findings are in line with Bernhardt et al.’s conclusion that age-induced DNA methylation alterations in sperm may contribute to the development of neurodevelopmental disorders in offspring [11].

Comparison of our set of age-related DMCs with earlier published datasets. Venn diagram showing the number of genes that have been identified by different authors as being differentially methylated by age in sperm; starting from a recent review by Bernhardt et al. [11], where 2,355 genes have been selected. Next to our results, the following publications have been included in this comparison: Bernhardt et al., Jenkins et al., Laurentino et al., and Oluwayiose et al. [9–12]. Seven genes (center of diagram) were identified by at least four studies (DLGAP2, SLC22A18AS, C7orf50, UTS2R, BEGAIN, GRIN1, and PCDH15); two are known imprinted genes (DLGAP2 and SLC22A18AS).

Figure 7. Comparison of our set of age-related DMCs with earlier published datasets. Venn diagram showing the number of genes that have been identified by different authors as being differentially methylated by age in sperm; starting from a recent review by Bernhardt et al. [11], where 2,355 genes have been selected. Next to our results, the following publications have been included in this comparison: Bernhardt et al., Jenkins et al., Laurentino et al., and Oluwayiose et al. [912]. Seven genes (center of diagram) were identified by at least four studies (DLGAP2, SLC22A18AS, C7orf50, UTS2R, BEGAIN, GRIN1, and PCDH15); two are known imprinted genes (DLGAP2 and SLC22A18AS).

Assessment of a potential role in autism development of genes linked to age-related DMCs in sperm

Based on our results and reports by others, age-related DMCs in sperm were frequently seen at genes involved in autism. However, no earlier studies verified if these DNA methylation marks were located at sites important in epigenetic inheritance. First, we verified which of the seven recurrently reported genes (see paragraph above) are prone to imprinting. Two of the seven genes are known to be imprinted: DLGAP2 has been reported as a paternally expressed gene, and SLC22A18AS has been listed as a maternally expressed gene [18]. The estimated effects by age based on the current study at five CpG sites allocated to DLGAP2 are illustrated (Figure 8 and Supplementary Tables 4 and 5), directions varied by site; while at the three CpG sites allocated to SLC22A18AS all were hypomethylated by age (Figure 8 and Supplementary Table 4). This suggests that these genes may play an important role in autism development in children born to fathers of an advanced age. Hence, changes in DNA methylation patterns at these genes could be considered as a potential predictor for autism in the next generation.

DNA methylation by age at CpGs linked to DLGAP2 and SLC22A18AS. Estimates of DNA methylation in β-values by age for all significant CpG sites annotated to DLGAP2 (A–E) and SLC22A18AS (F–H) imprinted genes are shown; DNA methylation changes at these genes have been identified by at least three other age-related studies. Fitted regression lines are shown. Regression models included potential confounding factors (BMI and patient status); these were corrected for multiple testing (BH-method). The following CpG sites were annotated to DLGAP2: (A) cg03439898 (Delta β = 0.0085, p-value = 0.0025), (B) cg20076070 (Delta β = −0.011, p-value = 0.0056), (C) cg08082351 (Delta β = −0.0046, p-value = 0.7083), (D) cg12818159 (Delta β = 0.0059, p-value = 0.0272), and (E) cg24257495 (Delta β = 0.0049, p-value = 0.0038). The following CpG sites were annotated to SLC22A18AS: (F) cg08222610 (Delta β = −0.0153, p-value = 6.158515e-06), (G) cg26874323 (Delta β = −0.0097, p-value = 1.840627e-05), (H) cg23335134 (Delta β = −0.005, p-value = 6.076208e-05).

Figure 8. DNA methylation by age at CpGs linked to DLGAP2 and SLC22A18AS. Estimates of DNA methylation in β-values by age for all significant CpG sites annotated to DLGAP2 (AE) and SLC22A18AS (FH) imprinted genes are shown; DNA methylation changes at these genes have been identified by at least three other age-related studies. Fitted regression lines are shown. Regression models included potential confounding factors (BMI and patient status); these were corrected for multiple testing (BH-method). The following CpG sites were annotated to DLGAP2: (A) cg03439898 (Delta β = 0.0085, p-value = 0.0025), (B) cg20076070 (Delta β = −0.011, p-value = 0.0056), (C) cg08082351 (Delta β = −0.0046, p-value = 0.7083), (D) cg12818159 (Delta β = 0.0059, p-value = 0.0272), and (E) cg24257495 (Delta β = 0.0049, p-value = 0.0038). The following CpG sites were annotated to SLC22A18AS: (F) cg08222610 (Delta β = −0.0153, p-value = 6.158515e-06), (G) cg26874323 (Delta β = −0.0097, p-value = 1.840627e-05), (H) cg23335134 (Delta β = −0.005, p-value = 6.076208e-05).

When screening the 95 age-related imprinted genes based on the Geneimprint database (Supplementary Table 4), we found 28 genes that have been reported in autism development. When considering cross-reactivity of probes, the gene C6orf145 was excluded from our final list, resulting in 27 ASD-related imprinted genes (Figure 6). Eleven of these imprinted genes have been linked to a reported ICR (Figure 6, lower right panel); five of these are included in the SFARI database (see Methods): MAGEL2 (score 1; high confidence for ASD), GNAS, and DLGAP2 (score 2; strong candidate for ASD), GRB10 (score 3; suggested evidence for ASD), and FBRSL1 (unknown score). Based on other databases (including genes found in multiplex ASD families), the following six genes are also listed: B4GALNT4, KCNQ1, KCNQ1OT1, PLAGL1, CDH24, and PTPRN2 (Figure 6) [25, 29]. Finally, these eleven genes form a first set of age-related biomarkers (Figure 6). Based on important functional characteristics of epigenetic inheritance, current and earlier findings on age-related epigenetic marks in sperm and ASD risk, we ranked our findings by several characteristics (significance, magnitude of change, overlapping findings with earlier reports, multiple age-related CpGs within the same gene, promoter or island content) (Table 5). Except for the FBRSL1 gene, all 11 genes had a “score of interest” of 3 or higher. We used this numerical value (3) to further select for potential biomarkers within the remaining set of ASD-related imprinted genes without a yet established link to an ICR. Hence, additional genes with a “score of interest” of at least 3 are: OTX1, PRDM16, and SLC26A10. The latter is listed in the NCBI Gene database as a pseudogene; hence, we do not select for this gene in our final list of potential biomarker genes. Consequently, we found thirteen candidate imprinted genes as potential biomarkers for age-related inheritance of ASD from father to child. These are illustrated in Volcano plots by DNA methylation subgroups (Figure 2), and fitted regression lines are shown for each gene (Figures 8 and 9).

Table 5. ASD-associated (predicted) imprinted genes and scoring.

GeneChrProbe IDICRCoMulti CpGTop 90MaOpIsPrScore of interest
MAGEL215cg10073842101110015
DLGAP28cg03439898111000104
GNAS20cg21809160101010104
B4GALNT411cg10798664101000013
GRB107cg27006764100000113
KCNQ111cg03371125101000013
KCNQ1OT1; KCNQ111cg03654058101000013
OTX12cg10487970000001113
PLAGL16cg21113768101000013
PTPRN27cg04937416101000103
PRDM161cg10588310001001103
CDH2414cg03156547101001003
SLC26A1012cg08177625001001013
ATP10A15cg06066676001000012
FBRSL112cg19100996100001002
HOXB317cg10585948001000012
ANO111cg11058904000010001
C6orf1456cg18815879cr000000011
FOXG114cg18299578000000011
GLIS39cg13804450001000001
LMX1B9cg13466694001000001
MAGI27cg00110846001000001
NTM11cg12079699001000001
OBSCN1cg18477163000000101
UBE3A15cg12060334001000001
ADAMTS165cg26892415000000000
DDC7cg15001032000000000
RNU5D5cg19107296000000000
ASD-associated (predicted) imprinted genes and related probes. Genes were linked to ASD disorders according to previously published databases (see methods). “Score of interest” is given by the following criteria: (ICR) listed by others as being close to an ICR; (Co) confirmed by at least four studies (including the current one); (Multi CpG) age-association was found at more than one CpG of the gene reported; (Top 90) belongs to the most significant top 90 age-related DMCs; (Ma) belongs to the highest magnitude in change by age (Delta-M > 0.1); (Op) opposite direction in methylation change; (Is) located at CpG island; (Pr) located at promoter region. Note: “1” means at least one CpG was found with this characteristic; “0” means none of the DMCs or genes met our criterium. Last column is the sum of eight criteria. crGene linked to a cross-reactive probe. Genes are listed from highest to lowest scores. For each gene, probe with highest score is shown. Note: one gene (SLC26A10) is reported by the NCBI Gene database as a pseudogene in human.
DNA methylation by age at CpGs linked to imprinted genes and related to ASD. Estimates of DNA methylation in β-values by age of CpG sites mapped to our selected list of ASD-related imprinted genes (with exception of CpGs mapped to DLGAP2, which are shown in Figure 8). Fitted regression lines are shown. Our regression models included potential confounding factors (BMI and patient status), and we corrected for multiple testing (BH-method). (A–E) maternally expressed genes; (F–J) paternally expressed genes; (K, L) isoform dependent transcribed gene. If more than one CpG site per gene was significant, the one with the lowest p-value is shown.

Figure 9. DNA methylation by age at CpGs linked to imprinted genes and related to ASD. Estimates of DNA methylation in β-values by age of CpG sites mapped to our selected list of ASD-related imprinted genes (with exception of CpGs mapped to DLGAP2, which are shown in Figure 8). Fitted regression lines are shown. Our regression models included potential confounding factors (BMI and patient status), and we corrected for multiple testing (BH-method). (AE) maternally expressed genes; (FJ) paternally expressed genes; (K, L) isoform dependent transcribed gene. If more than one CpG site per gene was significant, the one with the lowest p-value is shown.

Discussion

This study revealed an overall decrease in DNA methylation at 14,622 CpG sites in sperm by age. In our unique search for age-related signatures in sperm featuring epigenetic inheritance from father to offspring we identified 95 imprinted genes. Only two of these identified genes have been found in similar studies: DLGAP2 and SLC22A18AS [912, 30]. SLC22A18AS, encoding a lncRNA, is a paternally imprinted gene [18]. While this gene is not listed in SFARI, two independent study cohorts in children with neurodevelopmental disorders showed aberrancies in expression levels or epimutations at regulatory sites of SLC22A18AS [22, 23]. Our data show a decrease in DNA methylation percentage by age at all three CpG sites of SLC22A18AS (Figure 8). While not selected as a potential biomarker for ASD-risk in the current study, we believe more research is needed to identify this gene’s potential role in inheritance of ASD. The DLGAP2 gene has been listed as a strong ASD candidate in SFARI and its imprinted domain has been validated by others [17, 18]. DLGAP2 is a paternally expressed gene, but we measured an average DNA methylation of 78% in sperm. Higher methylation percentages than anticipated have also been measured by others [31, 32]. We found that age-related DMCs near DLGAP2 are either hypermethylated (at islands, and open sea) or hypomethylated (at shores), which suggests that expression of this gene is malleable by environmental stressors. However, not all probes have been mapped to an ICR, and more research on the role of methylation in these regions is needed. Hypomethylation in sperm has also been reported in sperm cannabis users [31]. A role for intergenerational inheritance of these altered DNA methylation patterns at DLGAP2 by cannabis exposure, has been demonstrated in a rat model [32]. Other studies in rat indicate that the DLGAP2 methylation status can also be modified by conditions similar to post-traumatic stress in human [33]. According to a review by Rasmussen et al., DLGAP2 belongs to a family with a direct link to a variety of neurological and psychological disorders. Expression profiles are mainly found in testis and brain, and aberrancies in these expression patterns have been associated with schizophrenia and autism spectrum disorders [34, 35]. This confirms that this gene, important in early development of the brain, is highly susceptible to environmental influences.

After mapping our findings to a list of earlier reported ICRs, 22 genes were retrieved of which 11 have been related to ASD (Figure 6). Notably, 11 out of the 22 genes were also reported by a separate study performed by Court et al., where differentially methylated regions were verified in a range of human tissues (including sperm, embryos and somatic tissues) [36]. Five of these genes verified by Court et al. -providing evidence for inheritable epigenetic states- have been linked to (neuro)developmental disorders: MAGEL2, KCNQ1OT1, PLAGL1, GNAS, GRB10. All are included in our selected biomarkers, using an independent ranking system (Figure 6 and Table 5). Among all identified imprinted genes (n = 95), the gene MAGEL2 is matching multiple criteria, such as high magnitude in change by age, being within the top-90, at least one DMC at a promotor region, and having multiple DMCs of highest degree of significance; for instance, at probe cg10073842 we measured a Delta β of −0.022 (or −2.2% per year) (p = 1.84e-06). MAGEL2 is listed in the SFARI database and some DMCs could be allocated to an ICR (Supplementary Tables 5 and 7). The latter was (re)confirmed through data from an independent investigation where Illumina probes were mapped to candidate ICRs [37]. Aberrancies or variations in methylation at these sites have been linked to various diseases [36, 37]. In addition to its role in autism, it has also been reported as a contributor to Prader-Willi, Angelman, and Schaaf-Yang syndromes [3841]. Another interesting finding is the paternally imprinted gene PTPRN2. DMCs related to this gene showed an age-related decrease in methylation; including one DMC within the FM subgroup that was located at an island region. PTPRN2 plays an important role in vesicle-mediated secretory processes, such as the accumulation of neurotransmitters norepinephrine, dopamine and serotonin in the brain [42]. Copy number variations of this gene have been found to be associated with ASD in a Brazilian cohort [43]. DNA methylation changes have been investigated in other populations, as well as by other exposures than ageing. For instance, hypomethylation at PTPRN2 was found in cord blood of children exposure to organochlorine compounds (DDE) in families at the Faroe Islands [44]. However, hypomethylation in sperm could not be linked to DDE exposure in the same Faroese population. Instead, the authors found both directions in DNA methylation changes; significant hypo and hypermethylation at multiple DMCs of PTPRN2. They concluded that this was suggestive for a regulatory role and potential inheritance of ASD [45]. Similarly, we found nine age-related DMCs at the paternally imprinted gene B4GALNT4. These DMCs are either located in the body or the promoter region; we measured an overall decrease in DNA methylation. Liu et al. showed in a rat model that B4GALNT4 plays a role in brain function and development, including the development of ASD [46]. We further found age-related DNA methylation alterations at promotor regions of several other genes. In the case of KCNQ1, this gene is member of the KCNQ voltage-gated potassium channel family, implicated in the pathogenesis of several forms of autism spectrum disorders [47, 48]. Hypomethylation was also measured at PRDM16 and SLC26A10; however, as the latter is reported as a pseudogene in human with unknown function we did not select SLC26A10. Aberrant PRDM16 expression has been linked to cardiovascular diseases and migraine in human [49]. In mice, Prdm16 is expressed in brain throughout middle and late stages of cortical neurogenesis, as well as during early post-natal development. This gene also plays a role in gene regulation through epigenetic mechanisms, and it co-operates with Pax6 [50]. Interestingly, heterozygous spontaneous mutant mice with a mutation in Pax6 show phenotypes corresponding to autism in human; behavioral abnormalities in these mice are more prominent in offspring from aged fathers compared to young fathers [51, 52]. Next, we found age-related alterations at the promoter regions of the following genes: PLAGL1, GRB10, KCNQ1OT1 and OTX1. DNA methylation differences have been found at imprinted loci of PLAGL1 in patients with autism versus controls [38]. The same study also indicated epivariations at imprinted sites of GNAS in patients with autism. Note, according to Carreras-Gallo et al.’s report, the identified age-related DMC of GNAS in the current study is part of a reported ICRs [37]. Next, based on animal studies, GRB10 is also a contributor to ASD pathogenesis. Its transcript forms a complex with the autism-associated protein GIGYF1 to regulate the IGF-1R/ERK signaling pathway, important in brain development [53]. In a study of postmortem temporal cortex samples from ASD subjects, histone modifications could be detected at GRB10 [54]. The authors suggested that the measured GRB10 deacetylation could represent an epigenetic mechanism in idiopathic autism linked to pathways that are also affected by rare genetic variants causing syndromic ASD. This supports earlier insights that rare coding variants alone cannot account for all ASD cases, but epigenetic alterations (by environmental conditions) may contribute to many ASD cases [55]. GRB10 is listed as an isoform dependent imprinted gene because of its complex imprinted expression pattern. It is expressed maternally in most tissues, but it is paternally expressed (or maternally methylated) in the early embryo and in the brain [56]. Deletion of the paternal allele in mice resulted in offspring with an aberrant social behavior, such as social dominance [57]. Alterations at imprint regulatory regions of KCNQ1OT1 have been linked to autism and Beckwith-Wiedemann and Silver–Russell syndromes [58]. A DNA methylation alteration was found at KCNQ1OT1 in patients with autism or intellectual disability [59]. An overlap in disease outcomes was also found for FBRSL1. This gene is included in the SFARI and Homs et al.’s databases, but it has also been linked to a disability syndrome, including heart defects and intellectual disability. Evidence suggests this gene orchestrates gene expression in neurogenesis and in the development of autism. An assessment of DNA methylation patterns for this gene has been considered as a future diagnostic tool for ASD [60, 61]. The gene OTX1 encodes a transcription factor important in brain and sensory organ development [62, 63]. It is also known as a tumor promotor, suggesting its broader involvement in developmental and proliferative processes. In the context of ASD, OTX1 has emerged as a strong candidate gene (SFARI score 2, meaning that the association has been shown earlier with high confidence) [21]. CDH24 (cadherin 24) is a paternally imprinted gene. In general, this gene mediates cell adhesion (particularly in the brain) and is involved in the formation of intracellular signaling pathways. But early genome wide association studies revealed some autism-associated cadherins [64].

In summary, imprinted genes selected in the current age-association study as potential epigenetic biomarkers for ASD in offspring are: OTX1, PRDM16, PTPRN2, B4GALNT4, KCNQ1, KCNQ1OT1, DLGAP2, PLAGL1, GNAS, GRB10, MAGEL2, CDH24, and FBRSL1; these are highlighted in Figure 2 and fitted regression lines are illustrated for at least one probe per gene (Figures 8 and 9). We measured small DNA methylation differences by age; ranging from 0.07% (B4GALNT4) to 2.75% (PTPRN2) per year. However, if fertilization were to occur with a sperm cell that carries one (or multiple) aberrantly methylated sites at one (or several) of these genes, this developing child may be at risk to have ASD. For instance, at OTX1 -a gene with an average change per year- our linear estimates shows that a ten-year increase in paternal age could result in 1.2% more sperm cells being hypermethylated. On an individual level this can be translated to 1 200 000 aberrant sperm cells in an ejaculate of 100 million cells, associated with paternal age. While the chance is low that the affected sperm will result in a pregnancy, on a population level 1.2 % more children with an increased risk to develop ASD is an important contribution to public health. However, this is still hypothetical as other factors play a role in ASD development as well, and current knowledge is too limited to understand the effects of one or more affected CpGs at one gene. Even less is known about the combined effect of paternal age on multiple imprinted genes and possible interactions with other determinants.

We detected limited age-related DMCs at CpG islands; only 4.5%, while the expected island content within the array is about 31% [46]. We hypothesize that CpG islands are more protected against changes by ageing or related factors, because of their importance in gene regulation. Yet, the limited number of significant DNA methylation alterations we have measured at these sites in sperm may be important in evolution and inheritance of specific phenotypes. Notably, our selection of biomarkers was based on several stringent criteria, including ASD-related databases. It is possible that other (unknown) autism-associated genes may have been missed. Consequently, our list of biomarkers is not final and should not be taken strictly. It is a first step towards a more refined and validated set of genes sensitive to epigenetic changes by age. As our sample size is small, other populations of men (on a larger scale and using a broader age-range) should be tested before implementing our findings in clinic. Another reason why we acknowledge its prematurity, is the integration of data from lists of ICRs and imprinted genes that are extensive but not yet fully developed [17]. It has been estimated that the human genome comprises between 300 and 1,000 imprinted genes [65]. Theoretically, this means that male sperm harbors hundreds of important regulatory sites yet to be discovered. As indicated, because of variation in populations, men living in different geographical areas or men with different cultural backgrounds may have different patterns of DNA methylation as they age. For example, in a recent study in the UK, differences were seen in DNA methylation patterns by biological ageing and by housing circumstances [66]. We do not exclude that the use of age is a proxy for a mixture of environmental exposures that coincide with ageing. A remarkable finding by Wang et al. shows that depending on the method used to select a sperm cell in fertility treatment procedures, children may have higher risk for autism [67]. Given the need for more research on the exposure impact on sperm epigenetics, age remains an important variable to include in prediction modeling of acquired diseases. A potential weakness of our study is its limited age-range. The current study does not provide an answer to the potential ASD-risk associated with fathering a child at older age (e.g., 40 years old or older). Nevertheless, we detected significant DNA methylation alterations within a range of 18 years (from 18 till 35 years old). This covers the age when most men father their first child. Literature raises the problem of the biological clock of the male germ line in ageing men and related effects on the increasing numbers of children with ASD or other neurodevelopmental disorders. It has been suggested that age of the father gradually increases the risk for autism in offspring [68]. In a meta-analysis it was calculated that every 10 years increase in age results in 21% increase in risk for ASD in offspring (OR: 1.21; 95% CI: 1.18–1.24) [69].

Finally, we do not exclude a potential higher risk for other disorders (than autism) in children from men of advanced age. As hypothesized by other researchers, such as Levine et al., epigenetic biomarkers of ageing could be used to capture an overall risk for future diseases [70]. Because of recurrent findings in literature on autism-related outcomes, we focused on ASD databases in our final analysis, and we did not explore our list of 14,622 DMCs on other potential outcomes. Hence, although our results point to a (first) set of candidate biomarkers for one disorder, ASD, we believe that the sperm epigenome contains more signatures to be explored. A strength of studying imprinting in sperm -and potential offspring outcomes- is the fact that these signatures may persist in somatic cells of the next generation. This is particularly important in research on brain related disorders. Studying causes of autism in humans is complicated, especially because key tissues (brain) are not available for clinical testing. Hence, as indicated earlier by the research group of Craig, researchers can either study postmortem samples, test mechanisms associated with the disorder, or identify and evaluate biomarkers for risk or prognosis of autism [71]. However, our age-associated epigenetic marks in sperm should still be present in somatic cells of offspring. While our study design was limited to sperm analyses only, and inheritance could not be tested, including these biomarkers in future research on children (considering paternal age in children with versus without ASD) would facilitate the creation of a causation model for autism, with no (or limited) need of postmortem tissues. In this way, if explored and validated further on a larger scale, it would be possible to estimate the effect (or risk) for ASD in children by preconceptional conditions of the father. Moreover, although risk at individual level may be small, the effect on a population level is an important result to consider, especially in the context of future policy recommendations. For instance, educational programs would benefit from this knowledge. Specifically, awareness could be increased in young men about the risks for having a child with autism, if they postpone fathership; and in clinic, appropriate preconceptional counseling of couples in which the man is of advanced age could be implemented. Our results may further contribute to new insights on subtypes of autism, reported by the group of Troyanskaya [72, 73]. In their most recent publication, they were able to subdivide autism by specific genetically underlying mechanisms. In brief, some autism subtypes may be caused by de novo mutations in DNA at coding regions, while other subtypes may have their origin in changes at non-coding regions or sites with allele-specific regulatory activities. This aligns with our finding and supports the importance of a -yet underestimated role- of epigenetics in the onset and heterogeneity of neurodevelopmental diseases.

Materials and Methods

Study participants and sample collection

Male volunteers were recruited as part of The Influence of the Environment on Gametic Epigenetic Reprogramming (TIEGER) study. The cross-sectional design, sample and data collection, and selection criteria of this North Carolina, USA-based study have been published previously; a flow chart of the current study population of 63 men is included in Supplementary materials (Supplementary Figure 1) [15]. In brief, eligibility criteria at recruitment included: non-smoking, no personal history of cancer, no vasectomy or other procedures that may cause infertility, age range of 18–35, and Caucasian; this was applied to keep our population relatively homogenous in this small study sample. In the clinic, semen was analyzed for standard clinical parameters after liquefaction, no later than 60 minutes from collection. These parameters included volume, pH, viscosity, liquification time, presence of white blood cells, concentration, and motility. The World Health Organization’s (WHO) Laboratory Manual for the Examination and Processing of Human Semen 5th edition was referenced for normal values [74]. After completion of the clinical sperm analyses, the samples were subjected to two-step ISolate-gradient centrifugation (Irvine Scientific) to select a motile population enriched in normal morphology. This colloidal silica gradient, consisting of a 90% lower layer and 50% upper layer, was prepared by sequentially adding 1.5 ml of each layer to a 15 ml polystyrene conical tube. The sperm sample was pipetted on top of the upper layer and centrifuged at 200× g for 15 minutes. The gradient solution was removed, and pelleted sperm were stored at −80°C for subsequent DNA methylation analysis [15].

DNA methylation measurements in sperm, data preprocessing, and validation

Sperm genomic DNA was extracted and prepared for Illumina HumanMethylation450 BeadChip as previously described [15]. Methylation levels range from 0% to 100%; where 0% means no sperm cell is methylated at a specific CpG site, and 100% means that all sperm cells in a sample are methylated. Then, we scaled our data in an interval of (0, 1). Hence, DNA methylation values are represented as proportions ranging from 0 (if “unmethylated”) to 1 (if “methylated”). Our preprocessing scenario included sample quality control, filtering of low intensities probes, Beta Mixture Quantile normalization (BMIQ), and logit transformation of default β-values into M-values; M = log(β/1−β) (Supplementary Figure 2) [7577]. Probes with low quality signal (detection p-value greater than 0.01 in at least one sample) were filtered out. Our final dataset represents outcomes of 482,287 probes (out of 485,512), referred as “450K”. Validation tests were performed through data obtained after bisulfite pyrosequencing. These protocols, including PCR conditions and primer sequences were reported previously [15, 78, 79]. In brief, using the Pyrosequencing WorkStation, the single strand was isolated and then underwent pyrosequencing using a PyroMark Q96 MD pyrosequencing instrument (Qiagen). The region upstream from IGF2 exon 3, including three CpG dinucleotides (chr 11p15.5), was tested, as well as the DMR for H19, including four CpG sites (chr 11 p15.5). Assay validation data and the results of sensitivity tests for pyrosequencing have been published [15, 78, 79]. At IGF2, an increase of 1 year of age showed a decrease in DNA methylation (β-value = −0.10; SE = 0.04; p = 0.02). At H19, an increase of 1 year of age corresponded to a significant increase in DNA methylation (β-value = +0.14; SE = 0.05; p = 0.009). These results are similar to our findings in the current approach, when using the 450K array (e.g., all DMCs at IGF2 are negatively associated with age with a mean delta β-value of −0.01; at H19, one age-associated DMC was found, with a delta β-value of +0.0054).

Statistical analysis

Two main analyses were assessed to better understand how age may affect sperm DNA methylation. We analyzed 1) age versus global DNA methylation, and 2) age versus DNA methylation at each CpG site of the 450K. A workflow illustrates our approach (Figure 1). The mean β-value was measured at all CpG sites of the 450K. Using this mean we calculated the global DNA methylation in sperm for each subject. Pearson correlation coefficient was used to measure the correlation between global DNA methylation and age, and Mann-Whitney U-test was used to evaluate the difference in global DNA methylation for different subgroups of men. We used the mean β-value (in all subjects, for each CpG site) to classify each CpG site by its potential biological relevance, as suggested in literature [80]. Consequently, a CpG site with a mean β-value <0.20 was defined as “unmethylated” (UM), as “hemi-methylated” (HM) if 0.20≤ mean β-value ≤0.80, and as “fully methylated” (FM) if mean β-value >0.80. To estimate age-related associations, outcomes of the 450K -expressed as M-values- were included as dependent variables in a linear regression model. Age, BMI and patient status were added as independent variables. We used moderated t-statistic and corrected for multiple testing using the Benjamini-Hochberg (BH) method. Significant threshold was defined as FDR <0.05. Notably, the use of M-values in regression modeling -while less intuitive than β-values- provides a better estimate of age-related effects on DNA methylation as data of M-values are approximately homoscedastic. We back-transformed delta M-values to delta β-values, using Kruppa et al.’s “intercept method”; as follows:

Δβ=2(M0+delta M)1+2(M0+delta M)2M01+2M0;

where M0 indicates the baseline DNA methylation (intercept) [81, 82]. Volcano plots and fitted regression lines (based on estimated β-values) were used to graphically represent the results of our linear regression models (Figures 2, 8 and 9). Sensitivity tests were performed, removing outliers. We also repeated our analyses by excluding patients, and by adding additional co-factors in our models, such as sperm quality. These tests did not result in differences in our final results. In a post-hoc analysis we also repeated our analyses excluding 73,735 CpGs reported at SNPs [19]. We investigated cross-reactivity of probes using DMRcate package, based on Chen et al.’s list [20, 83]. SNPs and cross-reactive probes are indicated in our tables with “s” and “cr”, respectively. To investigate the chromosomal distribution of all age-associated DMCs, a Miami plot was generated, showing the direction of DNA methylation change by age per chromosome (hypermethylated CpG sites, Delta Beta >0 vs. hypomethylated CpG sites, Delta Beta <0). We used the Illumina annotation manifest to explore DMCs by island content, their functional genomic allocation and associated genes [84]. To investigate the potential role of differentially methylated CpG sites (DMCs) in imprinting, we identified genes corresponding to age-associated sites that were previously listed in the Geneimprint database, and as nearest transcript of a candidate imprint control region (ICR) [17]. Moreover, we verified if DMCs allocated to those genes correspond to probes identified by Carreras Gallo et al. as part of related ICRs [37]. In order to compare our results with earlier reports, we used datasets from Bernhardt et al. [11], Jenkins et al. [9], Laurentino et al. [10], and Oluwayiose et al. [12]. Finally, we used the following databases to elaborate our study on ASD: The Simons Foundation Autism Research Initiative (SFARI) database (which includes a gene scoring system to reflect the strength of evidence for its genetic involvement in ASD) and few other studies reporting ASD-related gene databases; including also ASD risk genes based on families with multiple affected children [29, 30, 85]. We also used the following open access resources: the Human Protein Atlas and the NCBI gene database [42, 86]. We ranked our findings by the following characteristics (Table 5): Top 90 most significant (yes/no), delta-M >0.1 (yes/no), opposite direction (yes/no), located at CpG island (yes/no), located at promoter region (yes/no), and listed by at least three other studies (yes/no). The sum was called “score of interest” (Table 5).

Our analysis was conducted in R (version 4.2.2.). Data preprocessing and annotation were conducted using Bioconductor packages, including missMethyl for GO terms enrichment analysis [8790]. Upon request, full lists of data will be made available upon documentation of approval by a recognized Institutional Review Board (IRB).

Abbreviations

ASD: Autism Spectrum Disorders; BH: Benjamini-Hochberg; BMIQ: Beta Mixture Quantile; DMC: Differentially Methylation CpG Site; FM: Fully Methylated; GO: Gene Ontology; HM: Hemi-methylated; ICR: Imprint Control Region; SFARI: Simons Foundation Autism Reference Initiative; SNP: Single Nucleotide Polymorphisms; TIEGER: The Influence of the Environment on Gametic Epigenetic Reprogramming; UM: Unmethylated.

Author Contributions

EC, JD and AS developed the research question and wrote the manuscript. JD was equally involved in the interpretation and discussion of the research results. MB performed the data cleaning and initiated data analysis. EC performed the major statistical analysis. EC designed and performed post-hoc comparisons. CD contributed to the interpretation of the data and results on outcomes related to psychiatric disorders. AS and CH developed the epidemiological study design. SM developed the experimental study design and oversaw DNA methylation data generation. AS initiated the overall study hypothesis of the TIEGER study and supervised the overall interpretation of the data analysis. RLJ, CH, SM, PA and TP edited the manuscript. All authors have critically reviewed the manuscript and have given final approval for publication.

Acknowledgments

We acknowledge the patients of the TIEGER study, Erin White, M.Sc., for her clinical contributions, Carole Grenier, B.Sc., for outstanding laboratory assistance.

Conflicts of Interest

We declare no competing interests. The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report.

Ethical Statement and Consent

The TIEGER Study was performed with approval of the Duke University Institutional Review Board (reference number: Pro00036645). Informed consent was obtained from all participants.

Funding

This work was supported by a Health and the Environment award from the Duke Cancer Institute and the Duke Nicholas School of the Environment, a research grant from FWO (G0C8523N), co-funded by the FPS Health, Food Chain Safety and Environment, Belgium. This work was conducted in the framework of the European Partnership for the Assessment of Risks from Chemicals (PARC) and has received funding from the European Union’s Horizon Europe research and innovation program under Grant Agreement No 101057014. Views and opinions expressed are, however, those of the author(s) only and do not necessarily reflect those of the European Union or the Health and Digital Executive Agency. Neither the European Union nor the granting authority can be held responsible for them.

References

  • 1. Van Opstal J, Fieuws S, Spiessens C, Soubry A. Male age interferes with embryo growth in IVF treatment. Hum Reprod. 2021; 36:107–15. https://doi.org/10.1093/humrep/deaa256 [PubMed]
  • 2. Katz-Jaffe MG, Parks J, McCallie B, Schoolcraft WB. Aging sperm negatively impacts in vivo and in vitro reproduction: a longitudinal murine study. Fertil Steril. 2013; 100:262–8.e1-2. https://doi.org/10.1016/j.fertnstert.2013.03.021 [PubMed]
  • 3. Milekic MH, Xin Y, O'Donnell A, Kumar KK, Bradley-Moore M, Malaspina D, Moore H, Brunner D, Ge Y, Edwards J, Paul S, Haghighi FG, Gingrich JA. Age-related sperm DNA methylation changes are transmitted to offspring and associated with abnormal behavior and dysregulated gene expression. Mol Psychiatry. 2015; 20:995–1001. https://doi.org/10.1038/mp.2014.84 [PubMed]
  • 4. Pan N, Lin LZ, Wang X, Guo CH, Jing J, Li XH. Association between paternal age at childbirth and autism spectrum disorder in offspring. Zhongguo Dang Dai Er Ke Za Zhi. 2022; 24:863–8. https://doi.org/10.7499/j.issn.1008-8830.2203146 [PubMed]
  • 5. Miller B, Messias E, Miettunen J, Alaräisänen A, Järvelin MR, Koponen H, Räsänen P, Isohanni M, Kirkpatrick B. Meta-analysis of paternal age and schizophrenia risk in male versus female offspring. Schizophr Bull. 2011; 37:1039–47. https://doi.org/10.1093/schbul/sbq011 [PubMed]
  • 6. Grosvenor LP, Croen LA, Lynch FL, Marafino BJ, Maye M, Penfold RB, Simon GE, Ames JL. Autism Diagnosis Among US Children and Adults, 2011-2022. JAMA Netw Open. 2024; 7:e2442218. https://doi.org/10.1001/jamanetworkopen.2024.42218 [PubMed]
  • 7. Ha AS, Scott M, Zhang CA, Li S, Langroudi AP, Glover F, Basran S, Del Giudice F, Shaw GM, Eisenberg ML. Sociodemographic Trends and Perinatal Outcomes in Fathers 50 Years and Older. JAMA Netw Open. 2024; 7:e2425269. https://doi.org/10.1001/jamanetworkopen.2024.25269 [PubMed]
  • 8. Vervoort I, Delger C, Soubry A. A multifactorial model for the etiology of neuropsychiatric disorders: the role of advanced paternal age. Pediatr Res. 2022; 91:757–70. https://doi.org/10.1038/s41390-021-01435-4 [PubMed]
  • 9. Jenkins TG, Aston KI, Pflueger C, Cairns BR, Carrell DT. Age-associated sperm DNA methylation alterations: possible implications in offspring disease susceptibility. PLoS Genet. 2014; 10:e1004458. https://doi.org/10.1371/journal.pgen.1004458 [PubMed]
  • 10. Laurentino S, Cremers JF, Horsthemke B, Tüttelmann F, Czeloth K, Zitzmann M, Pohl E, Rahmann S, Schröder C, Berres S, Redmann K, Krallmann C, Schlatt S, et al. A germ cell-specific ageing pattern in otherwise healthy men. Aging Cell. 2020; 19:e13242. https://doi.org/10.1111/acel.13242 [PubMed]
  • 11. Bernhardt L, Dittrich M, Prell A, Potabattula R, Drummer C, Behr R, Hahn T, Schorsch M, Müller T, Haaf T. Age-related methylation changes in the human sperm epigenome. Aging (Albany NY). 2023; 15:1257–78. https://doi.org/10.18632/aging.204546 [PubMed]
  • 12. Oluwayiose OA, Wu H, Saddiki H, Whitcomb BW, Balzer LB, Brandon N, Suvorov A, Tayyab R, Sites CK, Hill L, Marcho C, Pilsner JR. Sperm DNA methylation mediates the association of male age on reproductive outcomes among couples undergoing infertility treatment. Sci Rep. 2021; 11:3216. https://doi.org/10.1038/s41598-020-80857-2 [PubMed]
  • 13. Jenkins TG, Aston KI, Cairns B, Smith A, Carrell DT. Paternal germ line aging: DNA methylation age prediction from human sperm. BMC Genomics. 2018; 19:763. https://doi.org/10.1186/s12864-018-5153-4 [PubMed]
  • 14. Jenkins TG, James ER, Alonso DF, Hoidal JR, Murphy PJ, Hotaling JM, Cairns BR, Carrell DT, Aston KI. Cigarette smoking significantly alters sperm DNA methylation patterns. Andrology. 2017; 5:1089–99. https://doi.org/10.1111/andr.12416 [PubMed]
  • 15. Soubry A, Guo L, Huang Z, Hoyo C, Romanus S, Price T, Murphy SK. Obesity-related DNA methylation at imprinted genes in human sperm: Results from the TIEGER study. Clin Epigenetics. 2016; 8:51. https://doi.org/10.1186/s13148-016-0217-2 [PubMed]
  • 16. Shnorhavorian M, Schwartz SM, Stansfeld B, Sadler-Riggleman I, Beck D, Skinner MK. Differential DNA Methylation Regions in Adult Human Sperm following Adolescent Chemotherapy: Potential for Epigenetic Inheritance. PLoS One. 2017; 12:e0170085. https://doi.org/10.1371/journal.pone.0170085 [PubMed]
  • 17. Jima DD, Skaar DA, Planchart A, Motsinger-Reif A, Cevik SE, Park SS, Cowley M, Wright F, House J, Liu A, Jirtle RL, Hoyo C. Genomic map of candidate human imprint control regions: the imprintome. Epigenetics. 2022; 17:1920–43. https://doi.org/10.1080/15592294.2022.2091815 [PubMed]
  • 18. Jirtle R. Imprinted Genes by Species. 2025.
  • 19. Naeem H, Wong NC, Chatterton Z, Hong MK, Pedersen JS, Corcoran NM, Hovens CM, Macintyre G. Reducing the risk of false discovery enabling identification of biologically significant genome-wide methylation status using the HumanMethylation450 array. BMC Genomics. 2014; 15:51. https://doi.org/10.1186/1471-2164-15-51 [PubMed]
  • 20. Peters TJ, Buckley MJ, Chen Y, Smyth GK, Goodnow CC, Clark SJ. Calling differentially methylated regions from whole genome bisulphite sequencing with DMRcate. Nucleic Acids Res. 2021; 49:e109. https://doi.org/10.1093/nar/gkab637 [PubMed]
  • 21. Abrahams BS, Arking DE, Campbell DB, Mefford HC, Morrow EM, Weiss LA, Menashe I, Wadkins T, Banerjee-Basu S, Packer A. SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Mol Autism. 2013; 4:36. https://doi.org/10.1186/2040-2392-4-36 [PubMed]
  • 22. Kuwano Y, Kamio Y, Kawai T, Katsuura S, Inada N, Takaki A, Rokutan K. Autism-associated gene expression in peripheral leucocytes commonly observed between subjects with autism and healthy women having autistic children. PLoS One. 2011; 6:e24723. https://doi.org/10.1371/journal.pone.0024723 [PubMed]
  • 23. Iourov IY, Vorsanova SG, Korostelev SA, Zelenova MA, Yurov YB. Long contiguous stretches of homozygosity spanning shortly the imprinted loci are associated with intellectual disability, autism and/or epilepsy. Mol Cytogenet. 2015; 8:77. https://doi.org/10.1186/s13039-015-0182-z [PubMed]
  • 24. Potabattula R, Prell A, Dittrich M, Nava C, Depienne C, Bejaoui Y, El Hajj N, Hahn T, Schorsch M, Haaf T. Effects of paternal and chronological age on BEGAIN methylation and its possible role in autism. Aging (Albany NY). 2023; 15:12763–79. https://doi.org/10.18632/aging.205275 [PubMed]
  • 25. Ruzzo EK, Pérez-Cano L, Jung JY, Wang LK, Kashef-Haghighi D, Hartl C, Singh C, Xu J, Hoekstra JN, Leventhal O, Leppä VM, Gandal MJ, Paskov K, et al. Inherited and De Novo Genetic Risk for Autism Impacts Shared Networks. Cell. 2019; 178:850–66.e26. https://doi.org/10.1016/j.cell.2019.07.015 [PubMed]
  • 26. Bajaj V, Markandaya M, Krishna L, Kumar A. Paternal imprinting of the SLC22A1LS gene located in the human chromosome segment 11p15.5. BMC Genet. 2004; 5:13. https://doi.org/10.1186/1471-2156-5-13 [PubMed]
  • 27. Ugur C, Sertcelik M, Uneri O, Senses Dinc G, Sekmen E, Solmaz E. Evaluation of serum urotensin-II levels of children with ADHD and autism spectrum disorder. Anatolian Journal of Psychiatry. 2018; 18:7.
  • 28. Valentino F, Bruno LP, Doddato G, Giliberti A, Tita R, Resciniti S, Fallerini C, Bruttini M, Lo Rizzo C, Mencarelli MA, Mari F, Pinto AM, Fava F, et al. Exome Sequencing in 200 Intellectual Disability/Autistic Patients: New Candidates and Atypical Presentations. Brain Sci. 2021; 11:936. https://doi.org/10.3390/brainsci11070936 [PubMed]
  • 29. Homs A, Codina-Solà M, Rodríguez-Santiago B, Villanueva CM, Monk D, Cuscó I, Pérez-Jurado LA. Genetic and epigenetic methylation defects and implication of the ERMN gene in autism spectrum disorders. Transl Psychiatry. 2016; 6:e855. https://doi.org/10.1038/tp.2016.120 [PubMed]
  • 30. Jenkins TG, Aston KI, Cairns BR, Carrell DT. Paternal aging and associated intraindividual alterations of global sperm 5-methylcytosine and 5-hydroxymethylcytosine levels. Fertil Steril. 2013; 100:945–51. https://doi.org/10.1016/j.fertnstert.2013.05.039 [PubMed]
  • 31. Murphy SK, Itchon-Ramos N, Visco Z, Huang Z, Grenier C, Schrott R, Acharya K, Boudreau MH, Price TM, Raburn DJ, Corcoran DL, Lucas JE, Mitchell JT, et al. Cannabinoid exposure and altered DNA methylation in rat and human sperm. Epigenetics. 2018; 13:1208–21. https://doi.org/10.1080/15592294.2018.1554521 [PubMed]
  • 32. Schrott R, Acharya K, Itchon-Ramos N, Hawkey AB, Pippen E, Mitchell JT, Kollins SH, Levin ED, Murphy SK. Cannabis use is associated with potentially heritable widespread changes in autism candidate gene DLGAP2 DNA methylation in sperm. Epigenetics. 2020; 15:161–73. https://doi.org/10.1080/15592294.2019.1656158 [PubMed]
  • 33. Chertkow-Deutsher Y, Cohen H, Klein E, Ben-Shachar D. DNA methylation in vulnerability to post-traumatic stress in rats: evidence for the role of the post-synaptic density protein Dlgap2. Int J Neuropsychopharmacol. 2010; 13:347–59. https://doi.org/10.1017/S146114570999071X [PubMed]
  • 34. Rasmussen AH, Rasmussen HB, Silahtaroglu A. The DLGAP family: neuronal expression, function and role in brain disorders. Mol Brain. 2017; 10:43. https://doi.org/10.1186/s13041-017-0324-9 [PubMed]
  • 35. Sánchez AM, Delgado MS, Ciurana SG, Medrano J, Poo-Llanillo ME, Ishida M, Moore GE, Platas II, Simón C, Monk D. Epigenetic Symmetry of DLGAP2: Pre-Implantation Maternal Methylation Switches to a Random Monoallelic Profile in Somatic Tissues. OBM Genetics. 2018; 2:026. https://doi.org/10.21926/obm.genet.1803026
  • 36. Court F, Tayama C, Romanelli V, Martin-Trujillo A, Iglesias-Platas I, Okamura K, Sugahara N, Simón C, Moore H, Harness JV, Keirstead H, Sanchez-Mut JV, Kaneki E, et al. Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment. Genome Res. 2014; 24:554–69. https://doi.org/10.1101/gr.164913.113 [PubMed]
  • 37. Carreras-Gallo N, Dwaraka VB, Jima DD, Skaar DA, Mendez TL, Planchart A, Zhou W, Jirtle RL, Smith R, Hoyo C. Creation and Validation of the First Infinium DNA Methylation Array for the Human Imprintome. bioRxiv. 2024; 2024.01.15.575646. https://doi.org/10.1101/2024.01.15.575646 [PubMed]
  • 38. Garg P, Sharp AJ. Screening for rare epigenetic variations in autism and schizophrenia. Hum Mutat. 2019; 40:952–61. https://doi.org/10.1002/humu.23740 [PubMed]
  • 39. Li J, Chen W, Li D, Gu S, Liu X, Dong Y, Jin L, Zhang C, Li S. Conservation of Imprinting and Methylation of MKRN3, MAGEL2 and NDN Genes in Cattle. Animals (Basel). 2021; 11:1985. https://doi.org/10.3390/ani11071985 [PubMed]
  • 40. Chen H, Victor AK, Klein J, Tacer KF, Tai DJ, de Esch C, Nuttle A, Temirov J, Burnett LC, Rosenbaum M, Zhang Y, Ding L, Moresco JJ, et al. Loss of MAGEL2 in Prader-Willi syndrome leads to decreased secretory granule and neuropeptide production. JCI Insight. 2020; 5:e138576. https://doi.org/10.1172/jci.insight.138576 [PubMed]
  • 41. Gigliucci V, Busnelli M, Santini F, Paolini C, Bertoni A, Schaller F, Muscatelli F, Chini B. Oxytocin receptors in the Magel2 mouse model of autism: Specific region, age, sex and oxytocin treatment effects. Front Neurosci. 2023; 17:1026939. https://doi.org/10.3389/fnins.2023.1026939 [PubMed]
  • 42. HPA. Human Protein Atlas.
  • 43. Costa CIS, da Silva Montenegro EM, Zarrei M, de Sá Moreira E, Silva IMW, de Oliveira Scliar M, Wang JYT, Zachi EC, Branco EV, da Costa SS, Lourenço NCV, Vianna-Morgante AM, Rosenberg C, et al. Copy number variations in a Brazilian cohort with autism spectrum disorders highlight the contribution of cell adhesion genes. Clin Genet. 2022; 101:134–41. https://doi.org/10.1111/cge.14072 [PubMed]
  • 44. Leung YK, Ouyang B, Niu L, Xie C, Ying J, Medvedovic M, Chen A, Weihe P, Valvi D, Grandjean P, Ho SM. Identification of sex-specific DNA methylation changes driven by specific chemicals in cord blood in a Faroese birth cohort. Epigenetics. 2018; 13:290–300. https://doi.org/10.1080/15592294.2018.1445901 [PubMed]
  • 45. Maggio AG, Shu HT, Laufer BI, Bi C, Lai Y, LaSalle JM, Hu VW. Elevated exposures to persistent endocrine disrupting compounds impact the sperm methylome in regions associated with autism spectrum disorder. Front Genet. 2022; 13:929471. https://doi.org/10.3389/fgene.2022.929471 [PubMed]
  • 46. Liu Y, Di Y, Zheng Q, Qian Z, Fan J, Ren W, Wei Z, Tian Y. Altered expression of glycan patterns and glycan-related genes in the medial prefrontal cortex of the valproic acid rat model of autism. Front Cell Neurosci. 2022; 16:1057857. https://doi.org/10.3389/fncel.2022.1057857 [PubMed]
  • 47. Sands TT, Miceli F, Lesca G, Beck AE, Sadleir LG, Arrington DK, Schönewolf-Greulich B, Moutton S, Lauritano A, Nappi P, Soldovieri MV, Scheffer IE, Mefford HC, et al. Autism and developmental disability caused by KCNQ3 gain-of-function variants. Ann Neurol. 2019; 86:181–92. https://doi.org/10.1002/ana.25522 [PubMed]
  • 48. Satterstrom FK, Kosmicki JA, Wang J, Breen MS, De Rubeis S, An JY, Peng M, Collins R, Grove J, Klei L, Stevens C, Reichert J, Mulhern MS, et al, and Autism Sequencing Consortium, and iPSYCH-Broad Consortium. Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of Autism. Cell. 2020; 180:568–84.e23. https://doi.org/10.1016/j.cell.2019.12.036 [PubMed]
  • 49. Van Wauwe J, Kemps H, Vrancaert P, Mahy A, Schellingen R, Grootaert MOJ, Beerens M, Luttun A. PRDM16, a new kid on the block in cardiovascular health and disease. Cardiovasc Res. 2025; 121:1156–72. https://doi.org/10.1093/cvr/cvaf089 [PubMed]
  • 50. Baizabal JM, Mistry M, García MT, Gómez N, Olukoya O, Tran D, Johnson MB, Walsh CA, Harwell CC. The Epigenetic State of PRDM16-Regulated Enhancers in Radial Glia Controls Cortical Neuron Position. Neuron. 2018; 98:945–62.e8. https://doi.org/10.1016/j.neuron.2018.04.033 [PubMed]
  • 51. Umeda T, Takashima N, Nakagawa R, Maekawa M, Ikegami S, Yoshikawa T, Kobayashi K, Okanoya K, Inokuchi K, Osumi N. Evaluation of Pax6 mutant rat as a model for autism. PLoS One. 2010; 5:e15500. https://doi.org/10.1371/journal.pone.0015500 [PubMed]
  • 52. Yoshizaki K, Furuse T, Kimura R, Tucci V, Kaneda H, Wakana S, Osumi N. Paternal Aging Affects Behavior in Pax6 Mutant Mice: A Gene/Environment Interaction in Understanding Neurodevelopmental Disorders. PLoS One. 2016; 11:e0166665. https://doi.org/10.1371/journal.pone.0166665 [PubMed]
  • 53. Xing M, Zhang Q, Song W. GIGYF1-disturbed IGF-1R recycling: a potential contributor to autism spectrum disorder pathogenesis? J Clin Invest. 2022; 132:e163553. https://doi.org/10.1172/JCI163553 [PubMed]
  • 54. Sun W, Poschmann J, Cruz-Herrera Del Rosario R, Parikshak NN, Hajan HS, Kumar V, Ramasamy R, Belgard TG, Elanggovan B, Wong CCY, Mill J, Geschwind DH, Prabhakar S. Histone Acetylome-wide Association Study of Autism Spectrum Disorder. Cell. 2016; 167:1385–97.e11. https://doi.org/10.1016/j.cell.2016.10.031 [PubMed]
  • 55. Kubota T, Miyake K, Hirasawa T. Epigenetic understanding of gene-environment interactions in psychiatric disorders: a new concept of clinical genetics. Clin Epigenetics. 2012; 4:1. https://doi.org/10.1186/1868-7083-4-1 [PubMed]
  • 56. Plasschaert RN, Bartolomei MS. Tissue-specific regulation and function of Grb10 during growth and neuronal commitment. Proc Natl Acad Sci U S A. 2015; 112:6841–7. https://doi.org/10.1073/pnas.1411254111 [PubMed]
  • 57. Garfield AS, Cowley M, Smith FM, Moorwood K, Stewart-Cox JE, Gilroy K, Baker S, Xia J, Dalley JW, Hurst LD, Wilkinson LS, Isles AR, Ward A. Distinct physiological and behavioural functions for parental alleles of imprinted Grb10. Nature. 2011; 469:534–8. https://doi.org/10.1038/nature09651 [PubMed]
  • 58. Chiesa N, De Crescenzo A, Mishra K, Perone L, Carella M, Palumbo O, Mussa A, Sparago A, Cerrato F, Russo S, Lapi E, Cubellis MV, Kanduri C, et al. The KCNQ1OT1 imprinting control region and non-coding RNA: new properties derived from the study of Beckwith-Wiedemann syndrome and Silver-Russell syndrome cases. Hum Mol Genet. 2012; 21:10–25. https://doi.org/10.1093/hmg/ddr419 [PubMed]
  • 59. Mayo S, Monfort S, Roselló M, Oltra S, Orellana C, Martínez F. In Pursuit of New Imprinting Syndromes by Epimutation Screening in Idiopathic Neurodevelopmental Disorder Patients. Biomed Res Int. 2015; 2015:341986. https://doi.org/10.1155/2015/341986 [PubMed]
  • 60. Pauli S, Berger H, Ufartes R, Borchers A. Comparing a Novel Malformation Syndrome Caused by Pathogenic Variants in FBRSL1 to AUTS2 Syndrome. Front Cell Dev Biol. 2021; 9:779009. https://doi.org/10.3389/fcell.2021.779009 [PubMed]
  • 61. Bukvic N, De Rinaldis M, Chetta M, Trabacca A, Bassi MT, Marsano RM, Holoubkova L, Rivieccio M, Oro M, Resta N, Kerkhof J, Sadikovic B, Viggiano L. De Novo Pathogenic Variant in FBRSL1, Non OMIM Gene Paralogue AUTS2, Causes a Novel Recognizable Syndromic Manifestation with Intellectual Disability; An Additional Patient and Review of the Literature. Genes (Basel). 2024; 15:826. https://doi.org/10.3390/genes15070826 [PubMed]
  • 62. Terrinoni A, Micheloni G, Moretti V, Caporali S, Bernardini S, Minieri M, Pieri M, Giaroni C, Acquati F, Costantino L, Ferrara F, Valli R, Porta G. OTX Genes in Adult Tissues. Int J Mol Sci. 2023; 24:16962. https://doi.org/10.3390/ijms242316962 [PubMed]
  • 63. Huang B, Li X, Tu X, Zhao W, Zhu D, Feng Y, Si X, Chen JG. OTX1 regulates cell cycle progression of neural progenitors in the developing cerebral cortex. J Biol Chem. 2018; 293:2137–48. https://doi.org/10.1074/jbc.RA117.001249 [PubMed]
  • 64. Redies C, Hertel N, Hübner CA. Cadherins and neuropsychiatric disorders. Brain Res. 2012; 1470:130–44. https://doi.org/10.1016/j.brainres.2012.06.020 [PubMed]
  • 65. Luedi PP, Dietrich FS, Weidman JR, Bosko JM, Jirtle RL, Hartemink AJ. Computational and experimental identification of novel human imprinted genes. Genome Res. 2007; 17:1723–30. https://doi.org/10.1101/gr.6584707 [PubMed]
  • 66. Clair A, Baker E, Kumari M. Are housing circumstances associated with faster epigenetic ageing? J Epidemiol Community Health. 2023; 78:40–6. https://doi.org/10.1136/jech-2023-220523 [PubMed]
  • 67. Wang L, Chen M, Yan G, Zhao S. DNA Methylation Differences Between Zona Pellucida-Bound and Manually Selected Spermatozoa Are Associated With Autism Susceptibility. Front Endocrinol (Lausanne). 2021; 12:774260. https://doi.org/10.3389/fendo.2021.774260 [PubMed]
  • 68. Schubert C. Male biological clock possibly linked to autism, other disorders. Nat Med. 2008; 14:1170. https://doi.org/10.1038/nm1108-1170a [PubMed]
  • 69. Wu S, Wu F, Ding Y, Hou J, Bi J, Zhang Z. Advanced parental age and autism risk in children: a systematic review and meta-analysis. Acta Psychiatr Scand. 2017; 135:29–41. https://doi.org/10.1111/acps.12666 [PubMed]
  • 70. Levine ME, Lu AT, Quach A, Chen BH, Assimes TL, Bandinelli S, Hou L, Baccarelli AA, Stewart JD, Li Y, Whitsel EA, Wilson JG, Reiner AP, et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany NY). 2018; 10:573–91. https://doi.org/10.18632/aging.101414 [PubMed]
  • 71. Loke YJ, Hannan AJ, Craig JM. The Role of Epigenetic Change in Autism Spectrum Disorders. Front Neurol. 2015; 6:107. https://doi.org/10.3389/fneur.2015.00107 [PubMed]
  • 72. Zhou J, Park CY, Theesfeld CL, Wong AK, Yuan Y, Scheckel C, Fak JJ, Funk J, Yao K, Tajima Y, Packer A, Darnell RB, Troyanskaya OG. Whole-genome deep-learning analysis identifies contribution of noncoding mutations to autism risk. Nat Genet. 2019; 51:973–80. https://doi.org/10.1038/s41588-019-0420-0 [PubMed]
  • 73. Litman A, Sauerwald N, Green Snyder L, Foss-Feig J, Park CY, Hao Y, Dinstein I, Theesfeld CL, Troyanskaya OG. Decomposition of phenotypic heterogeneity in autism reveals underlying genetic programs. Nat Genet. 2025; 57:1611–9. https://doi.org/10.1038/s41588-025-02224-z [PubMed]
  • 74. World Health Organization Laboratory Manual for the Examination and Processing of Human Semen. WHO Press: Geneva, Switzerland; 2010; 287.
  • 75. Bibikova M, Barnes B, Tsan C, Ho V, Klotzle B, Le JM, Delano D, Zhang L, Schroth GP, Gunderson KL, Fan JB, Shen R. High density DNA methylation array with single CpG site resolution. Genomics. 2011; 98:288–95. https://doi.org/10.1016/j.ygeno.2011.07.007 [PubMed]
  • 76. Morris TJ, Butcher LM, Feber A, Teschendorff AE, Chakravarthy AR, Wojdacz TK, Beck S. ChAMP: 450k Chip Analysis Methylation Pipeline. Bioinformatics. 2014; 30:428–30. https://doi.org/10.1093/bioinformatics/btt684 [PubMed]
  • 77. Teschendorff AE, Marabita F, Lechner M, Bartlett T, Tegner J, Gomez-Cabrero D, Beck S. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013; 29:189–96. https://doi.org/10.1093/bioinformatics/bts680 [PubMed]
  • 78. Nye MD, Hoyo C, Huang Z, Vidal AC, Wang F, Overcash F, Smith JS, Vasquez B, Hernandez B, Swai B, Oneko O, Mlay P, Obure J, et al. Associations between methylation of paternally expressed gene 3 (PEG3), cervical intraepithelial neoplasia and invasive cervical cancer. PLoS One. 2013; 8:e56325. https://doi.org/10.1371/journal.pone.0056325 [PubMed]
  • 79. Murphy SK, Huang Z, Hoyo C. Differentially methylated regions of imprinted genes in prenatal, perinatal and postnatal human tissues. PLoS One. 2012; 7:e40924. https://doi.org/10.1371/journal.pone.0040924 [PubMed]
  • 80. Krausz C, Sandoval J, Sayols S, Chianese C, Giachini C, Heyn H, Esteller M. Novel insights into DNA methylation features in spermatozoa: stability and peculiarities. PLoS One. 2012; 7:e44479. https://doi.org/10.1371/journal.pone.0044479 [PubMed]
  • 81. Kruppa J, Sieg M, Richter G, Pohrt A. Estimands in epigenome-wide association studies. Clin Epigenetics. 2021; 13:98. https://doi.org/10.1186/s13148-021-01083-9 [PubMed]
  • 82. Xie C, Leung YK, Chen A, Long DX, Hoyo C, Ho SM. Differential methylation values in differential methylation analysis. Bioinformatics. 2019; 35:1094–7. https://doi.org/10.1093/bioinformatics/bty778 [PubMed]
  • 83. Chen YA, Lemire M, Choufani S, Butcher DT, Grafodatskaya D, Zanke BW, Gallinger S, Hudson TJ, Weksberg R. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013; 8:203–9. https://doi.org/10.4161/epi.23470 [PubMed]
  • 84. Hansen KD. IlluminaHumanMethylation450kanno.ilmn12.hg19: Annotation for Illumina's 450k methylation arrays. R package version 0.6.0. 2016.
  • 85. Feinberg JI, Bakulski KM, Jaffe AE, Tryggvadottir R, Brown SC, Goldman LR, Croen LA, Hertz-Picciotto I, Newschaffer CJ, Fallin MD, Feinberg AP. Paternal sperm DNA methylation associated with early signs of autism risk in an autism-enriched cohort. Int J Epidemiol. 2015; 44:1199–210. https://doi.org/10.1093/ije/dyv028 [PubMed]
  • 86. Bethesda (MD): National Library of Medicine (US) NCfBI. 2004.
  • 87. Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, Irizarry RA. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014; 30:1363–9. https://doi.org/10.1093/bioinformatics/btu049 [PubMed]
  • 88. Pidsley R, Y Wong CC, Volta M, Lunnon K, Mill J, Schalkwyk LC. A data-driven approach to preprocessing Illumina 450K methylation array data. BMC Genomics. 2013; 14:293. https://doi.org/10.1186/1471-2164-14-293 [PubMed]
  • 89. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43:e47. https://doi.org/10.1093/nar/gkv007 [PubMed]
  • 90. Maksimovic J, Oshlack A, Phipson B. Gene set enrichment analysis for genome-wide DNA methylation data. Genome Biol. 2021; 22:173. https://doi.org/10.1186/s13059-021-02388-x [PubMed]