Back to Home Back to Research

A spatially-aware unsupervised pipeline to identify co-methylation regions in DNA methylation data

Siddhant Meshram, Arindam Fadikar, Ganesan Arunkumar, and Suvo Chatterjee

bioRxiv preprint (posted November 14, 2025)

bioRxiv, New Results, 2025

SACOMA framework for co-methylation region discovery
Figure: Spatially-aware clustering concept used by SACOMA to identify co-methylated regions.

Abstract

DNA methylation data are high-dimensional and spatially structured, which creates substantial multiple-testing burden and can reduce reproducibility in association analyses.

This work introduces SACOMA (Spatially-Aware Clustering for Co-Methylation Analysis), a flexible and unsupervised pipeline that identifies co-methylated regions by combining genomic proximity and methylation similarity through spatially constrained hierarchical clustering.

A tunable, data-adaptive mixing parameter allows SACOMA to avoid rigid threshold assumptions while remaining robust across settings. Although developed for DNAm array studies, the method is generalizable to other spatially dependent data domains.

In simulations and population-level DNAm analyses, SACOMA showed strong sensitivity with effective false-positive control and identified biologically meaningful co-regulated regions, improving both specificity and discovery.

Citation

BibTeX
@article{meshram2025sacoma,
  title={A spatially-aware unsupervised pipeline to identify co-methylation regions in DNA methylation data},
  author={Meshram, Siddhant and Fadikar, Arindam and Arunkumar, Ganesan and Chatterjee, Suvo},
  journal={bioRxiv},
  year={2025},
  doi={10.1101/2025.11.13.688356},
  note={Preprint}
}

Related Publications