Discovering Association Rules in High-Dimensional Small Tabular Data

Open Access
Authors
Publication date 2025
Host editors
  • A. Agiollo
  • E. Bardhi
  • G. Ciatto
  • S. Dumancic
  • G. Marra
Book title Proceedings of the 1st International Workshop on Advanced Neuro-Symbolic Applications
Book subtitle co-located with the 28th European Conference on Artificial Intelligence (ECAI 2025) : Bologna, Italy, October 26, 2025
Series CEUR Workshop Proceedings
Event 1st International Workshop on Advanced Neuro-Symbolic Applications
Pages (from-to) 105-113
Number of pages 9
Publisher Aachen: CEUR-WS
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Association Rule Mining (ARM) aims to discover patterns between features in datasets in the form of propositional rules, supporting both knowledge discovery and interpretable machine learning in high-stakes decision-making. However, in high-dimensional settings, rule explosion and computational overhead render popular algorithmic approaches impractical without effective search space reduction—challenges that propagate to downstream tasks. Neurosymbolic methods, such as Aerial+, have recently been proposed to address the rule explosion in ARM. While they tackle the high-dimensionality of the data, they also inherit limitations of neural networks, particularly reduced performance in low-data regimes. This paper makes three key contributions to association rule discovery in high-dimensional tabular data. First, we empirically show that Aerial+ scales one to two orders of magnitude better than state-of-the-art algorithmic and neurosymbolic baselines across five real-world datasets. Second, we introduce the novel problem of ARM in high-dimensional, low data settings, such as gene expression data from the biomedicine domain with ~18K features and ~50 samples. Third, we propose two fine-tuning approaches to Aerial+ using tabular foundation models. Our proposed approaches are shown to significantly improve rule quality on five real-world datasets, demonstrating their effectiveness in low-data, high-dimensional scenarios.
Document type Conference contribution
Language English
Published at https://ceur-ws.org/Vol-4125/paper_26.pdf
Other links https://ceur-ws.org/Vol-4125
Downloads
paper_26 (Final published version)
Permalink to this page
Back