TY - JOUR
T1 - PyAerial
T2 - Scalable association rule mining from tabular data
AU - Karabulut, Erkan
AU - Groth, Paul
AU - Degeler, Victoria
N1 - Publisher Copyright:
© 2025 The Authors
PY - 2025/9
Y1 - 2025/9
N2 - Association Rule Mining (ARM) is a knowledge discovery technique that identifies frequent patterns as logical implications within transaction datasets and has been applied across domains such as e-commerce, healthcare, and cyber–physical systems. However, many state-of-the-art ARM methods, typically algorithmic or nature-inspired, suffer from rule explosion and long execution times. Aerial is a novel neurosymbolic ARM algorithm for tabular datasets that mitigates rule explosion using neural networks, while remaining compatible with existing approaches. Aerial transforms tables into transactions, uses an autoencoder to learn compact neural representations, and extracts logical rules from the neural representations. This paper presents PyAerial, a Python library that makes Aerial accessible and easy to use on generic tabular datasets for end users in a domain-independent way. Besides association rules, PyAerial can also be used to extract frequent itemsets, learn classification rules, apply item constraints to learn rules over the features of interest rather than all features, pre-discretize numerical data for ARM, and can be run on a GPU.
AB - Association Rule Mining (ARM) is a knowledge discovery technique that identifies frequent patterns as logical implications within transaction datasets and has been applied across domains such as e-commerce, healthcare, and cyber–physical systems. However, many state-of-the-art ARM methods, typically algorithmic or nature-inspired, suffer from rule explosion and long execution times. Aerial is a novel neurosymbolic ARM algorithm for tabular datasets that mitigates rule explosion using neural networks, while remaining compatible with existing approaches. Aerial transforms tables into transactions, uses an autoencoder to learn compact neural representations, and extracts logical rules from the neural representations. This paper presents PyAerial, a Python library that makes Aerial accessible and easy to use on generic tabular datasets for end users in a domain-independent way. Besides association rules, PyAerial can also be used to extract frequent itemsets, learn classification rules, apply item constraints to learn rules over the features of interest rather than all features, pre-discretize numerical data for ARM, and can be run on a GPU.
KW - Association rule mining
KW - Interpretable machine learning
KW - Neurosymbolic ai
KW - Tabular data
UR - https://www.scopus.com/pages/publications/105015836837
U2 - 10.1016/j.softx.2025.102341
DO - 10.1016/j.softx.2025.102341
M3 - Artículo
AN - SCOPUS:105015836837
SN - 2352-7110
VL - 31
JO - SoftwareX
JF - SoftwareX
M1 - 102341
ER -