A Scalable Python-Based Architecture for Causal Structure Learning in Non-Gaussian Linear Systems Using the PyCD-LiNGAM Framework

Dr. Sara Mohammadi

Open Access

A Scalable Python-Based Architecture for Causal Structure Learning in Non-Gaussian Linear Systems Using the PyCD-LiNGAM Framework

PDF

Dr. Sara Mohammadi ¹ ,

⁴ Faculty of Computer Science, Sharif University of Technology, Tehran, Iran

Abstract

Causal structure learning in high-dimensional systems remains a fundamental challenge in modern machine learning and statistical inference, particularly when underlying data-generating processes deviate from Gaussian assumptions. Linear Non-Gaussian Acyclic Models (LiNGAM) provide a principled framework for identifying causal directions using non-Gaussianity as an identification condition. However, scalability, computational efficiency, and reproducibility issues continue to limit their practical adoption in large-scale data environments. This study proposes a scalable Python-based architecture implemented through the PyCD-LiNGAM framework to address these limitations by integrating modular computation, optimized matrix operations, and automated causal graph discovery pipelines.

The proposed framework builds upon prior theoretical advancements in causal discovery and graphical modeling, particularly greedy structure learning strategies (Chickering, 2002), probabilistic graphical modeling principles (Drton & Maathuis, 2017), and linear non-Gaussian causal identification theory (Entner & Hoyer, 2011). Furthermore, it incorporates semiparametric inference perspectives for handling latent confounding structures (Bhattacharya et al., 2020). The system is evaluated conceptually for scalability, robustness to noise, and interpretability in non-Gaussian environments.

Results indicate that modular Python-based causal pipelines significantly enhance computational tractability while preserving theoretical identifiability guarantees under non-Gaussian assumptions. The study contributes a unified computational architecture bridging theoretical causal discovery models with practical implementation constraints, enabling reproducible and scalable causal inference workflows.

Keywords

Causal discovery, LiNGAM, non-Gaussian models, Python framework

References

Bhattacharya, R., Nabi, R., & Shpitser, I. (2020). Semiparametric inference for causal effects in graphical models with hidden variables. arXiv preprint arXiv:2003.12659.

Campomanes, P., Neri, M., Horta, B. A. C., Roehrig, U. F., Vanni, S., Tavernelli, I., & Rothlisberger, U. (2014). Origin of the spectral shifts among the early intermediates of the rhodopsin photocycle. Journal of the American Chemical Society, 136(10), 3842-3851.

Chickering, D. M. (2002). Optimal structure identification with greedy search. Journal of Machine Learning Research, 3, 507-554.

Drton, M., & Maathuis, M. H. (2017). Structure learning in graphical modeling. Annual Review of Statistics and Its Application, 4, 365-393.

Entner, D., & Hoyer, P. O. (2011). Discovering unconfounded causal relationships using linear non-Gaussian models. In New Frontiers in Artificial.

Similar Articles

Liang Wu, Anita Sari, PYCD-LINGAM: A PYTHON FRAMEWORK FOR CAUSAL INFERENCE WITH NON-GAUSSIAN LINEAR MODELS , International Journal of Intelligent Data and Machine Learning: Vol. 2 No. 07 (2025): Volume 02 Issue 07
Prof. Elena M. Petrova, A Python Framework for Causal Discovery in Non-Gaussian Linear Models: The PyCD-LiNGAM Library , International Journal of Intelligent Data and Machine Learning: Vol. 2 No. 08 (2025): Volume 02 Issue 08
Dr. Elias R. Hoffmann, Predictive Behavioral Cybersecurity for Smart Healthcare and Mobile Ecosystems: An Ensemble Machine Learning Framework for Dynamic Malware Intelligence , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 01 (2026): Volume 03 Issue 01
Dr. Arman V. Solberg, Prof. Elina K. Marovic, Machine Learning Approaches for Detecting Interventions and Conditions to Elevate Power Utilization in Established Facilities , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 04 (2026): Volume 03 Issue 04
Dr. Lucas Vermeulen, Sophie De Smet, Dr. Thomas Dubois, Integrated Temporal Analytics and AI-Based Approaches for Predicting Culinary Ingredient Consumption Patterns: Evidence from Thai Markets , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 04 (2026): Volume 03 Issue 04
Dr. Alexei V. Morozov, Dr. Elena S. Petrova, Identification of Harmful Programs Using a Fusion of Deep Feature Extraction Networks and Context-Aware Sequential Modeling Techniques , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 04 (2026): Volume 03 Issue 04
Dr. James William Carter, Dr. Emily Rose Thompson, A Hybrid Quantum–Classical Deep Learning Approach for Image Recognition: Performance Analysis of Quanvolution-Based Convolutional Models , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 06 (2026): Volume 03 Issue 06
Elias J. Vance, Clara M. Soto, High-Frequency Data Driven Network Learning for Systemic Risk Analysis in Financial Markets , International Journal of Intelligent Data and Machine Learning: Vol. 2 No. 09 (2025): Volume 02 Issue 09
Prof. Jiao L. Shen, Kwa Kai Ming, A Hybrid Sentiment-Aware Machine Learning Framework for Real-Time Dynamic Pricing in E-Commerce. , International Journal of Intelligent Data and Machine Learning: Vol. 2 No. 11 (2025): Volume 02 Issue 11
Dr. Javier M. Ortega, Dr. Lucia Fernández-Ríos, Predictive Modeling of Online Retail Revenue Using Data Exploration and Intelligent Algorithms , International Journal of Intelligent Data and Machine Learning: Vol. 3 No. 04 (2026): Volume 03 Issue 04

1-10 of 41 Next

You may also start an advanced similarity search for this article.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors retain the copyright of their manuscripts, and all Open Access articles are disseminated under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which licenses unrestricted use, distribution, and reproduction in any medium, provided that the original work is appropriately cited. The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.

International Journal of Intelligent Data and Machine Learning

A Scalable Python-Based Architecture for Causal Structure Learning in Non-Gaussian Linear Systems Using the PyCD-LiNGAM Framework

Abstract

Keywords

References

Similar Articles