Sipina Research Edition
Platforms Available
This software is available only for the Windows environment.
Where to get
Sipina Research Edition is available for free and can be download on the website below:
http://eric.univ-lyon2.fr/~ricco/sipina.html
There is also documentation available in both English and French.
Tree Methods
Unlike the earlier version of Sipina, this version implements other data mining methods such as neural networks. The following decision tree algorithms are available:
- A limited search induction tree: J. CATLETT, "Megainduction : machine learning on very large databases", PhD thesis, University of Sidney, 1991.
- ID3-IV: J.R. QUINLAN, "Induction of decision trees", Machine Learning, 1:81-106, 1986.
- GID3: J. CHENG, U. FAYYAD, K. IRANI, Z. QIAN, "Improved decision trees : a generalized version of ID3", in Proceedings of the 5th ICML, pp.100-108, 1988.
- ASSISTANT 86: B. CESTNIK, I. KONONENKO, I. BRATKO, "Assitant-96 : a knowledge elicitation tool for sophisticated users", in I. Bratko and N. Lavrac editors, Progress in Machine Learning, 1987.
- CHAID: G.V. KASS, "An exploratory technique for investigating large quantities of categorical data", Applied Statistics, 29(2):119-127, 1980.
- C4.5: J.R. QUINLAN, "C4.5 : Programs for Machine Learning", Morgan Kaufmann, 1993.
- Improved C4.5: R. RAKOTOMALALA, S. LALLICH, "Handling noise with generalized entropy of type beta in induction graphs algorithm", in Proceedings of International Conference on Computer Science and Informatics, pp. 25-27, 1998.
- SIPINA: D. ZIGHED, "Sipina : Méthode et logiciel", Lacassagne, 1992.
- Improved CHAID (Tschuprow goodness of split): R. RAKOTOMALALA, D. ZIGHED, "Mesures d'association dans les graphes d'induction : une approche statistique de l'arbitrage généralité-précision", in Proceedings of AIDRI'97, pp.131-134, 1997.
- Cost Sensitive Decision Tree: J-H. CHAUCHAT, R. RAKOTOMALALA, M. CARLOZ, C. PELLETIER, "Targeting Customer Groups using Gain and Cost Matrix : a Marketing Application", in Data Mining for Marketing Applications (Working Notes), PKDD'2001, pp. 1-13, September 2001.
- Cost Sensitive C4.5: J-H. CHAUCHAT, R. RAKOTOMALALA, M. CARLOZ, C. PELLETIER, "Targeting Customer Groups using Gain and Cost Matrix : a Marketing Application", in Data Mining for Marketing Applications (Working Notes), PKDD'2001, pp. 1-13, September 2001.
Notes
This version of Sipina is easy to use, and its data management functions are quite convenient. It was not chosen as it is no longer being updated or maintained and the last version was developed in 2000. It is being replaced by a new software package called Tanagra.