Which tool is best for data mining?

Question added by Syed Ilyaz, BI Manager, Syren Technology
Date posted: 2015/12/26
Deleted user
by Deleted user

RapidMiner and Weka are good tools for data mining, but they have some weaknesses in cleansing and preparing data before building models. R and Python with scikit-learn are excellent tools for data mining. R is more popular than Python, although I prefer using Python with scikit-learn. In general, if you are a good programmer, it is preferable to use R or Python; if not, Weka or RapidMiner is fine.
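To illustrate the cleansing and preparation step mentioned above, here is a minimal sketch in Python with scikit-learn. The toy data, the choice of median imputation, and the logistic regression model are all assumptions made for illustration; they are not taken from the answer.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Toy data (made up for illustration): two numeric features, some values missing.
X = np.array([
    [1.0, 200.0],
    [2.0, np.nan],
    [np.nan, 180.0],
    [8.0, 20.0],
    [9.0, np.nan],
    [10.0, 30.0],
])
y = np.array([0, 0, 0, 1, 1, 1])

# Chain the cleansing steps and the model so they are always applied together:
# 1) fill missing values with the column median,
# 2) rescale each feature to zero mean and unit variance,
# 3) fit a simple classifier.
model = make_pipeline(
    SimpleImputer(strategy="median"),
    StandardScaler(),
    LogisticRegression(),
)
model.fit(X, y)

# Predict on the same rows, just to show the pipeline runs end to end.
predictions = model.predict(X)
print(predictions)
```

Keeping the preparation steps inside the pipeline means the same imputation and scaling are reused whenever the model sees new data.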

Seyed Yahya Moradi
by Seyed Yahya Moradi, biomedical engineer, ghods hospital

RapidMiner (formerly known as YALE)

Written in the Java programming language, this tool offers advanced analytics through template-based frameworks. A bonus: users hardly have to write any code. Offered as a service rather than as a piece of local software, this tool holds the top position on the list of data mining tools.

In addition to data mining, RapidMiner also provides functionality such as data preprocessing and visualization, predictive analytics and statistical modeling, evaluation, and deployment. What makes it even more powerful is that it provides learning schemes, models, and algorithms from WEKA and R scripts.

WEKA

The original non-Java version of WEKA was developed primarily for analyzing data from the agricultural domain. The Java-based version is very sophisticated and is used in many different applications, including visualization and algorithms for data analysis and predictive modeling. It's free under the GNU General Public License, which is a big plus compared to RapidMiner, because users can customize it however they please.

 WEKA supports several standard data mining tasks, including data preprocessing, clustering, classification, regression, visualization and feature selection. WEKA would be more powerful with the addition of sequence modeling, which currently is not included.

Badr-Eddine ADNANI
by Badr-Eddine ADNANI, Market Insight Manager, Sage Software

SPSS Clementine is one of the best and most user-friendly data mining packages.

Evan Wyse
by Evan Wyse, Scientist, MaxPoint

It depends strongly upon your needs. 

 

If you'd like to do machine learning on a local machine with small amounts of data (less than 100,000 rows), Weka is an excellent tool with an accessible interface. 

If you'd like to do something a little more advanced, R is a popular statistical package/programming language that is extremely flexible. 

Finally, a number of software packages written in Python (a general-purpose programming language) are extremely popular for large-scale machine learning. It's what my company primarily uses. Specifically, we use scikit-learn, statsmodels, pandas, and numpy.
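As a rough sketch of how some of these packages fit together, the example below cleans a small table with pandas and clusters it with scikit-learn. The column names (`visits`, `spend`), the values, and the choice of k-means are hypothetical and only meant to show the workflow, not what any particular company does.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Made-up customer records: one exact duplicate row and one missing value.
raw = pd.DataFrame({
    "visits": [1, 2, 2, 20, 22, np.nan],
    "spend": [10.0, 12.0, 12.0, 300.0, 310.0, 290.0],
})

# pandas: drop the duplicate row, then fill the missing value with the median.
clean = raw.drop_duplicates().copy()
clean["visits"] = clean["visits"].fillna(clean["visits"].median())

# numpy + scikit-learn: convert to an array and group the rows into two clusters.
features = clean[["visits", "spend"]].to_numpy()
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(features)

clean["cluster"] = labels
print(clean)
```

The cluster numbers themselves are arbitrary; what matters is which rows end up grouped together.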

All of the tools discussed here are 'open source', meaning that they're free to use. 

Peter kamoga
by Peter kamoga, Fire Fighter, KI & KA

RapidMiner (formerly known as YALE). Written in the Java programming language, this tool offers advanced analytics through template-based frameworks. A bonus: users hardly have to write any code. Offered as a service rather than as a piece of local software, this tool holds the top position on the list of data mining tools.

SUBODH KUMAR OJHA
by SUBODH KUMAR OJHA, Sr. Cloud Solution Architect - Data & AI, Microsoft

R is the best tool for data mining.

somayeh davari
by somayeh davari, Inbound Marketer / Digital Marketing Specialist, Mahdi Sewing Machine store

Weka is one of the best tools for data mining. 

It's easy to use, effective and fast.
