RapidMiner System

Jump to navigation Jump to search

A RapidMiner System is a Java-based open source system data mining and text mining system.



  • (Wikipedia, 2013) ⇒ http://en.wikipedia.org/wiki/RapidMiner
    • RapidMiner is a software platform developed by the company of the same name that provides an integrated environment for machine learning, data mining, text mining, predictive analytics and business analytics. It is used for business and industrial applications as well as for research, education, training, rapid prototyping, and application development and supports all steps of the data mining process including results visualization, validation and optimization.[1] RapidMiner is developed on a business source model which means the core and earlier versions of the software are available under an OSI-certified open source license on Sourceforge.[2] A Starter Edition is available for free download, a Personal Edition is offered for US$999, a Professional Edition is $2,999 and pricing for the Enterprise Edition is available from the developer.[3]

  • http://en.wikipedia.org/wiki/RapidMiner#Description
    • RapidMiner uses a client/server model with the server offered as Software as a Service or on cloud infrastructures.[1]

      According to Bloor Research, RapidMiner provides 99% of an advanced analytical solution through template-based frameworks that speed delivery and reduce errors by nearly eliminating the need to write code. RapidMiner provides data mining and machine learning procedures including: data loading and transformation (Extract, transform, load (ETL)), data preprocessing and visualization, predictive analytics and statistical modeling, evaluation, and deployment. RapidMiner is written in the Java programming language. RapidMiner provides a GUI to design and execute analytical workflows. Those workflows are called “Process” in RapidMiner and they consist of multiple “Operators”. Each operator is performing a single task within the process and the output of each operator forms the input of the next one. Alternatively, the engine can be called from other programs or used as an API. Individual functions can be called from the command line. RapidMiner provides learning schemes and models and algorithms from Weka and R scripts that can be used through extensions.[2]

      RapidMiner functionality can be extended with additional plugins. The Rapid Miner Extensions marketplace provides a platform for developers to create data analysis algorithms and publish them to the community.[3] RapidMiner is distributed under the AGPL open source license and has been hosted by SourceForge where it is rated the #1 business analytics software. Commercial licenses and hosting are offered by RapidMiner.[4]

      With version 6.0, RapidMiner started to offer new application wizards addressed to business analysts needs for predictive analytics.[5]

  1. David Norris, “RapidMiner - a potential game changer,” IT-Director.com, November 22, 2013.
  2. David Norris, “RapidMiner - a potential game changer,” Bloor Research, November 13, 2013.
  3. Ajay Ohri, “Interview with Rapid-I Ingo Mierswa and Simon Fischer,” KDnuggets, August 2011.
  4. RapidMiner,” Sourceforget.net.
  5. RapidMiner 6 Review, Butler Analytics, November 22, 2013.


  • (Mierswa et al., 2006) ⇒ I. Mierswa, M. Wurst, Ralf Klinkenberg, M. Scholz, and T. Euler. (2006). “YALE: Rapid prototyping for complex data mining tasks.” In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2006).