2000 DataMiningPracticalMLToolsWithJava: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
m (Text replace - "==Notes ==" to "==Notes==")
 
m (Text replace - "ms]] " to "m]]s ")
Line 57: Line 57:
Ordinal attributes are generally called ''numeric'', or perhaps ''continuous'', but without the implication of mathematical continuity. A special case of the nominal scale is the ''dichotomy'', which has only two members - often designates are ''true'' and ''false'', or ''yes'' and ''no'' in the weather data. Such attributes are sometimes called ''boolean''.
Ordinal attributes are generally called ''numeric'', or perhaps ''continuous'', but without the implication of mathematical continuity. A special case of the nominal scale is the ''dichotomy'', which has only two members - often designates are ''true'' and ''false'', or ''yes'' and ''no'' in the weather data. Such attributes are sometimes called ''boolean''.


[[Machine Learning System|Machine learning systems]] can use a wide variety of other information about attributes. For instance, dimensional considerations could be used to restrict the search to expressions or comparisons that are dimensionally correct. Circular ordering could affect the kinds of tests that are considered. For example, in a temporal context, tests on a <code>day</code> attribute could involve <code>next day, previous day, next week, same day next week</code>. Partial orderings, that, generalize/specialization relations, frequently occur in practical situations. Information this kind is often referred to as ''metadata'', data about data. However, the kind of practical schemes currently used for data mining are rarely capable of taking metadata into account, although it is likely that these capabilities will develop rapidly in the future.
[[Machine Learning System|Machine learning system]]s can use a wide variety of other information about attributes. For instance, dimensional considerations could be used to restrict the search to expressions or comparisons that are dimensionally correct. Circular ordering could affect the kinds of tests that are considered. For example, in a temporal context, tests on a <code>day</code> attribute could involve <code>next day, previous day, next week, same day next week</code>. Partial orderings, that, generalize/specialization relations, frequently occur in practical situations. Information this kind is often referred to as ''metadata'', data about data. However, the kind of practical schemes currently used for data mining are rarely capable of taking metadata into account, although it is likely that these capabilities will develop rapidly in the future.


====Missing values====
====Missing values====

Revision as of 01:31, 21 March 2014

Subject Headings: Data Mining Text Book.

Notes

Quotes

Book Overview

This book offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. Inside, you'll learn all you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining including both tried-and-true techniques of the past and Java-based methods at the leading edge of contemporary research. If you're involved at any level in the work of extracting usable knowledge from large collections of data, this clearly written and effectively illustrated book will prove an invaluable resource. Complementing the authors' instruction is a fully functional platform-independent Java software system for machine learning, available for download. Apply it to the sample data sets provided to refine your data mining skills, apply it to your own data to discern meaningful patterns and generate valuable insights, adapt it for your specialized data mining applications, or use it to develop your own machine learning schemes.,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2000 DataMiningPracticalMLToolsWithJavaIan H. Witten
Eibe Frank
Data Mining: Practical Machine Learning Tools and Techniques with Java implementationshttp://books.google.com/books/elsevier?id=6lVEKlrTq8EC2000