Skip to content
You are not logged in |Login  
     
Limit search to available items
Record:   Prev Next
Resources
More Information
Bestseller
BestsellerE-book
Author Chisholm, Andrew, 1959- author.

Title Exploring data with RapidMiner : explore, understand, and prepare real data using rapidminer's practical tips and tricks / Andrew Chisholm.

Publication Info. Birmingham, UK : Packt Publishing, 2013.

Item Status

Description 1 online resource (iv, 148 pages) : illustrations.
Physical Medium polychrome
Description text file
Series Community experience distilled
Community experience distilled.
Note Includes index.
Summary A step-by-step tutorial style using examples so that users of different levels will benefit from the facilities offered by RapidMiner. If you are a computer scientist or an engineer who has real data from which you want to extract value, this book is ideal for you. You will need to have at least a basic awareness of data mining techniques and some exposure to RapidMiner.
Contents Cover; Copyright; Credits; About the Author; About the Reviewer; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Setting the Scene; A process framework; Data volume and velocity; Datavariety, formats, and meanings; Missing data; Cleaning data; Visualizing data; Resource constraints; Terminology; Accompanying material; Summary; Chapter 2: Loading Data; Reading files; Alternative delimiters; Reading complete lines; Reading large numbers of attributes; Splitting files into smaller pieces; Databases; The Read Database operator; Large datasets; Using macros; Summary.
Chapter 3: Visualizing DataGetting started; Statistical summaries; Relationships between attributes; Scatter plots; Scatter 3D color; Parallel and deviation; Quartile color; Time series data; Plotting series; Using the survey plotter; Relations between examples; Using histograms; Using block plots; Summary; Chapter 4: Parsing and Converting Attributes; Generating attributes; Date functions; Regular expression functions; Generating extracts; Regular expressions; XPath; Renaming attributes; Searching and replacing attribute values; Using the Map operator; Using the Replace operator.
Using Replace (Dictionary)Summary; Chapter 5: Outliers; Manual inspection; Increasing the data volume; Rules for handling outliers; Automated detection of example outliers; Detect Outlier (Distances); Detect Outlier (Densities); Detect Outlier (LOF); Detect Outliers (COF); Summary; Chapter 6: Missing Values; Missing or empty?; Types of missing data; Missing completely at random; Missing at random; Not missing at random; Categorizing missing data; Finding MCAR data; Finding MAR data; Finding NMAR data; A cautionary note; Effect of missing data; Options for handling missing data.
Returning to the root causeIgnore it; Manual editing; Deletion of examples; Deletion of attributes; Imputation with single values; Modeling; Summary; Chapter 7: Transforming Data; Creating new attributes; Aggregation; Using pivoting; Using de-pivoting; Summary; Chapter 8: Reducing Data Size; Removing examples using sampling; Removing attributes; Removing useless attributes; Weighting attributes; Selecting attributes using models; Summary; Chapter 9: Resource Constraints; Measuring and estimating performance; Measuring performance; Adding memory; Parallel processing; Restructuring processes.
Local Note eBooks on EBSCOhost EBSCO eBook Subscription Academic Collection - North America
Subject RapidMiner (Electronic resource)
RapidMiner (Electronic resource)
Data mining.
Data mining.
Genre/Form Electronic books.
Other Form: Print version: Chisholm, Andrew, 1959- author. Exploring data with RapidMiner 9781782169338 (OCoLC)868306800
ISBN 9781782169345 (electronic book)
1782169342 (electronic book)
9781782169338
1782169334