出版時(shí)間:2003-9 出版社:機(jī)械工業(yè)出版社 作者:lan H.Witten,Eibe Frank 頁數(shù):369
Tag標(biāo)簽:無
內(nèi)容概要
這是一本將數(shù)據(jù)挖掘算法和數(shù)據(jù)挖掘?qū)嵺`完美結(jié)合起來的優(yōu)秀教材。作者以其豐富的經(jīng)驗(yàn),對(duì)數(shù)據(jù)挖掘的概念和數(shù)據(jù)挖掘所有的技術(shù)(特別是機(jī)器學(xué)習(xí))進(jìn)行了深入淺出的介紹,并對(duì)應(yīng)用機(jī)器學(xué)習(xí)工具進(jìn)行數(shù)據(jù)挖掘給出了良好的建議。數(shù)據(jù)挖掘中的各個(gè)關(guān)鍵要素也事例融合在眾多實(shí)例中加以介紹。
本書還介紹了Weka這種基于Java的軟件系統(tǒng)。該軟件系統(tǒng)可以用來分析數(shù)據(jù)集,找到適用的模式,進(jìn)行正確的分析,也可以用來開發(fā)自己的機(jī)器學(xué)方案。
本書的主要特點(diǎn):
解釋數(shù)據(jù)挖掘算法的原理。
通過實(shí)例幫助讀者根據(jù)實(shí)際情況選擇合適的算法,并比較和評(píng)估不同方法得出的結(jié)果。
介紹提高性能的技術(shù),包括數(shù)據(jù)處理以及組合不同方法得到的輸出。
提供了本書所有的Weka軟件和附加學(xué)習(xí)材料,可以從http://www.mkp.com/datamining上下載這些資料。
作者簡介
Lan H.Witten,新西蘭懷卡托大學(xué)計(jì)算機(jī)科學(xué)系教授。他是ACM和新西蘭皇家學(xué)會(huì)的成員,并參加了英國、美國、加拿大和新西蘭的專業(yè)計(jì)算、信息檢索、工程等協(xié)會(huì)。他著有多部著作,是多家技術(shù)雜志的作者,發(fā)表過大量論文。
書籍目錄
ForewordPreface1 What's it all about? 1.1 Data mining and machine learning 1.2 Simple examples:The weather problem and others 1.3 Fielded application 1.4 Machine learning and statistics 1.5 Generalization as search 1.6 Data mining and ethics 1.7 Further reading2 Input:Concepts,instances,attributes 2.1 What's a concept? 2.2 What's in an example? 2.3 What's in an attribute? 2.4 Preparing the input 2.5 Further reading3 Output:Knowledge representation 3.1 Decision tables 3.2 Decision trees 3.3 Classification rules 3.4 Association rules 3.5 Rules with exceptions 3.6 Rules involving relations 3.7 Trees for numeric prediction 3.8 Instance-based representation 3.9 Clusters 3.10 Further reading 4 Algorithms:The basic methods 4.1 Infereing rudimentary rules 4.2 Statistical modeling 4.3 Divide and conuquer:Constructing decision trees 4.4 Covering algorithms:Construsting rules 4.5 Mining association rules 4.6 Linear models 4.7 Instance-based learning 4.8 Further reading5 Credibility:Evaluation what's been learnde 5.1 Training and testing 5.2 predicting per formance 5.3 Cross-vaidation 5.4 Other estimates 5.5 Comparing data mining schems 5.6 Predicting Probabilities 5.7 Counting the cost 5.8 Evaluating numer ic prediction 5.9 The minimum description length principle 5.10 Applying MDL to clustering 5.11 Further reading6 Implemententation:Real machine learning schemes 6.1 Decision tress 6.2 Classification rules 6.3 Extending linear classification:Support vector machines 6.4 Instance-based learning 6.5 Numeric prediction 6.6 Clustering7 Moving on:Engineering the input and output 7.1 Attribute selection 7.2 Discretizing numeric attributes 7.3 Automtic data cleansing 7.4 Combining multiple models 7.5 Further reading8 Nuts and bolts:Machine learning algorithms in Java 8.1 Getting started 8.2 Javadoc and the class library 8.3 Processing dataset using the machine learning programs 8.4 Embedded machine learning 8.5 Writing new learning schemes9 Looking forward 9.1 learning from massive datasets 9.2 Visualizing machine learning 9.3 Incorporation domain knowlgdge 9.4 Text mining 9.5 Mining the World Wide Web 9.6 Further readingReferencesIndexAbout the authors
媒體關(guān)注與評(píng)論
書評(píng)本書是綜合運(yùn)用數(shù)據(jù)挖掘、數(shù)據(jù)分析、信息理論通訊機(jī)器學(xué)習(xí)技術(shù)的里程碑。
編輯推薦
其它版本請(qǐng)見:《經(jīng)典原版書庫·數(shù)據(jù)挖掘:實(shí)用機(jī)器學(xué)習(xí)技術(shù)(英文版)(第2版)(新版)》
圖書封面
圖書標(biāo)簽Tags
無
評(píng)論、評(píng)分、閱讀與下載