挖掘社交網(wǎng)絡(luò)

出版時(shí)間：2011-5 出版社：東南大學(xué)出版社作者：Matthew A. Russell 頁(yè)數(shù)：332
Tag標(biāo)簽：無(wú)

內(nèi)容概要

Facebook、Twitter和Linkedln產(chǎn)生了大量的寶貴的社交數(shù)據(jù)，但是你怎樣才能找出誰(shuí)通過(guò)社交媒介進(jìn)行聯(lián)系？他們?cè)谟懻撔┦裁?？或者他們?cè)谀膬?？《挖掘社交網(wǎng)絡(luò)(影印版)》這本簡(jiǎn)潔而且具有操作性的書(shū)將為你展示如何回答這些甚至更多的問(wèn)題。你將學(xué)到如何組合社交網(wǎng)絡(luò)數(shù)據(jù)、分析技術(shù)，如何通過(guò)可視化幫助你找到你一直在社交世界中尋找的內(nèi)容，以及那些你都不知道存在的有用信息。
每個(gè)獨(dú)立章節(jié)介紹了在社交網(wǎng)絡(luò)的不同領(lǐng)域挖掘數(shù)據(jù)的技術(shù)，這些領(lǐng)域包括博客和電子郵件。你所需要具備的就是一定的編程經(jīng)驗(yàn)和學(xué)習(xí)基本的python工具的意愿。

作者簡(jiǎn)介

Matthew A.Russell，Digital Reasoning
Systems的工程副總裁和Zaffra的負(fù)責(zé)人，是熱愛(ài)數(shù)據(jù)挖掘、開(kāi)源和網(wǎng)絡(luò)應(yīng)用技術(shù)的計(jì)算機(jī)科學(xué)家。他是《Dojo：The
Definitive Guide》(O'Reilly出版)的作者。

書(shū)籍目錄

Preface
1. Introduction: Hacking on Twitter Data
  Installing Python Development Tools
  Collecting and Manipulating Twitter Data
  Tinkering with Twitter's API
  Frequency Analysis and Lexical Diversity
  Visualizing Tweet Graphs
  Synthesis: Visualizing Retweets with Protovis
  Closing Remarks
2. Microformats: Semantic Markup and Common Sense Collide
  XFN and Friends
  Exploring Social Connections with XFN
  A Breadth-First Crawl of XFN Data
  Geocoordinates: A Common Thread for Just About Anything
  Wikipedia Articles + Google Maps = Road Trip?
  Slicing and Dicing Recipes (for the Health of It)
  Collecting Restaurant Reviews
  Summary
3. Mailboxes: Oldies but Goodies
  mbox: The Quick and Dirty on Unix Mailboxes
  mbox + CouchDB = Relaxed Email Analysis
  Bulk Loading Documents into CouchDB
  Sensible Sorting
  Map/Reduce-Inspired Frequency Analysis
  Sorting Documents by Value
  cotichdb-lucene: Full-Text Indexing and More
  Threading Together Conversations
  Look Who's Talking
  Visualizing Mail "Events" with SIMILE Timeline
  Analyzing Your Own Mail Data
  The Graph Your (Gmail) Inbox Chrome Extension
  Closing Remarks
4. Twitter: Friends, Followers, and Setwise Operations
  RESTful and OAuth-Cladded APIs
  No, You Can't Have My Password
  A Lean, Mean Data-Collecting Machine
  A Very Brief Refactor Interlude
  Redis: A Data Structures Server
  Elementary Set Operations
  Souping Up the Machine with Basic Friend/Follower Metrics
  Calculating Similarity by Computing Common Friends and Followers
  Measuring Influence
  Constructing Friendship Graphs
  Clique Detection and Analysis
  The Infochimps "Strong Links" API
  Interactive 3D.Graph Visualization
  Summary
5. Twitter: The Tweet, the Whole Tweet, and Nothing but the Tweet
  Pen : Sword :: Tweet : Machine Gun (?!?)
  Analyzing Tweets (One Entity at a Time)
  Tapping (Tim's) Tweets
  Who Does Tim Retweet Most Often?
  What's Tim's Influence?
  How Many of Tim's Tweets Contain Hashtags?
  Juxtaposing Latent Social Networks (or #JustinBieber Versus
#TeaParty)
  What Entities Co-Occur Most Often with #JustinBieber and
#TeaParty
  Tweets?
  On Average, Do #JustinBieber or #TeaParty Tweets Have More
  Hashtags?
  Which Gets Retweeted More Often: #JustinBieber or #TeaParty?
  How Much Overlap Exists Between the Entities of #TeaParty and
  #JustinBieber Tweets?
  Visualizing Tons of Tweets
  Visualizing Tweets with Tricked-Out Tag Clouds
  Visualizing Community Structures in Twitter Search Results
  Closing Remarks
6. Linkedln: Clustering Your Professional Network for Fun (and
Profit?)
  Motivation for Clustering
  Clustering Contacts by Job Title
  Standardizing and Counting Job Titles
  Common Similarity Metrics for Clustering
  A Greedy Approach to Clustering
  Hierarchical and k-Means Clustering
  Fetching Extended Profile Information
  Geographically Clustering Your Network
  Mapping Your Professional Network with Google Earth
  Mapping Your Professional Network with Dorling Cartograms
  Closing Remarks
7. Google Buzz: TF-IDF, Cosine Similarity, and Collocations
  Buzz = Twitter + Blogs (???)
  Data Hacking with NLTK
  Text Mining Fundamentals
  A Whiz-Bang Introduction tO TF-IDF
  Querying Buzz Data with TF-IDF
  Finding Similar Documents
  The Theory Behind Vector Space Models and Cosine Similarity
  Clustering Posts with Cosine Similarity
  Visualizing Similarity with Graph Visualizations
  Buzzing on Bigrams
  How the Collocation Sausage Is Made: Contingency Tables and
Scoring
  Functions
  Tapping into Your Gmail
  Accessing Gmail with OAuth
  Fetching and Parsing Email Messages
  Before You Go Off and Try to Build a Search Engine...
  Closing Remarks
8. Blogs et al.: Natural Language Processing (and Beyond)
  NLP: A Pareto-Like Introduction
  Syntax and Semantics
  A Brief Thought Exercise
  A Typical NLP Pipeline with NLTK
  Sentence Detection in Blogs with NLTK
  Summarizing Documents
  Analysis of Luhn's Summarization Algorithm
  Entity-Centric Analysis: A Deeper Understanding of the Data
  Quality of Analytics
  Closing Remarks
9. Facebook:TheAll-in-OneWonder
  Tapping into Your Social Network Data
  From Zero to Access Token in Under 10 Minutes
  Facebook's Query APIs
  Visualizing Facebook Data
  Visualizing Your Entire Social Network
  Visualizing Mutual Friendships Within Groups
  Where Have My Friends All Gone? (A Data-Driven Game)
  Visualizing Wall Data As a (Rotating) Tag Cloud
  Closing Remarks
10. The Semantic Web: A Cocktail Discussion
  An Evolutionary Revolution?
  Man Cannot Live on Facts Alone
  Open-World Versus Closed-World Assumptions
  Inferencing About an Open World with FuXi
  Hope
Index

圖書(shū)封面

圖書(shū)標(biāo)簽Tags

無(wú)

評(píng)論、評(píng)分、閱讀與下載

還沒(méi)讀過(guò)(83)
勉強(qiáng)可看(607)
一般般(103)
內(nèi)容豐富(4297)
強(qiáng)力推薦(352)

挖掘社交網(wǎng)絡(luò) PDF格式下載

用戶(hù)評(píng)論 (總計(jì)8條)

書(shū)的內(nèi)容不錯(cuò)，但是需要有一定數(shù)據(jù)挖掘或者機(jī)器學(xué)習(xí)的基礎(chǔ)，不然挺吃力的，最好對(duì)python不陌生，如果是完全不明白可能需要花功夫了，另外在跑實(shí)驗(yàn)例子的時(shí)候由于twitter的api地址改了，需要google一下
不經(jīng)過(guò)翻譯的書(shū)，經(jīng)典
希望可以用作實(shí)踐指導(dǎo)！
好書(shū)，強(qiáng)烈推薦??！
這本書(shū)對(duì)數(shù)據(jù)挖掘尤其是圖的挖掘很有用
有點(diǎn)偏技術(shù)，不過(guò)也能看到一些Ideas
書(shū)的質(zhì)量，電商兩方面都不錯(cuò)。
發(fā)現(xiàn)其實(shí)沒(méi)有那么多的價(jià)值。并且這本書(shū)好貴好貴。。。。
記得一個(gè)老師說(shuō)過(guò)，如果一本書(shū)里有太多的源碼的話(huà)，這本書(shū)就沒(méi)有多少價(jià)值，這本書(shū)有好多源碼。

挖掘社交網(wǎng)絡(luò)

用戶(hù)評(píng)論 (總計(jì)8條)

推薦圖書(shū)

相關(guān)圖書(shū)