万本电子书0元读

万本电子书0元读

顶部广告

Python Social Media Analytics电子书

售       价:¥

1人正在读 | 0人评论 9.8

作       者:Siddhartha Chatterjee, Michal Krystyanczuk

出  版  社:Packt Publishing

出版时间:2017-07-28

字       数:34.5万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(条)
  • 读书简介
  • 目录
  • 累计评论(条)
Leverage the power of Python to collect, process, and mine deep insights from social media data About This Book ? Acquire data from various social media platforms such as Facebook, Twitter, YouTube, GitHub, and more ? Analyze and extract actionable insights from your social data using various Python tools ? A highly practical guide to conducting efficient social media analytics at scale Who This Book Is For If you are a programmer or a data analyst familiar with the Python programming language and want to perform analyses of your social data to acquire valuable business insights, this book is for you. The book does not assume any prior knowledge of any data analysis tool or process. What You Will Learn ? Understand the basics of social media mining ? Use PyMongo to clean, store, and access data in MongoDB ? Understand user reactions and emotion detection on Facebook ? Perform Twitter sentiment analysis and entity recognition using Python ? Analyze video and campaign performance on YouTube ? Mine popular trends on GitHub and predict the next big technology ? Extract conversational topics on public internet forums ? Analyze user interests on Pinterest ? Perform large-scale social media analytics on the cloud In Detail Social Media platforms such as Facebook, Twitter, Forums, Pinterest, and YouTube have become part of everyday life in a big way. However, these complex and noisy data streams pose a potent challenge to everyone when it comes to harnessing them properly and benefiting from them. This book will introduce you to the concept of social media analytics, and how you can leverage its capabilities to empower your business. Right from acquiring data from various social networking sources such as Twitter, Facebook, YouTube, Pinterest, and social forums, you will see how to clean data and make it ready for analytical operations using various Python APIs. This book explains how to structure the clean data obtained and store in MongoDB using PyMongo. You will also perform web scraping and visualize data using Scrappy and Beautifulsoup. Finally, you will be introduced to different techniques to perform analytics at scale for your social data on the cloud, using Python and Spark. By the end of this book, you will be able to utilize the power of Python to gain valuable insights from social media data and use them to enhance your business processes. Style and approach This book follows a step-by-step approach to teach readers the concepts of social media analytics using the Python programming language. To explain various data analysis processes, real-world datasets are used wherever required.
目录展开

Title Page

Copyright

Python Social Media Analytics

Credits

About the Authors

Acknowledgments

About the Reviewer

www.PacktPub.com

Why subscribe?

Customer Feedback

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

Introduction to the Latest Social Media Landscape and Importance

Introducing social graph

Notion of influence

Social impacts

Platforms on platform

Delving into social data

Understanding semantics

Defining the semantic web

Exploring social data applications

Understanding the process

Working environment

Defining Python

Selecting an IDE

Illustrating Git

Getting the data

Defining API

Scraping and crawling

Analyzing the data

Brief introduction to machine learning

Techniques for social media analysis

Setting up data structure libraries

Visualizing the data

Getting started with the toolset

Summary

Harnessing Social Data - Connecting, Capturing, and Cleaning

APIs in a nutshell

Different types of API

RESTful API

Stream API

Advantages of social media APIs

Limitations of social media APIs

Connecting principles of APIs

Introduction to authentication techniques

What is OAuth?

User authentication

Application authentication

Why do we need to use OAuth?

Connecting to social network platforms without OAuth

OAuth1 and OAuth2

Practical usage of OAuth

Parsing API outputs

Twitter

Creating application

Selecting the endpoint

Using requests to connect

Facebook

Creating an app and getting an access token

Selecting the endpoint

Connect to the API

GitHub

Obtaining OAuth tokens programmatically

Selecting the endpoint

Connecting to the API

YouTube

Creating an application and obtaining an access token programmatically

Selecting the endpoint

Connecting to the API

Pinterest

Creating an application

Selecting the endpoint

Connecting to the API

Basic cleaning techniques

Data type and encoding

Structure of data

Pre-processing and text normalization

Duplicate removal

MongoDB to store and access social data

Installing MongoDB

Setting up the environment

Starting MongoDB

MongoDB using Python

Summary

Uncovering Brand Activity, Popularity, and Emotions on Facebook

Facebook brand page

The Facebook API

Project planning

Scope and process

Data type

Analysis

Step 1 – data extraction

Step 2 – data pull

Step 3 – feature extraction

Step 4 – content analysis

Keywords

Extracting verbatims for keywords

User keywords

Brand posts

User hashtags

Noun phrases

Brand posts

User comments

Detecting trends in time series

Maximum shares

Brand posts

User comments

Maximum likes

Brand posts

Comments

Uncovering emotions

How to extract emotions?

Introducing the Alchemy API

Connecting to the Alchemy API

Setting up an application

Applying Alchemy API

How can brands benefit from it?

Summary

Analyzing Twitter Using Sentiment Analysis and Entity Recognition

Scope and process

Getting the data

Getting Twitter API keys

Data extraction

REST API Search endpoint

Rate Limits

Streaming API

Data pull

Data cleaning

Sentiment analysis

Customized sentiment analysis

Labeling the data

Creating the model

Model performance evaluation and cross-validation

Confusion matrix

K-fold cross-validation

Named entity recognition

Installing NER

Combining NER and sentiment analysis

Summary

Campaigns and Consumer Reaction Analytics on YouTube – Structured and Unstructured

Scope and process

Getting the data

How to get a YouTube API key

Data pull

Data processing

Data analysis

Sentiment analysis in time

Sentiment by weekday

Comments in time

Number of comments by weekday

Summary

The Next Great Technology – Trends Mining on GitHub

Scope and process

Getting the data

Rate Limits

Connection to GitHub

Data pull

Data processing

Textual data

Numerical data

Data analysis

Top technologies

Programming languages

Programming languages used in top technologies

Top repositories by technology

Comparison of technologies in terms of forks, open issues, size, and watchers count

Forks versus open issues

Forks versus size

Forks versus watchers

Open issues versus Size

Open issues versus Watchers

Size versus watchers

Summary

Scraping and Extracting Conversational Topics on Internet Forums

Scope and process

Getting the data

Introduction to scraping

Scrapy framework

How it works

Related tools

Creating a project

Creating spiders

Teamspeed forum spider

Data pull and pre-processing

Data cleaning

Part-of-speech extraction

Data analysis

Introduction to topic models

Latent Dirichlet Allocation

Applying LDA to forum conversations

Topic interpretation

Summary

Demystifying Pinterest through Network Analysis of Users Interests

Scope and process

Getting the data

Pinterest API

Step 1 - creating an application and obtaining app ID and app secret

Step 2 - getting your authorization code (access code)

Step 3 - exchanging the access code for an access token

Step 4 - testing the connection

Getting Pinterest API data

Scraping Pinterest search results

Building a scraper with Selenium

Scraping time constraints

Data pull and pre-processing

Pinterest API data

Bigram extraction

Building a graph

Pinterest search results data

Bigram extraction

Building a graph

Data analysis

Understanding relationships between our own topics

Finding influencers

Conclusions

Community structure

Summary

Social Data Analytics at Scale – Spark and Amazon Web Services

Different scaling methods and platforms

Parallel computing

Distributed computing with Celery

Celery multiple node deployment

Distributed computing with Spark

Text mining With Spark

Topic models at scale

Spark on the Cloud – Amazon Elastic MapReduce

Summary

累计评论(条) 个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部