万本电子书0元读

万本电子书0元读

顶部广告

Learning Microsoft Cognitive Services - Second Edition电子书

售       价:¥

1人正在读 | 0人评论 9.8

作       者:Leif Larsen

出  版  社:Packt Publishing

出版时间:2017-10-23

字       数:38.0万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Learn to build interactive and efficient applications by leveraging 24 effective cognitive services APIs powered by Microsoft About This Book ? Explore the capabilities of 24 of the APIs released as part of the Cognitive Services platform ? Build intelligent apps that combine the power of computer vision, speech recognition, and language processing ? Give your apps human-like cognitive intelligence with this hands-on guide Who This Book Is For .NET developers who want to add AI capabilities to their applications will find this book useful. No knowledge of machine learning or AI is necessary to work through this book. What You Will Learn ? Identify a person through visual inspection and audio ? Reduce user effort by utilizing AI-like capabilities ? Understand how to analyze images and text in different ways ? Find out how to analyze images using Vision APIs ? Add video analysis to applications using Vision APIs ? Utilize Search to find anything you want ? Analyze text to extract information and explore text structure In Detail Microsoft has revamped its Project Oxford to launch the all new Cognitive Services platform-a set of 30 APIs to add speech, vision, language, and knowledge capabilities to apps. This book will introduce you to 24 of the APIs released as part of Cognitive Services platform and show you how to leverage their capabilities. More importantly, you'll see how the power of these APIs can be combined to build real-world apps that have cognitive capabilities. The book is split into three sections: computer vision, speech recognition and language processing, and knowledge and search. You will be taken through the vision APIs at first as this is very visual, and not too complex. The next part revolves around speech and language, which are somewhat connected. The last part is about adding real-world intelligence to apps by connecting them to Knowledge and Search APIs. By the end of this book, you will be in a position to understand what Microsoft Cognitive Service can offer and how to use the different APIs. Style and approach This book takes you through essential API capabilities and shows how to utilize them to suit the needs of your application.
目录展开

Title Page

Second Edition

Copyright

Learning Microsoft Cognitive Services

Second Edition

Credits

About the Author

About the Reviewer

www.PacktPub.com

Why subscribe?

Customer Feedback

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

Getting Started with Microsoft Cognitive Services

Cognitive Services in action for fun and life-changing purposes

Setting up boilerplate code

Detecting faces with the Face API

An overview of what we are dealing with

Vision

Computer Vision

Emotion

Face

Video

Video Indexer

Content Moderator

Custom Vision Service

Speech

Bing Speech

Speaker Recognition

Custom Recognition

Translator Speech API

Language

Bing Spell Check

Language Understanding Intelligent Service (LUIS)

Linguistic Analysis

Text Analysis

Web Language Model

Translator Text API

Knowledge

Academic

Entity Linking

Knowledge Exploration

Recommendations

QnA Maker

Custom Decision Service

Search

Bing Web Search

Bing Image Search

Bing Video Search

Bing News Search

Bing Autosuggest

Bing Entity Search

Getting feedback on detected faces

Summary

Analyzing Images to Recognize a Face

Learning what an image is about using the Computer Vision API

Setting up a chapter example project

Generic image analysis

Recognizing celebrities using domain models

Utilizing Optical Character Recognition

Generating image thumbnails

Diving deep into the Face API

Retrieving more information from the detected faces

Deciding whether two faces belong to the same person

Finding similar faces

Grouping similar faces

Adding identification to our smart-house application

Creating our smart-house application

Adding people to be identified

Identifying a person

Automatically moderating user content

The content to moderate

Image moderation

Text moderation

Moderation tools

Using the review tool

Other tools

Summary

Analyzing Videos

Knowing your mood using the Emotion API

Getting images from a web camera

Letting the smart-house know your mood

Diving into the Video API

Video operations as common code

Getting operation results

Wiring up the execution in the ViewModel

Detecting and tracking faces in videos

Detecting motion

Stabilizing shaky videos

Generating video thumbnails

Analyzing emotions in videos

Unlocking video insights using Video Indexer

General overview

Typical scenarios

Key concepts

Breakdowns

Summarized insights

Keywords

Sentiments

Blocks

How to use Video Indexer

Through a web portal

Video Indexer API

Summary

Letting Applications Understand Commands

Creating language-understanding models

Registering an account and getting a license key

Creating an application

Recognizing key data using entities

Understanding what the user wants using intents

Simplifying development using prebuilt models

Prebuilt domains

Training a model

Training and publishing the model

Connecting to the smart-house application

Model improvement through active usage

Visualizing performance

Resolving performance problems

Adding model features

Adding labeled utterances

Looking for incorrect utterance labels

Changing the schema

Active learning

Summary

Speaking with Your Application

Converting text to audio and vice versa

Speaking to the application

Letting the application speak back

Audio output format

Error codes

Supported languages

Utilizing LUIS based on spoken commands

Knowing who is speaking

Adding speaker profiles

Enrolling a profile

Identifying the speaker

Verifying a person through speech

Customizing speech recognition

Creating a custom acoustic model

Creating a custom language model

Deploying the application

Summary

Understanding Text

Setting up a common core

New project

Web requests

Data contracts

Correcting spelling errors

Natural Language Processing using the Web Language Model

Breaking a word into several words

Generating the next word in a sequence of words

Learning if a word is likely to follow a sequence of words

Learning if certain words are likely to appear together

Extracting information through textual analysis

Detecting language

Extracting key phrases from text

Learning if a text is positive or negative

Exploring text using linguistic analysis

Introduction to linguistic analysis

Analyzing text from a linguistic viewpoint

Summary

Extending Knowledge Based on Context

Linking entities based on context

Providing personalized recommendations

Creating a model

Importing catalog data

Importing usage data

Building a model

Consuming recommendations

Recommending items based on prior activities

Summary

Querying Structured Data in a Natural Way

Tapping into academic content using the Academic API

Setting up an example project

Interpreting natural language queries

Finding academic entities from query expressions

Calculating the distribution of attributes from academic entities

Entity attributes

Creating the backend using the Knowledge Exploration Service

Defining attributes

Adding data

Building the index

Understanding natural language

Local hosting and testing

Going for scale

Hooking into Microsoft Azure

Deploying the service

Answering FAQs using QnA Maker

Creating a knowledge base from frequently asked questions

Training the model

Publishing the model

Improving the model

Summary

Adding Specialized Searches

Searching the web from the smart-house application

Preparing the application for web searches

Searching the web

Getting the news

News from queries

News from categories

Trending news

Searching for images and videos

Using a common user interface

Searching for images

Searching for videos

Helping the user with auto suggestions

Adding Autosuggest to the user interface

Suggesting queries

Search commonalities

Languages

Pagination

Filters

Safe search

Freshness

Errors

Summary

Connecting the Pieces

Connecting the pieces

Creating an intent

Updating the code

Executing actions from intents

Searching news on command

Describing news images

Real-life applications using Microsoft Cognitive Services

Uber

DutchCrafters

CelebsLike.me

Pivothead - wearable glasses

Zero Keyboard

The common theme

Where to go from here

Summary

LUIS Entities and Additional Information on Linguistic Analysis

LUIS pre-built entities

Part-of-speech tags

Phrase types

License Information

Video Frame Analyzer

OpenCvSharp3

Newtonsoft.Json

NAudio

Definitions

Grant of Rights

Conditions and Limitations

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部