Getting Started with Talend Open Studio for Data Integration
Table of Contents
Getting Started with Talend Open Studio for Data Integration
Credits
Foreword
Foreword
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. Knowing Talend Open Studio
What Talend Open Studio is
Use cases
History of Talend Open Studio
Benefits of Talend Open Studio
Installing Talend Open Studio
Prerequisites
Installation guide
Other useful software
Text editor
MySQL
Sample jobs and data
Summary
2. Working with Talend Open Studio
Studio definitions
Starting the Studio
Tour of the Studio
The Repository
The design workspace
The Palette
Configuration tabs
Outline and Code panels
Creating a new project
Creating an example job
Metadata
Summary
3. Transforming Files
Transforming XML to CSV
Transforming CSV to XML
Maps and expressions
Advanced XML output for complex XML structures
Working with multi-schema XML files
Enriching data with lookups
Extracting data from Excel files
Extracting data from multiple sheets
Joining data from multiple sheets
Summary
4. Working with Databases
Database metadata
Extracting data from a database
Extracts from multiple tables
Joining within the database component
Joining outside the database component
Writing data to a database
Database to database transfer
Modifying data in a database
Dynamic database lookup
Summary
5. Filtering, Sorting, and Other Processing Techniques
Filtering data
Simple filter
Filter and rejects
Filter and split
Sorting data
Aggregating data
Normalizing and denormalizing data
Data normalization
Data denormalization
Extracting delimited fields
Find and replace
Sampling rows
Summary
6. Managing Files
Managing local files
Copying files
Copying and removing files
Renaming files
Deleting files
Timestamping a file
Listing files in a directory
Checking for files
Archiving and unarchiving files
FTP file operations
FTP Metadata
FTP Put
FTP Get
FTP File Exist
FTP File List and Rename
Deleting files on an FTP server
Summary
7. Job Orchestration
What is a subjob
A simple subjob
On Subjob Error
On Component OK
Run If
Jobs as subjobs
Iterating and looping
Iterate connections
ForEach loop
Loop "n" times
Infinite loop
Duplicating and merging dataflows
Duplicating data
Merging data
Summary
8. Managing Jobs
Job versions
Exporting and importing jobs
Exporting jobs
Exporting a project
Exporting a job
Exporting a job for execution
Importing jobs
Importing a project
Importing a job
Scheduling jobs
Summary
9. Global Variables and Contexts
Global variables
Studio global variables
User defined global variables
Contexts
Embedded context variables
Repository context variables
External context variables
Complex context variables
Using embedded, repository, and external contexts
Summary
10. Worked Examples
Product catalog
Data import from the ERP system
Data import from Fabric Fashions
Data import from Runway Collections
Product inventory data
Order file processing
Order status updates
Automating processes
E-mailing daily sales
Automating product visibility
Summary
A. Installing Sample Jobs and Data
Downloading job and data files
Sample data files
Sample database
Sample jobs
B. Resources
Talend documentation
TalendForge forum
Webinars
Tutorials
Talend Exchange
Index