Text Data Mining (English Edition)

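Text Data Mining (English Edition) (文本數據挖掘(英文版)) is a book published by Tsinghua University Press in 2021. Its authors are Chengqing Zong (宗成慶), Rui Xia (夏睿), and Jiajun Zhang (張家俊).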

Basic Information

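  • Chinese title: 文本數據挖掘(英文版) (Text Data Mining, English Edition)
  • Authors: Chengqing Zong (宗成慶), Rui Xia (夏睿), Jiajun Zhang (張家俊)
  • Publisher: Tsinghua University Press
  • Publication date: 2021
  • List price: 119 yuan
  • ISBN: 9787302590293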

Synopsis

Text Data Mining offers a thorough and detailed introduction to the fundamental theories and methods of text data mining, ranging from preprocessing (for both Chinese and English texts), text representation, and feature selection to text classification and text clustering. It also presents the predominant applications of text data mining, such as topic models, sentiment analysis and opinion mining, topic detection and tracking, information extraction, and automatic text summarization.

Table of Contents

Contents
1 Introduction
1.1 The Basic Concepts
1.2 Main Tasks of Text Data Mining
1.3 Existing Challenges in Text Data Mining
1.4 Overview and Organization of This Book
1.5 Further Reading
2 Data Annotation and Preprocessing
2.1 Data Acquisition
2.2 Data Preprocessing
2.3 Data Annotation
2.4 Basic Tools of NLP
2.4.1 Tokenization and POS Tagging
2.4.2 Syntactic Parser
2.4.3 N-gram Language Model
2.5 Further Reading
3 Text Representation
3.1 Vector Space Model
3.1.1 Basic Concepts
3.1.2 Vector Space Construction
3.1.3 Text Length Normalization
3.1.4 Feature Engineering
3.1.5 Other Text Representatio...