Book contents
- Frontmatter
- Dedication
- Contents
- Preface
- Part One Fundamentals
- 1 Introduction
- 2 Features, Combined: Normalization, Discretization and Outliers
- 3 Features, Expanded: Computable Features, Imputation and Kernels
- 4 Features, Reduced: Feature Selection, Dimensionality Reduction and Embeddings
- 5 Advanced Topics: Variable-Length Data and Automated Feature Engineering
- Part II Case Studies
- Bibliography
- Index
5 - Advanced Topics: Variable-Length Data and Automated Feature Engineering
from Part One - Fundamentals
Published online by Cambridge University Press: 29 May 2020
Summary
This chapter closes Part One by presenting advanced topics: dealing with variable-length feature vectors, the relationship between Feature Engineering and Deep Learning, and automatic Feature Engineering (either supervised or unsupervised). It begins to bridge the purely domain-independent techniques and problems of domain-specific importance. Variable-length feature vectors have long been a problem for ML methods that expect fixed-size vectors. In general, techniques to handle them involve truncation, computing the most general tree and encoding paths on it, or simply a destructive projection into a smaller plane. The chapter briefly delves into some Deep Learning concepts and what they entail for feature engineering. Automated Feature Learning using FeatureTools (the Data Science Machine) and genetic programming is covered, as are Instance Engineering and Unsupervised Feature Engineering (in the form of autoencoders).
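As a rough illustration of two of the approaches named in the summary, the sketch below turns variable-length sequences into fixed-size vectors. It is not taken from the chapter: the function names `truncate_or_pad` and `bag_of_items` are hypothetical, padding with a constant is an assumed companion to truncation, and counting items against a fixed vocabulary is only one possible reading of a "destructive projection".

```python
from collections import Counter
from typing import Hashable, List, Sequence


def truncate_or_pad(seq: Sequence[float], size: int, pad_value: float = 0.0) -> List[float]:
    """Keep the first `size` items; pad with `pad_value` if the sequence is shorter.

    Assumption: padding with a constant is used for short sequences.
    """
    head = list(seq[:size])
    return head + [pad_value] * (size - len(head))


def bag_of_items(seq: Sequence[Hashable], vocabulary: Sequence[Hashable]) -> List[int]:
    """Count occurrences of each vocabulary item.

    This projection is destructive: ordering is lost, and items outside
    the (assumed, fixed) vocabulary are silently dropped.
    """
    counts = Counter(seq)
    return [counts[item] for item in vocabulary]
```

Both functions always return a vector of the same length regardless of the input length, which is what a fixed-size learner needs; the trade-off is that truncation discards the tail of long sequences, while the bag representation discards order.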
Type: Chapter
The Art of Feature Engineering: Essentials for Machine Learning, pp. 112-136
Publisher: Cambridge University Press
Print publication year: 2020