Data Engineering
Data engineering is the cornerstone of AI systems — high-quality data processing pipelines directly determine the upper bound of model performance.
Contents:
- Data Cleaning & Preprocessing — Missing value handling, anomaly detection, feature transformation
- Data Augmentation — Image augmentation, text augmentation, MixUp
- Representation Space Alignment — Cross-modal alignment, domain adaptation
- Data Version Control — DVC, data lineage tracking
- Classic Datasets — Overview of commonly used benchmark datasets