Workflow Element Store

  1. Annotation
  2. Feature Selection
  3. Auto-Preprocessing libraries
  4. Interaction Features
  5. Handling Noisy Data
  6. Data Scaling and Normalization
  7. Dimensionality Reduction
  8. Feature Extraction from Images
  9. Data Transformations
  10. Data Partitioning - Train, Validation, & Test
  11. Handling Time-Series Data
  12. Time-Based Features
  13. Binning / Discretization
  14. Handling Imbalanced Classes
  15. Handling Categorical Data
  16. Augmentation
  17. Handling Missing Data
  18. AutoEDA libraries
  19. Textual Feature Extraction
  20. Domain-Specific Feature Engineering
  21. Dealing with Outliers
  22. Polynomial Features
  1. Multiclass Classification Techniques
  2. External Validation
  3. Regression Analysis
  4. Association Rules
  5. Clustering
  6. Binary Classification Techniques
  7. Forecasting Techniques
  8. Ensemble Techniques
  9. Natural Language Processing
  10. Transfer Learning
  11. Network Analytics/ GeoSpatial Analytics
  12. Reinforcement Learning
  13. Hyperparameter Tuning
  14. Cross-Validation
  15. AutoML
  16. Learning Rate Scheduling
  17. Model Comparison
  18. Data Augmentation
  19. Evaluation Metrics
  20. Regular Monitoring and Logging
  21. Model Interpretability
  22. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  23. Regularization Techniques
  24. Weight Initialization
  25. Cross-Validation
  26. Blackbox - Neural Network Models
  27. Batch Size Selection
  28. Early Stopping
  29. Regularization
  30. Batch Normalization
  31. Transfer Learning
  32. Performance Visualization
  33. Word Embeddings
  34. Recommendation Engine
  1. Datawarehouse
  2. Evidently.ai
  3. Data Preprocessing pipeline models
  4. model registry
  5. Apache Airflow
  6. code repository
  7. Databases
  8. Github
  9. Github Actions
  10. Kafka Brokers
ML Workflow - Architecture
  • Element belongs to model
  • Element not belongs to model

Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)