Big Data and ML Tools | Model Evaluation | Machine Learning Workflow | Data Preparation and Cleaning | Einstein Discovery
100
What is Spark?
A scalable, in-memory compute engine for general-purpose data processing.
100
What is an ROC curve?
A plot of the true positive rate against the false positive rate for a binary classifier as the decision threshold varies.
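To make the clue concrete, here is a minimal sketch of how the points on an ROC curve could be computed by hand. The function name `roc_points` and the toy labels and scores are assumptions made up for illustration; they do not come from the quiz or from any particular library.

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) pairs.

    Hypothetical helper: labels are 0/1, scores are classifier outputs
    where a higher score means "more likely positive".
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    # Lower the threshold step by step by walking scores from high to low.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    tp = fp = 0
    for i, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only once all examples tied at this score are counted.
        if i + 1 == len(ranked) or ranked[i + 1][0] != score:
            points.append((fp / negatives, tp / positives))
    return points


# Toy data, invented for this example.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.1]
points = roc_points(labels, scores)
```

Each pair in `points` is one threshold setting. Plotting the pairs with the false positive rate on the x-axis and the true positive rate on the y-axis gives the curve.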
100
What is classification?
Predicting qualitative outputs
200
What is scikit-learn?
A widely used Python library for building machine learning pipelines.
200
What is overfitting?
When a model performs well on training data but poorly on test data.
200
What is regression?
Predicting quantitative outputs
300
What is deep learning?
This type of machine learning algorithm is useful on image, textual, and sound data.
300
What is a model A/B test?
Comparing two models on live production data
300
What is a holdout set?
A portion of labeled data that is not used during model training or tuning.
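As a rough illustration of the clue, the sketch below sets aside a holdout set before any training or tuning happens. The helper name `holdout_split`, the 20% fraction, and the fixed seed are all assumptions chosen for the example.

```python
import random


def holdout_split(rows, holdout_fraction=0.2, seed=0):
    """Shuffle a copy of rows and split it into (training, holdout).

    Hypothetical helper: the fraction and seed are illustrative defaults.
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = rows[:]         # copy, so the caller's list is left untouched
    rng.shuffle(shuffled)
    n_holdout = int(len(shuffled) * holdout_fraction)
    return shuffled[n_holdout:], shuffled[:n_holdout]


# Toy dataset of ten labeled examples, represented here by their ids.
rows = list(range(10))
train, holdout = holdout_split(rows)
```

The holdout rows are touched only once, for the final evaluation. Training and tuning work from `train` alone.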
400
What is underfitting?
When a model fails to produce good results even on training data.
500
What is regularization?
A common technique that penalizes model complexity to reduce overfitting and improve interpretability.
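One way to see a complexity penalty at work is a one-feature ridge (L2) regression without an intercept, where the penalized slope has a closed form. This is only a sketch of one kind of regularization. The function name `ridge_slope` and the toy data are assumptions for illustration.

```python
def ridge_slope(xs, ys, penalty):
    """Slope w that minimizes sum((y - w*x)**2) + penalty * w**2.

    Setting the derivative to zero gives
    w = sum(x*y) / (sum(x*x) + penalty).
    """
    numerator = sum(x * y for x, y in zip(xs, ys))
    denominator = sum(x * x for x in xs) + penalty
    return numerator / denominator


# Toy data lying exactly on y = 2x, invented for this example.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]

unregularized = ridge_slope(xs, ys, penalty=0.0)   # 28 / 14
regularized = ridge_slope(xs, ys, penalty=14.0)    # 28 / 28
```

With no penalty the slope fits the data exactly. A larger penalty pulls the slope toward zero, which is the sense in which regularization trades some fit for a simpler model.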
500
What is 5-fold (or n-fold) cross validation?
Split your data into 5 segments and hold out each segment once for validation, training on the other 4 each time.
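The splitting step in the clue can be sketched as index bookkeeping. The function name `k_fold_indices` is hypothetical, and the sketch assumes the data has already been shuffled, so folds are taken as contiguous blocks.

```python
def k_fold_indices(n_samples, k=5):
    """Return a list of (training_indices, validation_indices), one per fold.

    Hypothetical helper: assumes rows are already shuffled, so each fold
    is a contiguous block. Fold sizes differ by at most one.
    """
    base, extra = divmod(n_samples, k)
    folds = []
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        stop = start + size
        validation = list(range(start, stop))
        training = [i for i in range(n_samples) if i < start or i >= stop]
        folds.append((training, validation))
        start = stop
    return folds


folds = k_fold_indices(10, k=5)
```

Every index lands in exactly one validation block. A model would be trained and scored once per fold, and the five scores averaged.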






Machine Learning Jeopardy
