Home » Data Science » Machine Learning » Let’s say you have a relatively small training set (~5 thousand images). Which training strategy makes the most sense? Q: Practice More Questions From: Disease Detection With Computer Vision Created with Fabric.js 4.6.0 Practice More Questions Data Analysis 2000+ Qs Machine Learning 1000+ Qs Created with Fabric.js 4.6.0 Similar Questions Now let’s say you have a very large dataset (~1 million images). Which training strategies will make the most sense?Let’s say blood pressure (BP) measurements are more likely to be missing among young people, who generally have lower blood pressure. You use mean imputation to train your model. Which option…Let’s see why F1 is used instead of the regular mean of precision and recall. Let’s say the mean of precision and recall is at least 0.75. Which of the following could be the true value of the…You find that your training set has 70% negative examples and 30% positive. Which of the following techniques will NOT help for training this imbalanced dataset?Now let’s say F1 score is at least 0.75. Now which of the following values of precision are possible? Let f(x) be the probability that a person with feature x dies within 5 years. Let Sx(t) be the survival function of a person with feature x. Assume t is measured in years. Which of the following is…You say that you will respond professionally using the information you currently have available in order to move quickly past the objection.Say you have trained a decision tree which never splits on a variable X. What will be the variable importance for X using the permutation method?Fill in the blank: The _____ creates a scatterplot and then adds a small amount of random noise to each point in the plot to make the points easier to find.What function creates a scatterplot and then adds a small amount of random noise to each point in the plot to make the points easier to find?Why is it bad to have the same patients in both training and test sets?True or False: the start and end vectors are fixed throughout training A data analyst writes the following code chunk to return a statistical summary of their dataset: quartet %>% group_by(set) %>% summarize(mean(x), sd(x), mean(y), sd(y), cor(x, y)) Which function will…Assume you have missing data on one of your features, and are considering two options: True or False: "Both options have the same performance".Fill in the blank: An effective slideshow guides your audience through your main communication points, but it does not repeat every word you say. A best practice is to keep text to fewer than five… Created with Fabric.js 4.6.0 Practice More Questions Data Analysis 200+ Qs Machine Learning 100+ Qs