Question 1:

Refer to the ROC curve:

As you move along the curve, what changes?

A. The priors in the population

B. The true negative rate in the population

C. The proportion of events in the training data

D. The probability cutoff for scoring

Correct Answer: D
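
The idea behind answer D can be illustrated with a minimal sketch. The labels and scores below are made up for illustration and are not taken from the exhibit; the point is only that each probability cutoff produces one (false positive rate, true positive rate) pair, so changing the cutoff moves you along the curve.

```python
# Hypothetical labels (1 = event) and model scores, invented for illustration.
labels = [1, 1, 0, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]


def roc_point(cutoff):
    """Return (false positive rate, true positive rate) at one cutoff."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= cutoff)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= cutoff)
    positives = sum(labels)
    negatives = len(labels) - positives
    return fp / negatives, tp / positives


# Lowering the cutoff classifies more cases as events, so both rates rise.
for cutoff in (0.85, 0.5, 0.15):
    fpr, tpr = roc_point(cutoff)
    print(f"cutoff={cutoff:.2f}  FPR={fpr:.2f}  TPR={tpr:.2f}")
```

The data, and therefore the priors and the training event proportion, stay fixed throughout; only the cutoff varies.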

Question 2:

Refer to the exhibit:

Based upon the comparative ROC plot for two competing models, which is the champion model and why?

A. Candidate 1, because the area outside the curve is greater

B. Candidate 2, because the area under the curve is greater

C. Candidate 1, because it is closer to the diagonal reference curve

D. Candidate 2, because it shows less overfitting than Candidate 1

Correct Answer: B
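
As a rough illustration of why a larger area under the curve is preferred, the sketch below computes AUC as the share of (event, non-event) pairs in which the event receives the higher score. The two score lists are hypothetical and do not reproduce the exhibit.

```python
def auc(labels, scores):
    """AUC as the fraction of event/non-event pairs ranked correctly.

    Ties count as half a correct ranking.
    """
    events = [s for y, s in zip(labels, scores) if y == 1]
    nonevents = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if e > n else 0.5 if e == n else 0.0
        for e in events
        for n in nonevents
    )
    return wins / (len(events) * len(nonevents))


# Hypothetical data: the same labels scored by two made-up models.
labels = [1, 1, 0, 1, 0, 0, 1, 0]
model_a_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
model_b_scores = [0.9, 0.8, 0.3, 0.7, 0.4, 0.2, 0.6, 0.1]

print("model A AUC:", auc(labels, model_a_scores))
print("model B AUC:", auc(labels, model_b_scores))
```

A model whose curve hugs the diagonal reference line ranks events no better than chance (AUC near 0.5), which is why closeness to the diagonal is not a point in a model's favor.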

Question 3:

This question will ask you to provide missing code segments.

A logistic regression model was fit on a data set where 40% of the outcomes were events (TARGET=1) and 60% were non-events (TARGET=0). The analyst knows that the population where the model will be deployed has 5% events and 95% non-events. The analyst also knows that the company's profit margin for correctly targeted events is nine times higher than the company's loss for incorrectly targeted non-events.

Given the following SAS program:

What X and Y values should be added to the program to correctly score the data?

A. X=40, Y=10

B. X=.05, Y=10

C. X=.05, Y=.40

D. X=.10, Y=.05

Correct Answer: B
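
The SAS program itself is not reproduced here, so the following is only a sketch of the arithmetic that plausibly sits behind the answer, under two assumptions: that X is the population event rate used to correct the oversampled probabilities, and that the scoring cutoff is expressed as 1/Y. The function names are hypothetical.

```python
def adjust_for_priors(p_sample, sample_event_rate, population_event_rate):
    """Rescale a probability estimated on oversampled data to the population priors."""
    event = p_sample * population_event_rate / sample_event_rate
    nonevent = (
        (1 - p_sample) * (1 - population_event_rate) / (1 - sample_event_rate)
    )
    return event / (event + nonevent)


def profit_cutoff(profit_true_positive, loss_false_positive):
    """Probability above which targeting has positive expected profit.

    Target when p * profit > (1 - p) * loss, i.e. p > loss / (profit + loss).
    """
    return loss_false_positive / (profit_true_positive + loss_false_positive)


# Values from the question: 40% events in training, 5% in the population,
# and a profit nine times the size of the loss.
cutoff = profit_cutoff(profit_true_positive=9, loss_false_positive=1)
print("cutoff:", cutoff)  # 1 / (1 + 9)

# A case scored at the training base rate maps back to the population base rate.
print("adjusted:", adjust_for_priors(0.40, 0.40, 0.05))
```

With a 9:1 profit-to-loss ratio the break-even probability is 1/10, which is consistent with Y=10 if the program compares the corrected probability against 1/Y.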

Question 4:

What is a drawback to performing data cleansing (imputation, transformations, etc.) on raw data prior to partitioning the data for honest assessment as opposed to performing the data cleansing after partitioning the data?

A. It violates assumptions of the model.

B. It requires extra computational effort and time.

C. It omits the training (and test) data sets from the benefits of the cleansing methods.

D. There is no ability to compare the effectiveness of different cleansing methods.

Correct Answer: D

Question 5:

Customers were surveyed to assess their intent to purchase a product. An analyst divided the customers into groups defined by the company's pre-assigned market segments and tested for differences in the customers' average intent to purchase. The following is the output from the GLM procedure:

What percentage of customers' intent to purchase is explained by market segment?

A. <0.01%

B. 35%

C. 65%

D. 76%

Correct Answer: D
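
The GLM output is not reproduced here, so the figure cannot be re-derived from it. As a general sketch, the proportion of variation explained by a grouping variable is R-square, the model sum of squares divided by the corrected total sum of squares. The groups below are hypothetical and are not the survey data.

```python
def r_squared(groups):
    """R-square for a one-way layout: SS(model) / SS(corrected total)."""
    all_values = [v for group in groups for v in group]
    grand_mean = sum(all_values) / len(all_values)
    ss_total = sum((v - grand_mean) ** 2 for v in all_values)
    ss_model = sum(
        len(group) * (sum(group) / len(group) - grand_mean) ** 2
        for group in groups
    )
    return ss_model / ss_total


# Hypothetical intent scores for three made-up segments.
segments = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(f"R-square: {r_squared(segments):.2f}")
```

In GLM output this value appears as R-Square, and it can also be computed by hand from the Model and Corrected Total rows of the ANOVA table.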

Question 6:

Refer to the exhibit:

The box plot was used to analyze daily sales data following three different ad campaigns. The business analyst concludes that one of the assumptions of ANOVA was violated. Which assumption has been violated and why?

A. Normality, because Prob > F < .0001.

B. Normality, because the interquartile ranges are different in different ad campaigns.

C. Constant variance, because Prob > F < .0001.

D. Constant variance, because the interquartile ranges are different in different ad campaigns.

Correct Answer: D

Question 7:

An analyst compares the mean salaries of men and women working at a company. The SAS data set SALARY contains variables:

Which SAS programs can be used to find the p-value for comparing men's salaries with women's salaries? (Choose two.)

A. Option A

B. Option B

C. Option C

D. Option D

Correct Answer: AB

Question 8:

Given the following GLM procedure output:

Which statement is correct at an alpha level of 0.05?

A. School*Gender should be removed because it is non-significant.

B. Gender should be removed because it is non-significant.

C. School should be removed because it is significant.

D. Gender should not be removed due to its involvement in the significant interaction.

Correct Answer: D

Question 9:

There are missing values in the input variables for a regression application. Which SAS procedure provides a viable solution?





Correct Answer: C

Question 10:

Screening for non-linearity in binary logistic regression can be achieved by visualizing:

A. A scatter plot of binary response versus a predictor variable.

B. A trend plot of empirical logit versus a predictor variable.

C. A logistic regression plot of predicted probability values versus a predictor variable.

D. A box plot of the odds ratio values versus a predictor variable.

Correct Answer: B
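
A minimal sketch of the empirical logit idea, assuming equal-sized bins of the sorted predictor and a +0.5 continuity adjustment so that bins with no events or no non-events still give a finite value. Both choices are assumptions for illustration (other smoothing constants are also in use), and the data are made up.

```python
import math


def empirical_logits(x, y, n_bins):
    """Return (mean predictor value, empirical logit) for each bin of sorted x."""
    pairs = sorted(zip(x, y))
    size = len(pairs) // n_bins
    result = []
    for b in range(n_bins):
        start = b * size
        # The last bin absorbs any leftover observations.
        end = (b + 1) * size if b < n_bins - 1 else len(pairs)
        chunk = pairs[start:end]
        events = sum(label for _, label in chunk)
        nonevents = len(chunk) - events
        mean_x = sum(value for value, _ in chunk) / len(chunk)
        logit = math.log((events + 0.5) / (nonevents + 0.5))
        result.append((mean_x, logit))
    return result


# Hypothetical predictor values and binary responses.
x = list(range(1, 13))
y = [0, 0, 0, 1, 0, 1, 1, 0, 1, 1, 1, 1]

for mean_x, logit in empirical_logits(x, y, n_bins=3):
    print(f"mean x = {mean_x:5.2f}   empirical logit = {logit:6.3f}")
```

Plotting the logit against the bin means should give a roughly straight line if the predictor is linear on the logit scale; a visible bend suggests a transformation or a polynomial term.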

Author: CertBus