Which data mining process/methodology is thought to be the most comprehensive, according to kdnuggets.com rankings?


A) SEMMA
B) proprietary organizational methodologies
C) KDD Process
D) CRISP-DM

E) All of the above
F) A) and B)

________ represent the labels of multiple classes used to divide a variable into specific groups, examples of which include race, sex, age group, and educational level.

Patterns have been manually ________ from data by humans for centuries, but the increasing volume of data in modern times has created a need for more automatic approaches.

Statistics and data mining both look for data sets that are as large as possible.

A) True
B) False

Knowledge extraction, pattern analysis, data archaeology, information harvesting, pattern searching, and data dredging are all alternative names for ________.

In estimating the accuracy of data mining (or other) classification models, the true positive rate is


A) the ratio of correctly classified positives divided by the total positive count.
B) the ratio of correctly classified negatives divided by the total negative count.
C) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified positives.
D) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified negatives.

E) A) and D)
F) C) and D)
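
As a worked illustration of the quantity this question asks about, here is a minimal sketch of the standard definition: true positive rate = TP / (TP + FN), that is, correctly classified positives over all actual positives. The function name `true_positive_rate` and the sample labels are hypothetical and chosen only for this example.

```python
def true_positive_rate(actual, predicted, positive=1):
    """Return TP / (TP + FN) for paired lists of actual and predicted labels."""
    pairs = list(zip(actual, predicted))
    # True positives: actual positive, predicted positive.
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    # False negatives: actual positive, predicted as anything else.
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    total_positives = tp + fn
    if total_positives == 0:
        raise ValueError("no positive cases in the actual labels")
    return tp / total_positives


# Hypothetical labels: 4 actual positives, 3 of them predicted correctly.
actual = [1, 1, 1, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 1]
rate = true_positive_rate(actual, predicted)
print(rate)
```

The denominator counts only the actual positive cases, so predictions made on actual negatives do not affect the result.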

What does the scalability of a data mining method refer to?


A) its ability to predict the outcome of a previously unknown data set accurately
B) its speed of computation and computational costs in using the model
C) its ability to construct a prediction model efficiently given a large amount of data
D) its ability to overcome noisy data to make somewhat accurate predictions

E) A) and B)
F) A) and C)

Data mining requires specialized data analysts to ask ad hoc questions and obtain answers quickly from the system.

A) True
B) False

Fayyad et al. (1996) defined ________ in databases as a process of using data mining methods to find useful information and patterns in the data.

If using a mining analogy, "knowledge mining" would be a more appropriate term than "data mining."

A) True
B) False

All of the following statements about data mining are true EXCEPT


A) understanding the business goal is critical.
B) understanding the data, e.g., the relevant variables, is critical to success.
C) building the model takes the most time and effort.
D) data is typically preprocessed and/or cleaned before use.

E) B) and C)
F) A) and C)

The data mining in cancer research case study explains that data mining methods are capable of extracting patterns and ________ hidden deep in large and complex medical databases.

Briefly describe five techniques (or algorithms) that are used for classification modeling.

Correct Answer

Decision tree analysis. Decision...

What is the main reason parallel processing is sometimes used for data mining?


A) because the hardware exists in most organizations and it is available to use
B) because most of the algorithms used for data mining require it
C) because of the massive data amounts and search efforts involved
D) because any strategic application requires parallel processing

E) B) and C)
F) C) and D)

As described in the 2degrees case study, a common problem in the mobile telecommunications industry is defined by the term ________, which means customers leaving.

Because of its successful application to retail business problems, association rule mining is commonly called ________.

Correct Answer

market-bas...

The data mining algorithm type used for classification somewhat resembling the biological neural networks in the human brain is


A) association rule mining.
B) cluster analysis.
C) decision trees.
D) artificial neural networks.

E) A) and B)
F) None of the above

Data mining can be very useful in detecting patterns such as credit card fraud, but is of little help in improving sales.

A) True
B) False

Data preparation, the third step in the CRISP-DM data mining process, is more commonly known as ________.

Which broad area of data mining applications analyzes data, forming rules to distinguish between defined classes?


A) associations
B) visualization
C) classification
D) clustering

E) B) and C)
F) C) and D)
