MCQ on Data mining

Data Mining

Assiut University

Academic year: 2021/2022

Biological Data Mining zelalem simon SACET

a) More than one type

a. identifying patterns in data.

b. relationships in data.

c. data.

d. trends in data.

78. Our use of association analysis will yield the same frequent itemsets

and strong association rules whether a specific item occurs once or

three times in an individual transaction.

79. The k-means clustering algorithm that we studied will automatically find

the best value of k as part of its normal operation.

80. A density-based clustering algorithm can generate non-globular

clusters.

81. In association rule mining the generation of the frequent itemsets is the

computationally intensive step.

Neural Networks are complex ______________ with many parameters.

a) Linear Functions

b) Nonlinear Functions

c) Discrete Functions

d) Exponential Functions

82. A 4-input neuron has weights 1, 2, 3 and 4. The transfer function is

linear with the constant of proportionality being equal to 2. The inputs are 4,

10, 5 and 20 respectively. The output will be:
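The arithmetic behind this question can be sketched as follows. This is a minimal sketch that assumes the neuron has no bias term and that a "linear transfer function with constant of proportionality 2" means output = 2 × (weighted sum of inputs); the function name `neuron_output` is illustrative only.

```python
# Weights and inputs as given in the question.
weights = [1, 2, 3, 4]
inputs = [4, 10, 5, 20]
k = 2  # constant of proportionality of the linear transfer function


def neuron_output(weights, inputs, k):
    """Weighted sum of the inputs, scaled by k (assumes no bias term)."""
    net = sum(w * x for w, x in zip(weights, inputs))  # 4 + 20 + 15 + 80 = 119
    return k * net


output = neuron_output(weights, inputs, k)
print(output)  # 238
```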

82. An ANN is composed of a large number of highly interconnected processing

96. Given a rule of the form IF X THEN Y, rule confidence is defined as the

conditional probability that

a. Y is true when X is known to be true.

b. X is true when Y is known to be true.

c. Y is false when X is known to be false.

d. X is false when Y is known to be false.
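Rule confidence is usually computed from counts as the conditional probability P(Y | X) = count(X and Y) / count(X). The sketch below illustrates that formula; the counts are hypothetical and do not come from the question bank.

```python
def confidence(count_x_and_y, count_x):
    """Confidence of IF X THEN Y: fraction of X-instances that also satisfy Y."""
    if count_x == 0:
        raise ValueError("confidence is undefined when X never occurs")
    return count_x_and_y / count_x


# Hypothetical counts: 40 instances satisfy X, 30 of them also satisfy Y.
conf_x_then_y = confidence(30, 40)
print(conf_x_then_y)  # 0.75
```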

97. Rule support is defined as

a. the percentage of instances that contain the antecedent conditional items

listed in the association rule.

b. the percentage of instances that contain the consequent conditions listed in

the association rule.

c. the percentage of instances that contain all items listed in the association

rule.

d. the percentage of instances in the database that contain at least one of the

antecedent conditional items listed in the association rule.
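Under the standard definition, the support of a rule is the fraction of instances that contain every item in the rule, antecedent and consequent together. The sketch below shows that computation on a small hypothetical transaction list (the items and the helper `rule_support` are made up for illustration). Because each transaction is a set, an item counts once no matter how many times it was bought.

```python
# Hypothetical transactions; each one is a set of items.
transactions = [
    {"milk", "bread"},
    {"milk", "eggs"},
    {"milk", "bread", "eggs"},
    {"bread"},
]


def rule_support(transactions, antecedent, consequent):
    """Fraction of transactions containing all items of the rule."""
    items = set(antecedent) | set(consequent)
    hits = sum(1 for t in transactions if items <= t)
    return hits / len(transactions)


# IF milk THEN bread: two of the four transactions contain both items.
support = rule_support(transactions, {"milk"}, {"bread"})
print(support)  # 0.5
```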

98. This approach is best when we are interested in finding all possible

interactions among a set of attributes.

a. decision tree

b. association rules

c. K-Means algorithm

d. genetic learning

99. The choice of a data mining tool is made at this step of the KDD process.

a. goal identification

b. creating a target dataset

c. data preprocessing

d. data mining

100. Attributes may be eliminated from the target dataset during this step of the

KDD process.

a. creating a target dataset

b. data preprocessing

c. data transformation

d. data mining

101. This step of the KDD process model deals with noisy data.

a. creating a target dataset

b. data preprocessing

c. data transformation

d. data mining

102. A common method used by some data mining techniques to deal with

missing data items during the learning process.

a. replace missing real-valued data items with class means

b. discard records with missing data

c. replace missing attribute values with the values found within other

similar instances
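To make one of the listed strategies concrete, the sketch below illustrates option (a), replacing missing real-valued items with the mean of the same class. It is only an illustration of that strategy, not a statement about which option is correct; the records, the `income` attribute and the helper `impute_class_means` are hypothetical.

```python
from collections import defaultdict

# Hypothetical records; None marks a missing real-valued item.
records = [
    {"cls": "yes", "income": 40.0},
    {"cls": "yes", "income": None},
    {"cls": "yes", "income": 60.0},
    {"cls": "no", "income": 20.0},
    {"cls": "no", "income": None},
]


def impute_class_means(records, attr, class_key):
    """Return copies of the records with missing attr filled by the class mean."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for r in records:
        if r[attr] is not None:
            sums[r[class_key]] += r[attr]
            counts[r[class_key]] += 1
    means = {c: sums[c] / counts[c] for c in counts}
    # A class with no observed values has no mean, so its gaps stay None.
    return [
        {**r, attr: means.get(r[class_key])} if r[attr] is None else dict(r)
        for r in records
    ]


filled = impute_class_means(records, "income", "cls")
print(filled[1]["income"], filled[4]["income"])  # 50.0 20.0
```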

C = No               5
D = No               8
Sex = Male           6

Two Item Sets        Number of Items
A = Yes & B = No     4
A = Yes & C = Yes    5
A = Yes & D = No     5
B = No & D = No      5

108. One rule that can be generated from the tables above is:

If A = Yes Then C = Yes

The confidence for this rule is:

a. 5 / 7

b. 5 / 12

c. 7 / 12

d. 1

109. Based on the two-item set table, which of the following is not a possible two-item set rule?

a. IF C = Yes THEN A = Yes

b. IF B = No THEN A = Yes

c. IF D = No THEN A = Yes

d. IF C = No THEN D = No
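A two-item rule can only be generated if its two items appear together as a row of the two-item set table. The check below is a minimal sketch of that reasoning, using the four rows of the table as given; the string labels and the helper `is_possible` are illustrative only.

```python
# The four rows of the two-item set table, with their counts.
two_item_sets = {
    frozenset({"A=Yes", "B=No"}): 4,
    frozenset({"A=Yes", "C=Yes"}): 5,
    frozenset({"A=Yes", "D=No"}): 5,
    frozenset({"B=No", "D=No"}): 5,
}

# Candidate rules from the question, as (antecedent, consequent) pairs.
candidate_rules = {
    "a": ("C=Yes", "A=Yes"),
    "b": ("B=No", "A=Yes"),
    "c": ("D=No", "A=Yes"),
    "d": ("C=No", "D=No"),
}


def is_possible(rule):
    """A rule is possible only if its item pair is listed in the table."""
    antecedent, consequent = rule
    return frozenset({antecedent, consequent}) in two_item_sets


not_possible = [label for label, rule in candidate_rules.items()
                if not is_possible(rule)]
print(not_possible)  # ['d']
```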

Data mining question bank with answers

1. What is the median of the following set of scores?

18, 6, 12, 10, 14?

a. 10

b. 14

c. 18

d. 12
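The median is the middle value once the scores are sorted. A quick check using Python's standard `statistics` module:

```python
import statistics

scores = [18, 6, 12, 10, 14]

# Sorting makes the middle value visible.
print(sorted(scores))  # [6, 10, 12, 14, 18]

median = statistics.median(scores)
print(median)  # 12
```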

2. Approximately what percentage of scores fall within one standard deviation of the mean in a

normal distribution?

a. 34%

b. 95% (approximately 95% of the data fall within two standard deviations of the mean)

c. 99% (approximately 99.7% of the data fall within three standard deviations of the mean)

d. 68% (approximately 68% of the data fall within one standard deviation of the mean)
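These percentages follow from the normal distribution itself: the fraction of values within k standard deviations of the mean is erf(k / √2). A short sketch using only the standard library (the helper name `fraction_within` is illustrative):

```python
import math


def fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(k, round(fraction_within(k), 4))
# 1 0.6827
# 2 0.9545
# 3 0.9973
```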

3. The goal of ___________ is to summarize and explain a specific set of data.

a. Inferential statistics

b. Descriptive statistics

c. None of the above

d. All of the above

4. The most frequently occurring number in a set of values is called the ____.

a. Mean

b. Median

c. Mode

d. Range

5. As a general rule, the _______ is the best measure of central tendency because it is more precise.

a. Mean

b. Median

c. Mode

d. Range

6. Focusing on describing or explaining data versus going beyond immediate data and making inferences is the

difference between _______.

a. Central tendency and common tendency

b. Mutually exclusive and mutually exhaustive properties

c. Descriptive and inferential

d. Positive skew and negative skew

7. ___________ are used when you want to visually examine the relationship between two quantitative

variables.

a. Bar graphs

b. Pie graphs

c. Line graphs

d. Scatterplots

8. _______ is often the preferred measure of central tendency if the data are severely skewed.

a. Mean

b. Median

c. Mode

d. Range
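A small experiment shows why skew matters when choosing a measure of central tendency. The values below are hypothetical: five similar numbers plus one extreme value that pulls the mean far away from the bulk of the data while barely moving the median.

```python
import statistics

# Hypothetical, severely right-skewed data: one extreme value.
incomes = [30, 32, 35, 38, 40, 400]

mean = statistics.mean(incomes)      # 575 / 6, roughly 95.8
median = statistics.median(incomes)  # (35 + 38) / 2 = 36.5

print(round(mean, 1), median)  # 95.8 36.5
```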

9. ................... is an essential process where intelligent methods are applied to extract data patterns.

A) Data warehousing

B) Data mining

C) Text mining

D) Data selection

10. Data mining can also be applied to other forms of data such as ................

i) Data streams

ii) Sequence data

iii) Networked data

iv) Text data

v) Spatial data

A) i, ii, iii and v only

B) ii, iii, iv and v only

C) i, iii, iv and v only

D) All i, ii, iii, iv and v

11. Which of the following is not a data mining functionality?

A) Characterization and Discrimination

B) Classification and regression

C) Selection and interpretation

D) Clustering and Analysis

12. Hypothesis testing and estimation are both types of descriptive statistics.

a. True

b. False

13. A set of data organized in a participants(rows)-by-variables(columns) format is known as a “data set.”

a. True

b. False