Data mining Online Test Series 1, Data mining Question and Answers, Mock Test
Finish Quiz
0 of 20 questions completed
Questions:
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
Information
Data mining Online Test Series 1, Data mining Question and Answers, Mock Test, Online Data mining, online Test Quiz 1. Data mining Online Test 1 Question and Answers 2024. Online Data Mining Practice and Preparation Tests cover Mining Engineering, Data Mining, Data Sufficiency, Data Interpretation, Data Analysis Test 1, Data Base. Data mining is defined as a process used to extract usable data from a larger set of any raw data. It implies analysing data patterns in large batches of data using one or more software. Data mining has applications in multiple fields, like science and research. Here we are providing Data mining Online Test in English Now Test your self for “Data mining Online Test in English” Exam by using below quiz…
This paper has 20 questions.
Time allowed is 25 minutes.
The Data mining Online Test is Very helpful for all students. Now Scroll down below n click on “Start Quiz” or “Start Test” and Test yourself.
You have already completed the quiz before. Hence you can not start it again.
Quiz is loading...
You must sign in or sign up to start the quiz.
You have to finish following quiz, to start this quiz:
Results
0 of 20 questions answered correctly
Your time:
Time has elapsed
You have reached 0 of 0 points, (0)
Average score |
|
Your score |
|
Categories
- Not categorized 0%
Pos. | Name | Entered on | Points | Result |
---|---|---|---|---|
Table is loading | ||||
No data available | ||||
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15
- 16
- 17
- 18
- 19
- 20
- Answered
- Review
-
Question 1 of 20
1. Question
Which of the following data sources used by data Mining to discover interesting knowledge from large amounts of data
Correct
Data mining is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems, data warehouses, file systems. It is an interdisciplinary subfield of computer science. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
Incorrect
Data mining is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems, data warehouses, file systems. It is an interdisciplinary subfield of computer science. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
-
Question 2 of 20
2. Question
Which of the following data mining system does not utilize any functionality of a database.
Correct
In No-coupling scheme, the data mining system does not utilize any of the database or data warehouse functions. It fetches the data from a particular source and processes that data using some data mining algorithms. The data mining result is stored in another file.
Incorrect
In No-coupling scheme, the data mining system does not utilize any of the database or data warehouse functions. It fetches the data from a particular source and processes that data using some data mining algorithms. The data mining result is stored in another file.
-
Question 3 of 20
3. Question
Which of the data mining system is mainly for memory-based data mining system that does not require high scalability and high performance.
Correct
In loose coupling, the data mining system uses the database or data warehouse for data retrieval. In loose coupling data mining architecture, the data mining system retrieves data from the database or data warehouse, processes data using data mining algorithms and stores the result in those systems. This architecture is mainly for memory-based data mining system that does not require high scalability and high performance.
Incorrect
In loose coupling, the data mining system uses the database or data warehouse for data retrieval. In loose coupling data mining architecture, the data mining system retrieves data from the database or data warehouse, processes data using data mining algorithms and stores the result in those systems. This architecture is mainly for memory-based data mining system that does not require high scalability and high performance.
-
Question 4 of 20
4. Question
Which are the three tiers in the tight-coupling data mining architecture
i) Data Layer
ii) Application Layer
iii) Front end Layer
iv) All of the AboveCorrect
In tight-coupling, database or data warehouse is treated as an information retrieval component of data mining system using integration. All the features of database or data warehouse are used to perform data mining tasks. This architecture provides system scalability, high performance, and integrated information. There are three tiers in the tight-coupling data mining architecture:
1. Data layer: This layer is an interface for all data sources. Data mining results are stored in data layer so it can be presented to end-user in the form of reports or another kind of visualization.2. Application layer: It is used to retrieve data from the database. Some transformation routine can be performed here to transform data into the desired format. Then data is processed using various data mining algorithms.
3. Front-end layer: It provides intuitive and friendly user interface for end-user to interact with data mining system. Data mining result presented in visualization form to the user in the front-end layer.
Incorrect
In tight-coupling, database or data warehouse is treated as an information retrieval component of data mining system using integration. All the features of database or data warehouse are used to perform data mining tasks. This architecture provides system scalability, high performance, and integrated information. There are three tiers in the tight-coupling data mining architecture:
1. Data layer: This layer is an interface for all data sources. Data mining results are stored in data layer so it can be presented to end-user in the form of reports or another kind of visualization.2. Application layer: It is used to retrieve data from the database. Some transformation routine can be performed here to transform data into the desired format. Then data is processed using various data mining algorithms.
3. Front-end layer: It provides intuitive and friendly user interface for end-user to interact with data mining system. Data mining result presented in visualization form to the user in the front-end layer.
-
Question 5 of 20
5. Question
Classification is a classic data mining technique based on
Correct
Classification is a classic data mining technique based on machine learning. Basically, classification is used to classify each item in a set of data into one of a predefined set of classes or groups. Classification method makes use of mathematical techniques such as decision trees, linear programming, neural network and statistics.
Incorrect
Classification is a classic data mining technique based on machine learning. Basically, classification is used to classify each item in a set of data into one of a predefined set of classes or groups. Classification method makes use of mathematical techniques such as decision trees, linear programming, neural network and statistics.
-
Question 6 of 20
6. Question
Match the Following
a) Sequential Pattern i) makes meaningful objects that have similar characteristic
b) Prediction ii) discover relationship of a particular item on other items
c) Clustering iii) discover similar patterns in data transaction
d) Association iv) discover relationship between dependent and independent variablesCorrect
Incorrect
-
Question 7 of 20
7. Question
Prediction analysis technique can be used in sale to predict profit for the future if
Correct
The prediction, as its name implied, is one of a data mining techniques that discovers the relationship between independent variables and relationship between dependent and independent variables. For instance, the prediction analysis technique can be used in the sale to predict profit for the future if we consider the sale is an independent variable, profit could be a dependent variable. Then based on the historical sale and profit data, we can draw a fitted regression curve that is used for profit prediction.
Incorrect
The prediction, as its name implied, is one of a data mining techniques that discovers the relationship between independent variables and relationship between dependent and independent variables. For instance, the prediction analysis technique can be used in the sale to predict profit for the future if we consider the sale is an independent variable, profit could be a dependent variable. Then based on the historical sale and profit data, we can draw a fitted regression curve that is used for profit prediction.
-
Question 8 of 20
8. Question
Genetic algorithms is
Correct
Genetic Algorithm (GA) is a search-based optimization technique based on the principles of Genetics and Natural Selection. It is frequently used to find optimal or near-optimal solutions to difficult problems which otherwise would take a lifetime to solve. It is frequently used to solve optimization problems, in research, and in machine learning.
Incorrect
Genetic Algorithm (GA) is a search-based optimization technique based on the principles of Genetics and Natural Selection. It is frequently used to find optimal or near-optimal solutions to difficult problems which otherwise would take a lifetime to solve. It is frequently used to solve optimization problems, in research, and in machine learning.
-
Question 9 of 20
9. Question
Data Mining Enterprise-wide applications generally range in size from
Correct
Enterprise-wide applications generally range in size from 10 gigabytes to over 11 terabytes.
Incorrect
Enterprise-wide applications generally range in size from 10 gigabytes to over 11 terabytes.
-
Question 10 of 20
10. Question
Data mining consists of major elements that
Correct
Data mining consists of five major elements:
1. Extract, transform, and load transaction data onto the data warehouse system.
2. Store and manage the data in a multidimensional database system.
3. Provide data access to business analysts and information technology professionals.
4. Analyze the data by application software.
5. Present the data in a useful format, such as a graph or table.Incorrect
Data mining consists of five major elements:
1. Extract, transform, and load transaction data onto the data warehouse system.
2. Store and manage the data in a multidimensional database system.
3. Provide data access to business analysts and information technology professionals.
4. Analyze the data by application software.
5. Present the data in a useful format, such as a graph or table. -
Question 11 of 20
11. Question
What do you mean by MPP in data mining
Correct
MPP stands for massively parallel processing database. An MPP database is a database that is optimized to be processed in parallel for many operations to be performed by many processing units at a time.
Incorrect
MPP stands for massively parallel processing database. An MPP database is a database that is optimized to be processed in parallel for many operations to be performed by many processing units at a time.
-
Question 12 of 20
12. Question
Which of the following key issues raised by data mining technology
Correct
One of the key issues raised by data mining technology is not a business or technological one, but a social one. It is the issue of individual privacy. Data mining makes it possible to analyze routine business transactions and glean a significant amount of information about individuals buying habits and preferences. Another issue is that of data integrity. Clearly, data analysis can only be as good as the data that is being analyzed. A key implementation challenge is integrating, conflicting or redundant data from different sources.
Incorrect
One of the key issues raised by data mining technology is not a business or technological one, but a social one. It is the issue of individual privacy. Data mining makes it possible to analyze routine business transactions and glean a significant amount of information about individuals buying habits and preferences. Another issue is that of data integrity. Clearly, data analysis can only be as good as the data that is being analyzed. A key implementation challenge is integrating, conflicting or redundant data from different sources.
-
Question 13 of 20
13. Question
Which Tree-shaped structures that represent sets of decisions
Correct
Decision trees represent sets of decisions. These decisions generate rules for the classification of a dataset. Specific decision tree methods include Classification and Regression Trees (CART) and Chi Square Automatic Interaction Detection (CHAID) . CART and CHAID are decision tree techniques used for classification of a dataset. They provide a set of rules that you can apply to a new (unclassified) dataset to predict which records will have a given outcome. CART segments a dataset by creating 2-way splits while CHAID segments using chi square tests to create multi-way splits. CART typically requires less data preparation than CHAID.
Incorrect
Decision trees represent sets of decisions. These decisions generate rules for the classification of a dataset. Specific decision tree methods include Classification and Regression Trees (CART) and Chi Square Automatic Interaction Detection (CHAID) . CART and CHAID are decision tree techniques used for classification of a dataset. They provide a set of rules that you can apply to a new (unclassified) dataset to predict which records will have a given outcome. CART segments a dataset by creating 2-way splits while CHAID segments using chi square tests to create multi-way splits. CART typically requires less data preparation than CHAID.
-
Question 14 of 20
14. Question
It groups data objects based only on information found in the data that describes the objects and their relationships is called
Correct
Cluster analysis groups data objects based only on information found in the data that describes the objects and their relationships. The goal is that the objects within a group be similar to one another and different from the objects in other groups.
Incorrect
Cluster analysis groups data objects based only on information found in the data that describes the objects and their relationships. The goal is that the objects within a group be similar to one another and different from the objects in other groups.
-
Question 15 of 20
15. Question
Which techniques have to be used for the preparing dataset
Correct
Incorrect
-
Question 16 of 20
16. Question
Data mining algorithms to find patterns in the training set which are not present in the general data set. This is called
Correct
The final step of knowledge discovery from data is to verify that the patterns produced by the data mining algorithms occur in the wider data set. Not all patterns found by the data mining algorithms are necessarily valid. It is common for the data mining algorithms to find patterns in the training set which are not present in the general data set. This is called overfitting. To overcome this, the evaluation uses a test set of data on which the data mining algorithm was not trained. The learned patterns are applied to this test set, and the resulting output is compared to the desired output.
Incorrect
The final step of knowledge discovery from data is to verify that the patterns produced by the data mining algorithms occur in the wider data set. Not all patterns found by the data mining algorithms are necessarily valid. It is common for the data mining algorithms to find patterns in the training set which are not present in the general data set. This is called overfitting. To overcome this, the evaluation uses a test set of data on which the data mining algorithm was not trained. The learned patterns are applied to this test set, and the resulting output is compared to the desired output.
-
Question 17 of 20
17. Question
The Knowledge Discovery in Databases (KDD) process is commonly defined with which of follwing stage
Correct
Data mining is the analysis step of the “knowledge discovery in databases” process, or KDD. The Knowledge Discovery in Databases (KDD) process is commonly defined with the stages:
1) Selection
2) Pre-processing
3) Transformation
4) Data Mining
5) Interpretation/Evaluation.Incorrect
Data mining is the analysis step of the “knowledge discovery in databases” process, or KDD. The Knowledge Discovery in Databases (KDD) process is commonly defined with the stages:
1) Selection
2) Pre-processing
3) Transformation
4) Data Mining
5) Interpretation/Evaluation. -
Question 18 of 20
18. Question
The First International Conference on Knowledge Discovery and Data Mining, held in
Correct
The First International Conference on Knowledge Discovery and Data Mining (KDD-95) was held August 20–21, 1995, Montreal, Quebec, Canada. Knowledge Discovery in Databases (KDD) and Data Mining are areas of common interest to researchers in machine learning, machine discovery, statistics, intelligent databases, knowledge acquisition, data visualization, high performance computing, and expert systems.
Incorrect
The First International Conference on Knowledge Discovery and Data Mining (KDD-95) was held August 20–21, 1995, Montreal, Quebec, Canada. Knowledge Discovery in Databases (KDD) and Data Mining are areas of common interest to researchers in machine learning, machine discovery, statistics, intelligent databases, knowledge acquisition, data visualization, high performance computing, and expert systems.
-
Question 19 of 20
19. Question
Which are the key challenges faced by the Distributed data mining algorithm
Correct
Key challenges faced distributed data mining algorithms include:
1. To reduce the amount of communication needed to perform the distributed computation.
2. To effectively consolidate the data mining results obtained from multiple sources.
3. To address data security issues.Incorrect
Key challenges faced distributed data mining algorithms include:
1. To reduce the amount of communication needed to perform the distributed computation.
2. To effectively consolidate the data mining results obtained from multiple sources.
3. To address data security issues. -
Question 20 of 20
20. Question
Techniques from Parallel Computing are often important in
Correct
Techniques from high performance (parallel) computing are often important in addressing the massive size of some data sets.
Incorrect
Techniques from high performance (parallel) computing are often important in addressing the massive size of some data sets.