IMPORTANT QUESTIONS
UNIT 1
Q1) What are the major components of BI ? What do you understand b business Intelligence . Explain its major components
Q2) Major similarities and differences of DSS and BI ?
Q3) What are structured, semi structured and unstructured decisions? Provide two examples of each.
Q4) How can computer provide support to semi structured and unstructured decisions?
Q5) What are some of the drivers and benefits of computerized decision support system?
Q6) Explain Decision modeling process.
Q7) What are the components of DSS?
Q8) Differentiate between DSS and GDSS?
Q9) What are the major components of Business Intelligence?
Q10) Explain different categories or classification of DSS
Q11) List key characteristics or capabilities of DSS
Q12) What is document driven DSS
Q13) How is data driven DSs related to EIS?
Q14) Why is it important to include model in a DSS ?
Q15) Define Groupware?
Q16) List the major groupware tools and divide them into synchronous and asynchronous types.
Q17) Describe online workspace
Q18) Define GSS and list its benefits.
Q19) How GSS improves group work?
Q20) Explain 3 options of deploying GDSS.
UNIT 2
Q1) Explain the following:
a) EIS
b) Expert System
c) OLAP
d) OLTP
e) AI
f) EL Process
g) Snowflake Schema
h) Star Schema
i) Virtual Warehouse
j) Hypothesis driven Exploration
k) Discovery driven
l) ROLAP
m) MOLAP
n) Drill through and drill across
o) Fact table
p) HOLAP
Differentiate between a) & b), c) & d), g) & h), j) & k), l) & m)
Q2) What is data warehouse? Explain various characteristics of data warehouse?
Q3) List the different types of data warehouse architecture
Q4) List the benefits of data warehouses
Q5) Explain various tools for data warehousing?
Q6) What is data cube? What is multidimensional model?
Q7) Why is summary level data required to be kept in Data warehouse?
Q8) What are the various components of data warehouse?
Q9) Explain different types of OLAP servers?
Q10) Explain various types of OLAP operations?
Q11) Differentiate between dependent data mart and independent data mart?
Q12) Differentiate between apex cuboids and base cuboids?
Q13) Explain the architecture for OLAM
Q14) What do you understand by concept hierarchy?
Q15) Suppose that a data warehouse contains three dimensions date, doctor and patient. There is only measure – charge where charge is the fee that a doctor charges to a patient for a visit.
a) Draw a star schema for the above data warehouse
b) Starting with the base cuboid [date, doctor, patient], which sequence of OLAP operations do you need to list the total fee collected by each doctor in the year 2004?
ANS a)
b ) Starting with the base cuboid [day, doctor, patient], what specific OLAP operations should be performed in order to list the total fee collected by each doctor in 2004?
1. roll up from day to month to year
2. slice for year = “2004”
3. roll up on patient from individual patient to all
4. slice for patient = “all”
4. get the list of total fee collected by each doctor in 2004
Q16) Discuss whether or not each of the following activities is a data mining task:
Dividing the customers of a company according to their gender.
ANS: No. This is a simple database query.
Dividing the customers of a company according to their profitability.
ANS: No. This is an accounting calculation, followed by the application of a threshold. However, predicting the profitability of a new customer would be data mining.
Computing the total sales of a company.
ANS: No. Again, this is simple accounting.
Sorting a student database based on student identification numbers.
ANS: No. Again, this is a simple database query.
Predicting the outcomes of tossing a (fair) pair of dice.
ANS: No. Since the die is fair, this is a probability calculation. If the die were not fair, and we needed to estimate the probabilities of each outcome from the data, then this is more like the problems considered by data mining. However, in this specific case, solutions
to this problem were developed by mathematicians a long time ago, and thus, we wouldn’t consider it to be data mining.
Predicting the future stock price of a company using historical records.
ANS: Yes. We would attempt to create a model that can predict the
continuous value of the stock price. This is an example of the area of data mining known as predictive modeling. We could use regression for this modeling, although researchers in many fields have developed a wide variety of techniques for predicting time series.
Q17) Suppose that a data warehouse for Big University consists of the following four dimensions: student, course, semester, and instructor, and two measures count and avg_grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and instructor combination), the avg_grade measure stores the actual course grade of the student. At higher conceptual levels, avg_grade stores the average grade for the given combination.
Draw a snowflake schema diagram for the data warehouse.
Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up from semester to year) should one perform in order to list the average grade of CS courses for each Big University student.
If each dimension has five levels (including all), such as “student < major < status < university < all”, how many cuboids will this cube contain (including the base and apex cuboids)?
Solution:
(a)
(b)
Starting with the base cuboid [student, course, semester, instructor]
1. roll-up on course from (course_key) to major
2. roll-up on student from (student_key) to university
3. Dice on course, student with department =”CS” and university=”Big University”
4. Drill-down on student from university to student name
(c) The cube will contain 54=625 cuboids.
Q18) What is snow flaking. How doe sit affect the performance of database?
Q19) Define metadata and give reasons why it can be useful in DW?
Q20) What are the advantages of multidimensional database structure over relational data base structure for DW applications?
UNIT 3
Q1) What is data mining? Explain KDD?
Q2) Describe various data mining functionalities: characterization, discrimination, association , classification and clustering
Q3) Differentiate between OLAP and data mining?
Q4) What are the major issues in data mining?
Q5) Explain various data mining techniques and tools?
Q6) What are some major characteristics of Data mining?
Q7) Identify and explain at least 5 applications of data mining?
Q8) Differentiate between classification and clustering
Q9) What are the thee main areas of web mining?
Q10) Differentiate between KDD and data mining?
Q11) Describe the important predictive tools of data mining
Q12) differentiate between descriptive and predictive data mining?
Q13) What kind of data mining can be performed on Spatial databases?
Q14) Explain why data preprocessing is necessary before feeding data into DW?
Q15) What is data cleansing and data integration and why are they important?
Q16) Explain how evolution of database technology led to data mining?
Q17) How is data mining classified? Explain the various database systems on which data mingin can be performed?
Q18) Explain the process of integrating data mining with database.
Integration of a Data Mining System with a Database or Data Warehouse System
DataBase and DataWarehouse systems, possible integration schemes include
No coupling: No coupling means that a DM system will not utilize any function of a DB or DW system
Loose coupling: Loose coupling means that a DM system will use some facilities of a DB or DW system, fetching data from a data repository managed by these systems, performing data mining, and then storing the mining results either in a file or in a designated place in a database or data warehouse.
Semitight coupling: Semitight coupling means that besides linking a DM system to a DB/DW system, efficient implementations of a few essential data mining primitives (identified by the analysis of frequently encountered data mining functions) can be provided in the DB/DW system.
Tight coupling: Tight coupling means that a DM system is smoothly integrated into the DB/DW system.
Q19) Discuss the issues to be considered during data integration.
Q20) What is data preprocessing?
UNIT 4
Q1) What is knowledge management system?
Q2) Explain the terms ‘ Knowledge generation’,’ knowledge storage’, and ‘knowledge utilization ‘ related to knowledge management.
Q3) what are the basic steps in KMSLC and what are the activities ine ach?
Q4) What s meant by dimension of knowledge?
Q5) Differentiate between knowledge base and database?
Q6) How knowledge is discovered from data warehouse?
Q7) Discuss the various methods of knowledge representations?
Q8) Why information technology is must for the implementation of knwoeldge management in the organization?
Q9) How is CRM linked with Knowledge management?
Q10) How knowledge management is beneficial in developing business strategy. Explain?
Q11) What is knowledge capture and what tools are used?
Q12) What are the different approaches to knowledge management?
Q13) Compare and contrast information management and knowledge management?
Q14) Define the term fuzzy logic in the context of knowledge management?
Q15) Discuss the various methods of knowledge representation?
Q16) Differentiate between data , information and knowledge?
Q17) Describe the role of CKO?
Q18)What is meant by culture of sharing knowledge?
Q19) Why knowledge is known as new economy of an organization?
Q20) Explain the following:
a) Knwoware
b) Tacit and explicit knowledge
c) Components of Knowledge management
d) Technologies that support knowledge management
e) Why is it important to manage knowledge?
f) Knowledge edge
g) Importance and limitations of knowledge management
h) List down several trends that highlight the need for business to manage knowledge for competitive advantage.
i) Knowledge codification; Techniques
j) Knowledge map
No comments:
Post a Comment