Questions on Information Retrieval and Extraction
#Information_Retrieval_and_Extraction
SOON✨
👇👇
#Data_Science Cohort (1)
What is the primary goal of an Information Retrieval (IR) system?
Anonymous Quiz
14%
To perform complex data analytics
26%
To manage structured databases
40%
To find unstructured material that satisfies an information need from large collections
20%
To translate natural language
The two main challenges in IR are:
Anonymous Quiz
3%
Indexing and querying
63%
Effectiveness and efficiency
27%
Storage and memory
7%
Speed and cost
A document in IR can be:
Anonymous Quiz
4%
Only a text file
78%
Any unit of information to be retrieved (text, image, video, etc.)
15%
Only a web page
4%
Only a database record
A query is best defined as:
Anonymous Quiz
8%
The output of the search engine
8%
A formal SQL statement
8%
A command to the database
77%
A free-text expression of a user's information need
The concept of relevance is measured with respect to:
Anonymous Quiz
26%
The number of keywords matched
16%
The search engine's algorithm
53%
The user's information need
5%
The popularity of the document
The relationship between NLP and IR is best described as:
Anonymous Quiz
10%
NLP is not useful for IR
15%
They are the same field
5%
IR is a subset of NLP
70%
NLP provides tools that make IR more effective; IR provides a practical application for NLP
IR is different from Database (DB) systems because IR:
Anonymous Quiz
0%
Has a formal data model
6%
Always provides exact answers
6%
Uses SQL for queries
89%
Deals with unstructured data and returns imprecise, ranked results
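To illustrate the "ranked results" idea from the question above, here is a minimal sketch. The scoring rule (sum of query-term counts), the helper names `score` and `rank`, and the three sample documents are all assumptions made up for illustration, not a specific system's method.

```python
from collections import Counter

def score(query: str, text: str) -> int:
    """Toy relevance score: total occurrences of the query terms in the text."""
    counts = Counter(text.lower().split())
    return sum(counts[term] for term in query.lower().split())

def rank(query: str, docs: dict) -> list:
    """Return ids of documents with a non-zero score, best score first."""
    scored = [(score(query, text), doc_id) for doc_id, text in docs.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # ties broken by doc id
    return [doc_id for s, doc_id in scored if s > 0]

# Hypothetical mini-collection of free text (no schema, no SQL).
docs = {
    "d1": "cheap flights to rome",
    "d2": "rome travel guide with cheap hotels and cheap food",
    "d3": "database normalization tutorial",
}

ranking = rank("cheap rome", docs)
print(ranking)
```

Unlike a database query, nothing here is "exactly" right or wrong: documents are ordered by an estimate of how well they match, and a different scoring rule would give a different order.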
The Bag-of-Words (BOW) model assumes that:
Anonymous Quiz
12%
All words have equal importance
12%
Word order is critically important for meaning
0%
Documents should be treated as strings
76%
The topic of a document is determined by the words it contains, not their order
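A small sketch of the Bag-of-Words assumption, using only the standard library. The helper name `bag_of_words` and the two sample sentences are hypothetical; tokenization is simplified to lowercasing and whitespace splitting.

```python
from collections import Counter

def bag_of_words(text: str) -> Counter:
    """Lowercase, split on whitespace, and count terms; word order is discarded."""
    return Counter(text.lower().split())

doc_a = "the dog chased the cat"
doc_b = "the cat chased the dog"

# The two sentences mean different things but contain the same words
# the same number of times, so their bags are identical.
same_bag = bag_of_words(doc_a) == bag_of_words(doc_b)
print(bag_of_words(doc_a))
print(same_bag)
```

Under this model the two documents are indistinguishable, which is exactly the trade-off the BOW assumption accepts.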
The Boolean Retrieval Model is called exact-match because it:
Anonymous Quiz
6%
Is 100% accurate
11%
Always finds all relevant documents
0%
Uses a precise ranking function
83%
Returns documents that exactly satisfy the Boolean logical condition
An example of a Boolean query is:
Anonymous Quiz
0%
cat dog
21%
cat~
5%
black cat
74%
cat AND dog NOT pet
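The Boolean query from the question above can be sketched with a tiny inverted index and Python set operations. The four sample documents and the helper names `build_index` and `postings` are assumptions for illustration, and the query is read here as (cat AND dog) AND NOT pet.

```python
def build_index(docs: dict) -> dict:
    """Map each term to the set of ids of the documents that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in set(text.lower().split()):
            index.setdefault(term, set()).add(doc_id)
    return index

# Hypothetical mini-collection.
docs = {
    1: "my cat and my dog are friends",
    2: "the cat is a quiet pet",
    3: "a dog and a cat make a good pet pair",
    4: "the dog chased the cat",
}

index = build_index(docs)

def postings(term: str) -> set:
    """Documents containing the term (empty set if the term is unseen)."""
    return index.get(term, set())

# AND is set intersection, NOT is set difference.
result = (postings("cat") & postings("dog")) - postings("pet")
print(sorted(result))
```

A document is either in the result set or not; there is no score and no ordering, which is what "exact-match" refers to.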
The main component of an IR system that matches queries to documents is the:
Anonymous Quiz
21%
User Interface
11%
Parser
58%
Search Engine
11%
Database
Which of these is NOT a typical application of IR technology?
Anonymous Quiz
5%
Recommender system
5%
Email search
86%
Relational database management system (RDBMS)
5%
Web search engine
The Cranfield Paradigm for evaluation requires:
Anonymous Quiz
0%
A live user study
7%
Only a document collection
0%
Only a set of queries
93%
A document collection, queries, and relevance judgments
A test collection in IR is reusable because:
Anonymous Quiz
7%
It only uses synthetic data
0%
It never changes
7%
It is very small
86%
It allows different systems to be compared on the same ground truth
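As a rough sketch of why a test collection is reusable, the snippet below scores two systems against the same relevance judgments. The judgments in `qrels`, the two result lists, and the helper `precision_recall` are invented for illustration; precision and recall are used as example measures.

```python
def precision_recall(retrieved, relevant):
    """Set-based precision and recall for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical relevance judgments: query id -> ids of relevant documents.
qrels = {"q1": {"d1", "d3", "d5"}}

# Hypothetical output of two different systems for the same query.
system_a = {"q1": ["d1", "d2", "d3"]}
system_b = {"q1": ["d1", "d3", "d5", "d7"]}

scores_a = precision_recall(system_a["q1"], qrels["q1"])
scores_b = precision_recall(system_b["q1"], qrels["q1"])
print("A:", scores_a)
print("B:", scores_b)
```

Because both systems are judged against the same fixed ground truth, their numbers are directly comparable, and a third system could be added later without collecting new judgments.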
System-centered evaluation focuses on:
Anonymous Quiz
0%
The aesthetics of the interface
8%
How quickly users learn the system
0%
User satisfaction surveys
92%
Measuring performance against a fixed test collection
User-centered evaluation focuses on:
Anonymous Quiz
8%
The number of documents in the collection
0%
The compression ratio of the index
8%
The speed of the indexing algorithm
85%
Involving real users to complete tasks and measuring their success/satisfaction
The unit document problem in indexing refers to:
Anonymous Quiz
0%
Removing stop words
0%
Tokenizing the text
0%
Choosing a character encoding
100%
Deciding what constitutes a single document to be indexed (e.g., a book, a chapter, a page)
IR systems are important because they:
Anonymous Quiz
7%
Are easier to build than other systems
0%
Can understand the semantic meaning of all text
0%
Are faster than database systems
93%
Help users overcome information overload by finding needed information
The initial step in any IR process is:
Anonymous Quiz
0%
Building an index
6%
Displaying the results
18%
Ranking the results
76%
Understanding the user's information need