Artificial Intelligence Information Retrieval(P76I900)

Instructor: Prof. Jung-Hsien Chiang (蔣榮先)

Course: Tue A-C; Office Hour: Mon 3-4

Course Goals

To understand and be able to apply information retrieval technology in automated biomedical literature search. Students will participate in intensive computer programming projects and will enhance their skills via research to various search technologies. This course also includes paper presentation and final project as well. Students will be expected to complete all course requirements upon their participation.

Course Outline
  • Introduction
  • Basic IR Modeling
  • Retrieval Evaluation
  • Document Processing
  • Word Embedding
  • Query Operation
  • Indexing and Search
  • Searching the Webs
  • Social text Analytics/Image Retrieval
  • BERT/ChatGPT Challenge
Textbook

Manning, Raghavan, and Schutze, Introduction to Information Retrieval, Cambridge University Press, 2009

References

Information Retrieval: Implementing and Evaluating Search Engines (MIT Press), 2010

Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval, Addison-Wesley Inc., 1999

Grading Policy

Computer Assignment*                    40%

Presentation**                                   30%

Final Course Project                         30%

Related Material

資料集1

資料集2

資料集3

Syllabus

若是決定要修這門課的同學,請留意以下事項
9/24(二)下午6-9點的課程中,需要進行作業展示,內容如下,Related Material有測試用資料集

製作一個搜尋引擎UI介面,包含以下功能:

1. 字數統計:
可計算 keywords 數量、characters (including spaces)、characters (excluding spaces)、words、sentences、non-ASCII characters、non-ASCII words。

2. 上傳XML資料:
UI介面需具備上傳XML資料的功能,使用者能夠直接將XML檔案上傳並進行搜尋。

3. 搜尋欄:
提供一個搜尋欄,使用者可輸入欲搜尋的關鍵字或短語。

4. 搜尋選項:
應具備多種搜尋選項,允許使用者設定搜尋範圍或過濾條件。

5. 查看儲存資料:
使用者可以在介面中查看已儲存的搜尋結果或已上傳的資料。

Lecture Note

lecture-1-introduction

lecture-2-applications

lecture-3-dictionary

lecture-4-tfidf

Supplementary

Precision-recall

Homework

homework-1-24

Homework-2-24