Instructor: Prof. Jung-Hsien Chiang (蔣榮先)
Course: Tue 2-4; Office Hour: Mon 3-4
Course Goals
To understand and be able to apply information retrieval technology in automated biomedical literature search. Students will take part in intensive programming projects and strengthen their skills by researching various search technologies. The course also includes a paper presentation and a final project. Students who enroll are expected to complete all course requirements.
Course Outline
- Introduction
- Basic IR Modeling
- Retrieval Evaluation
- Document Processing
- Word Embedding
- Query Operation
- Indexing and Search
- Searching the Web
- Social Text Analytics/Image Retrieval
- Bioinformatics Information Extraction
Textbook
Manning, Raghavan, and Schütze, Introduction to Information Retrieval, Cambridge University Press, 2009
References
Stefan Büttcher, Charles L. A. Clarke, and Gordon V. Cormack, Information Retrieval: Implementing and Evaluating Search Engines, MIT Press, 2010
Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval, Addison-Wesley Inc., 1999
Grading Policy
Computer Assignment* 40%
Presentation** 30%
Final Course Project 30%
Related Material
Syllabus
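Students who decide to take this course should note the following:
During the class session on Tuesday, 9/24, from 6 to 9 p.m., you will need to demonstrate your assignment. The requirements are listed below; a test dataset is available under Related Material.
Build a search engine UI that includes the following features:
1. Text statistics:
Compute the number of keywords, characters (including spaces), characters (excluding spaces), words, sentences, non-ASCII characters, and non-ASCII words.
2. XML upload:
The UI must support uploading XML data, so that users can upload an XML file directly and search it.
3. Search bar:
Provide a search bar where users can enter the keywords or phrases they want to search for.
4. Search options:
Offer multiple search options that let users set the search scope or filter conditions.
5. Viewing stored data:
Users can view saved search results or uploaded data in the interface.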
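As a starting point for features 1 and 2, the counting step could be sketched as below. This is only a minimal sketch under stated assumptions, not the required solution. The function names `text_statistics` and `xml_to_text` are hypothetical. A "word" is taken to be a whitespace-delimited token, a "sentence" is a non-empty chunk ending in '.', '!' or '?', a "non-ASCII word" is a word with at least one non-ASCII character, and the "keywords" count is taken to mean case-insensitive occurrences of the search term. The assignment does not define these, so adjust them as needed. The XML helper ignores tag names because the schema of the test dataset is not described here.

```python
import re
import xml.etree.ElementTree as ET


def text_statistics(text, keyword=None):
    """Return basic counts for `text`; every definition here is an assumption."""
    words = text.split()  # a "word" is a whitespace-delimited token
    # A "sentence" is a non-empty chunk ending at '.', '!' or '?' (naive rule).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    stats = {
        "chars_with_spaces": len(text),
        "chars_without_spaces": sum(1 for c in text if not c.isspace()),
        "words": len(words),
        "sentences": len(sentences),
        "non_ascii_chars": sum(1 for c in text if ord(c) > 127),
        "non_ascii_words": sum(1 for w in words if not w.isascii()),
    }
    if keyword:
        # Case-insensitive, non-overlapping occurrences of the keyword.
        stats["keyword_hits"] = len(
            re.findall(re.escape(keyword), text, flags=re.IGNORECASE)
        )
    return stats


def xml_to_text(xml_string):
    """Concatenate all text nodes of an XML document, ignoring tag names."""
    root = ET.fromstring(xml_string)
    return " ".join(t.strip() for t in root.itertext() if t.strip())


# Hypothetical document; the element names are made up for illustration.
sample = (
    "<doc><title>Caf\u00e9 study</title>"
    "<body>Gene p53 matters. Is p53 na\u00efve? Yes!</body></doc>"
)
print(text_statistics(xml_to_text(sample), keyword="P53"))
```

In a real UI the uploaded file's contents would be passed to `xml_to_text`, and the resulting dictionary rendered next to the search results. A tokenizer that is aware of punctuation and abbreviations would give more reliable word and sentence counts than these naive rules.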
Lecture Note
Supplementary
Homework