| Description | Instruction tuning within a retrieval-augmented generation (RAG) framework is a key step in adapting conversational AI systems to a target domain: it makes them more knowledgeable and context-aware, so they return accurate, relevant answers to user queries in information retrieval and question answering. For question answering, the pipeline works as follows. A pre-trained LLM, such as GPT-3 or LaMDA, is augmented with a retrieval module: the query is passed to the retriever, which returns a ranked set of relevant passages from a large text corpus. The LLM is then fine-tuned on the retrieved passages together with instructions that specify the desired format and style of the answer; this instruction tuning trains the LLM to minimize the loss between its predictions and the desired output. Once fine-tuned, the model answers questions by retrieving relevant passages and generating an answer conditioned on those passages and the instructions. This approach has several advantages over traditional question-answering methods. First, it leverages the knowledge and capabilities of a pre-trained LLM trained on a massive dataset of text and code. Second, it can answer questions in a variety of formats, such as summaries, explanations, and creative text. Third, it adapts easily to new domains by fine-tuning the LLM on a domain-specific dataset of retrieved passages and instructions. Your task is to develop "a better and more efficient QA algorithm" for a web-based chatbot-cum-copilot using the above concepts. |
|---|---|
| Number of students | 2 |
| Year of study | Students in their 2nd year (Semester 3), Students in their 3rd year (Semester 5), Students in their 4th/5th year (Semester 7/9) |
| CPI | None |
| Prerequisites | You must have taken course(s) such as CS 337/335 and/or CS 344 and/or CS 635 |
| Duration | From Joining date till June 2024 |
| Learning outcome | You will be able to understand and write programs on the following topics: a) retrieval-augmented generation framework b) pre-trained large language models c) instruction tuning |
| Weekly time commitment | 4 hours (mandatory) |
| General expectations | |
| Assignment | 1) https://arxiv.org/pdf/2309.14805.pdf 2) https://arxiv.org/pdf/2305.11541.pdf |
| Instructions for assignment | Students should prepare the concepts of Retrieval-Augmented Generation (RAG), instruction tuning, and pre-trained LLMs for the interview. |
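
The retrieval step described above — pass the query to a retrieval module, get back ranked passages, and assemble them with an instruction into a prompt — can be sketched as follows. This is a toy bag-of-words retriever standing in for a real dense retriever; the corpus, instruction text, and function names are illustrative, not part of any specific system:

```python
import math
import re
from collections import Counter

def embed(text):
    """Bag-of-words term-frequency vector (a stand-in for a real dense retriever)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, corpus, k=2):
    """Return the top-k passages ranked by similarity to the query."""
    q = embed(query)
    return sorted(corpus, key=lambda p: cosine(q, embed(p)), reverse=True)[:k]

def build_prompt(instruction, passages, query):
    """Assemble the instruction, retrieved context, and question into one prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"{instruction}\nContext:\n{context}\nQuestion: {query}\nAnswer:"

corpus = [
    "Instruction tuning fine-tunes a model on instruction-response pairs.",
    "Retrieval-augmented generation grounds answers in retrieved passages.",
    "The capital of France is Paris.",
]
passages = retrieve("what is retrieval augmented generation", corpus, k=1)
prompt = build_prompt("Answer concisely using only the context.", passages,
                      "What is retrieval-augmented generation?")
```

In a real RAG system the prompt would then be passed to the fine-tuned LLM for generation; here the sketch stops at prompt assembly.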
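
The description's "minimize the loss between its predictions and the desired output" step can be illustrated with a toy softmax model trained by gradient descent on cross-entropy. This is only a conceptual stand-in for LLM fine-tuning; the candidate answers, target, and learning rate are made up for illustration:

```python
import math

# Toy "model": one score per candidate answer. Instruction tuning adjusts the
# scores so the softmax probability of the desired (instruction-specified)
# answer rises, i.e. the cross-entropy loss falls.
candidates = ["paris", "london", "berlin"]
target = "paris"  # desired output given the instruction and retrieved context
scores = {c: 0.0 for c in candidates}  # untuned model: uniform scores

def softmax(scores):
    exps = {c: math.exp(s) for c, s in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

def loss(scores):
    """Cross-entropy of the target answer under the model."""
    return -math.log(softmax(scores)[target])

lr = 0.5
before = loss(scores)
for _ in range(100):  # gradient descent on the cross-entropy loss
    probs = softmax(scores)
    for c in candidates:
        # d(loss)/d(score_c) = p_c - y_c for softmax + cross-entropy
        grad = probs[c] - (1.0 if c == target else 0.0)
        scores[c] -= lr * grad
after = loss(scores)
```

After training, the loss is lower and the model concentrates its probability mass on the desired answer, which is exactly what instruction tuning does at scale over instruction-response pairs.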
Created with ❤️ by UGAC Web Team, 2023-2024