CMPUT 692-A1

Topics in Data Management with LLMs

Fall 2025

Meetings: Mon & Wed, 11:00–12:20
Instructor: Davood Rafiei, UCOMM 7-130

Large Language Models (LLMs) are increasingly applied to tasks that were once labor-intensive or difficult to automate. This course explores their emerging role in addressing core challenges in data management. We begin with foundations in databases and LLMs, then study their points of intersection—focusing on models, algorithms, and systems that enable scalable, practical solutions for managing large datasets and high-volume workloads.

Topics to be Covered (Tentative)

DBDatabase Foundations

LLMLarge Language Models

DB & LLMNatural Language Interfaces to Databases

DB & LLMData Integration with LLMs

DBScaling Retrieval

DBExample-Based Queries

Course Prerequisites

Students are expected to have a background in introductory data management and/or information retrieval (e.g., CMPUT 291 or equivalent) or be willing to learn these fundamentals. They should also have some knowledge of probability and statistics, along with demonstrated programming proficiency. Programming experience should include working with data analysis tools and libraries (e.g., JSON, CSV) and familiarity with scripting or coding for tasks such as LLM inference and basic model training.

Grading (Tentative)

Recommended Books and Resources