How can we get computers to understand and generate human language? This is among the most challenging—and currently, the most quickly advancing—approaches in contemporary artificial intelligence. Natural language systems are deployed in the world in increasingly many forms: chatbots, code assistants, web agents, among others. This course provides an introduction to the engineering and science that underlies current NLP systems.
Prerequisites
Highly recommended prerequisites: Not required, but it will be very useful to have taken a machine learning course before taking this one. Check out these resources to help get you acquainted with the basics:
Learning objectives
Students will:
We will be loosely following this book throughout the course: Jurafsky & Manning (J&M), available online for free. You may find it helpful to do these readings to prepare questions for lecture or to review the content in more depth after lecture. Content in the J&M book may appear on the exam; however, you will not be expected to know the topics in the textbook that are not covered in lecture.
I will also provide links to optional readings and resources related to the content we cover in class. These can be found in the schedule below. These are to supplement the course material for those interested in reading further.
Note: we will almost definitely alter this schedule! Order may also change depending on the availability of guests.
| Date | Topic | Homework | Readings |
|---|---|---|---|
| Jan 20, 2026 | Course introduction
|
Prereq review released (ungraded) Prereq review solutions |
No readings. |
| Jan 22, 2026 | Text classification
|
HW0 released [Code and data] |
|
| Jan 27, 2026 |
Introduction to language modeling
Tokenization
|
|
|
| Jan 29, 2026 | Sequence modeling
|
|
|
| Feb 2, 2026 | Homework 0 help session: Text classification | ||
| Feb 3, 2026 | Neural sequence modeling I
|
HW0 due HW1 released [Code and data] |
|
| Feb 5, 2026 | Neural sequence modeling II
This lecture is pre-recorded. You can watch it [here]. [Slides] |
|
|
| Feb 10, 2026 | Attention
|
|
|
| Feb 12, 2026 | Large language models I
|
HW2 released |
|
| Feb 17, 2026 | Homework 1 help session: Language modeling Note: This is a Monday schedule! |
||
| Feb 19, 2026 | Large language models II
|
HW1 due |
|
| Feb 24, 2026 | Language model use and adaptation
|
|
|
| Feb 26, 2026 |
Machine translation
|
|
|
| Mar 2, 2026 | Homework 2 help session: Transformers | ||
| Mar 3, 2026 | Post-training
|
Final project proposal released |
|
| Mar 5, 2026 | Morphology and syntax I
|
HW2 due HW3 released |
|
| Mar 10, 2026 |
Spring break - no class |
||
| Mar 12, 2026 |
Spring break - no class |
||
| Mar 16, 2026 | Homework 3 help session: Syntax and parsing | ||
| Mar 17, 2026 |
Morphology and syntax II
|
|
|
| Mar 19, 2026 |
Morphology and syntax III
|
HW3 due Friday, Mar. 20 |
|
| Mar 24, 2026 | Exam | ||
| Mar 26, 2026 |
Semantics
|
|
|
| Mar 31, 2026 |
Review of exam
Discourse and pragmatics
|
Final project proposal due |
|
| Apr 2, 2026 |
NLP applications and benchmarking
|
|
|
| Apr 7, 2026 |
Interpretability and evaluation I
|
|
|
| Apr 9, 2026 |
Interpretability and evaluation II
|
|
|
| Apr 14, 2026 | Retrieval and tool use |
|
|
| Apr 16, 2026 | LLM bias, safety, and fairness | Midway report due |
|
| Apr 21, 2026 | Multimodal NLP
|
|
|
| Apr 23, 2026 | Human language processing
|
|
|
| Apr 28, 2026 | Guest lecture | ||
| Apr 30, 2026 | Final project help session | ||
| TBD | Final report due |
By the end of this course, you should be familiar with each of the following topics. Items with an asterisk* may be on the exam.
The course is graded out of 100 total points.
The homeworks are largely for your benefit as study tools. You may use AI in any way you wish to complete the homeworks, but you will find studying for the exam much easier if you understand the methods you'll be implementing in the homeworks. Regardless of whether you decide to use AI tools, you (the student) are fully responsible for what you submit.
There will be one exam about 2/3 of the way through the course. See the list of topics above for a guide to what the exam will cover. You may not use any electronic resources for the exam; this includes the textbook, AI tools, the internet, text messages, among other items.
This will be an open-note exam! If you bring notes, they must be on one physical piece of paper (front and back).
This is an open-ended project where you will review and pursue an NLP topic of your choosing.
Perform a literature review on your topic of choice. Also briefly describe your planned project, including the task you'll be focusing on, your data, methods, baselines, and evaluation.
Report your progress on the final project thus far. By this point, you should have run some experiments and obtained some preliminary results. Outline your plan for the rest of the project.
Describe your experiments, present your results, and report your findings in the style of a typical NLP paper.
Can we publish our final project? It is feasible to convert a course project into an academic publication, but it can take a lot of work! I encourage those interested to discuss this with me at the end of the semester.
AI tools are completely allowed for the homeworks. I recommend doing the assignments on your own as exam preparation, but for the purpose of grading, you can complete the assignments completely with AI if you so choose. It is the student's responsibility to verify any submitted content.
AI tools are allowed for the final project. Our policy here is more nuanced: you may use AI as a tool, but do not use AI as a crutch or replacement for thinking. What's the difference? AI as a tool includes:No AI tools are allowed during exams. These will be hand-written in class.
I strongly encourage you to use any outside source at your disposal when doing the homework and your final project. Your reports and code should be original, but you may take inspiration from existing papers as long as you give them proper credit. When doing your project, feel free to base your implementations on publicly available code as well (as long as you make significant modifications to accommodate your original idea), but be sure to give proper credit in your report and your GitHub README if you do so.
For the final project, failing to properly cite an outside source is equivalent to taking credit for ideas that are not your own, which is plagiarism.
Read through BU's Academic Conduct Code. All students are expected to abide by these guidelines. In the context of this class, it's particularly important that you cite the source of your ideas, facts, and/or methods, and do not claim someone else's work as your own.
You are allowed to work in groups to do the homeworks, but you should upload your homework individually.
Attendance will not be taken. Attend lectures as you wish.
You have 6 free late days that you can use however you wish with no excuse necessary. Using a late day means that you can still receive full credit for the assignment with no late penalty. Turning in an assignment late after using all your late days will incur a 10% drop in the score for each late day. 5 days after an assignment's due date, the assignment can no longer be turned in, regardless of whether you use your free late days. This applies to all homeworks. It also applies to the proposal and midway report for the final project—but not the final report, which must be turned in on time. Note that a late day is a step function: turning in a homework 5 minutes late is equivalent to turning it in 23 hours late, so if you know you'll be late, we recommend taking the extra time to verify your understanding of the material.
Extensions can be negotiated in cases of medical emergency or other sudden pressing circumstances. Students should contact the course staff ASAP and negotiate
Exams cannot easily be made up. If you know you cannot make an exam day,
Boston University's policy is to provide reasonable accommodations to students with qualifying disabilities who are enrolled in Boston University courses. Students seeking accommodations must engage in an interactive process with, and provide appropriate documentation of their disability to, Disability & Access Services (DAS). If this applies, please get in touch with me as soon as possible to discuss accommodations; note that students are not required to disclose information regarding their disability, if applicable, but should request approval for such accommodations through DAS beforehand.
Students are permitted to be absent from class, including classes involving examinations, labs, excursions, and other special events, for purposes of religious observance. In-class, take-home and lab assignments, and other work shall be made up in consultation with the student's instructors. More details on BU's religious observance policy are available here.
Much of the content in this course was inspired by NLP courses taught by Greg Durrett, John Hewitt, and Jason Eisner. Please do check out their syllabi (available online) if you're interested in getting a new perspective on many of these topics!