CSCE 6260 – Advanced Topics in Pattern Recognition and Image Processing
Spring 2026
Basic information:
- Instructor: Heng Fan (heng.fan@unt.edu)
- Office: Discovery Park F284
- Office hours: Wednesday 12:30 - 2:30 pm or by appointment
- Lecture time: Wednesday 2:30 - 5:20 pm
- Classroom: NTDP F285
Course description
This is a research-oriented course that surveys the latest frontiers in computer vision, pattern recognition, multimodal learning, large models, and artificial intelligence (AI). It covers advanced approaches in AI, with a focus on recent topics such as prompt learning, multimodal vision-language learning, multimodal large language models, visual generation, and video understanding. Students are expected to understand and digest a range of advanced AI topics through extensive in-class paper presentations and discussion.
Textbooks
This course does not follow any textbook closely. However, the following textbooks will be useful:
- Deep Learning, by Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016. online version
- Computer Vision: Algorithms and Applications (the second edition), by Rick Szeliski, 2022. online version
- Dive into Deep Learning, by Aston Zhang, Zack C. Lipton, Mu Li, and Alex J. Smola, 2019. online version (A lot of examples are provided to practice deep learning.)
- Neural Networks and Deep Learning, by Michael Nielsen, 2019. online version
- Introduction to Deep Learning, by Eugene Charniak, 2019. link
In addition to the textbooks, you are strongly encouraged to read related papers.
Schedule (subject to change)
| Date | Topic |
|---|---|
| Week 1 (1/14) | Review of Basic Tasks in Computer Vision |
Grading policy
Grading will be based on the following components:
- Paper presentation: 40%
- Paper review: 40%
- In-class discussion: 20%
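
As a rough illustration of how the weights above combine, the sketch below computes a weighted final score. It assumes each component is scored on a 0-100 scale, which the syllabus does not state; the names `WEIGHTS` and `final_score` are hypothetical and used only for this example.

```python
# Component weights taken from the grading policy above.
WEIGHTS = {
    "presentation": 0.40,  # Paper presentation
    "review": 0.40,        # Paper review
    "discussion": 0.20,    # In-class discussion
}


def final_score(scores):
    """Return the weighted sum of component scores.

    Assumption: every component score is on a 0-100 scale.
    """
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing component scores: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


if __name__ == "__main__":
    example = {"presentation": 90, "review": 80, "discussion": 100}
    print(round(final_score(example), 2))
```

Because the weights sum to 1, the result stays on the same scale as the component scores.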