Introduction to Computer Vision for Image Understanding

Instructor: Dr. Nimol Thuon
Institution: University of Science and Technology of China
Semester: Fall 2024

Course Outline

Level: Foundation / Undergraduate
Duration: 10 weeks
Delivery: Lectures, Labs, Assignments, Capstone Project
Tools: Python, OpenCV, PyTorch or TensorFlow

Course Objectives

Weekly Module Outline

Week 1: Introduction to Computer Vision

📄 Week Page | 💻 Code & Resources

What is Computer Vision?
Image Understanding vs. Image Processing
Applications and Real-World Impact
History and Evolution of the Field

Assignment: Research report on vision applications

Week 2: Image Formation and Representation

📄 Week Page | 💻 Code & Resources

Digital Images and Color Spaces (RGB, HSV, Grayscale)
Coordinate Systems and Image Resolution
Image Acquisition and Sensors
Image File Formats (JPEG, PNG, TIFF)

Lab: Load, display, and convert images using OpenCV

Week 3: Image Processing Fundamentals

📄 Week Page | 💻 Code & Resources

Brightness, Thresholding
Smoothing and Sharpening
Edge Detection: Sobel, Prewitt, Canny
Histogram Equalization and Contrast Adjustment

Lab: Apply basic filters and detect edges

Week 4: Feature Extraction and Matching

📄 Week Page | 💻 Code & Resources

Interest Point Detection: Harris, FAST
Descriptors: SIFT, SURF, ORB
Feature Matching Techniques
Applications: Object Tracking, Panorama Stitching

Lab: Feature matching between two images

Week 5: Geometric Vision and Camera Models

📄 Week Page | 💻 Code & Resources

Pinhole Camera Model
Intrinsic and Extrinsic Parameters
Homographies and Epipolar Geometry
Stereo Vision and Depth Estimation

Assignment: Camera calibration with OpenCV

Week 6: Classical Machine Learning for Vision

📄 Week Page | 💻 Code & Resources

Supervised vs. Unsupervised Learning
KNN, SVM, and Decision Trees
Feature Engineering
Evaluation Metrics

Lab: Image classification using Scikit-learn

Week 7: Deep Learning for Image Understanding

📄 Week Page | 💻 Code & Resources

Neural Networks and CNNs
LeNet, AlexNet, VGG
Transfer Learning
PyTorch or TensorFlow

Lab: Train a CNN on CIFAR-10 or MNIST

Week 8: Object Detection

📄 Week Page | 💻 Code & Resources

R-CNN, Fast R-CNN, YOLO, SSD
Anchor Boxes, IoU, NMS
Pre-trained Models

Lab: Detect objects using YOLOv5

Week 9: Image Segmentation

📄 Week Page | 💻 Code & Resources

Semantic vs. Instance Segmentation
FCN, U-Net, DeepLab, Mask R-CNN
Labeling Tools and Datasets

Lab: Semantic segmentation on sample dataset

Week 10: Applications and Ethics

📄 Week Page | 💻 Code & Resources

OCR and Document Understanding
Facial Recognition and Bias
Medical Imaging and Privacy
Ethical Implications in Vision AI

Assignment: Write a position paper on ethics in computer vision

Week 11–12: Capstone Project

📄 Week Page | 💻 Code & Resources

Define a small project
Implement, evaluate, and present results

Deliverables: Code, Report, Presentation

6178101: Introduction to Computer Vision for Image Understanding

Course Outline

Course Objectives

Weekly Module Outline

Week 1: Introduction to Computer Vision

Week 2: Image Formation and Representation

Week 3: Image Processing Fundamentals

Week 4: Feature Extraction and Matching

Week 5: Geometric Vision and Camera Models

Week 6: Classical Machine Learning for Vision

Week 7: Deep Learning for Image Understanding

Week 8: Object Detection

Week 9: Image Segmentation

Week 10: Applications and Ethics

Week 11–12: Capstone Project

Evaluation Breakdown