Pytorch Image Colorizer

Project Overview

This project demonstrates how to:

Build and train a U-Net-based CNN for end-to-end image colorization.
Prepare grayscale input and RGB output pairs from STL-10 images.
Apply early stopping, loss tracking, and model checkpointing.
Run inference on new grayscale images using the trained model.

Model Architecture

The U-Net model is a fully convolutional encoder–decoder network often used for image-to-image tasks. It consists of:

Downsampling blocks with convolutional layers and max-pooling.
Upsampling blocks with transposed convolutions and skip connections.
A final output layer producing 3-channel RGB predictions.

Code Structure

pytorch-unet
├── classes/                 # U-Net model and dataset definitions
│   ├── model_unet96.py
│   └── colorization_dataset.py
├── model/                   # Saved model and training history
│   ├── colorizer_model_unet96_best.pth
│   └── colorizer_training_history_unet96.pkl
└── scripts/                # Training, inference, and plot loss scripts
    ├── train_colorize_model_unet96.py
    ├── colorize_unet96.py
    └── plot_loss.py

How to Use

Train the Model

cd scripts
python train_colorize_model_unet96.py

This script will train the U-Net model and save:

The best-performing model: colorizer_model_unet96_best.pth
The training history: colorizer_training_history_unet96.pkl

Run Inference Example

python scripts/colorize_unet96.py \
    --input example-image/tiger_grey.jpg \
    --output example-image/tiger_color.jpg \
    --model model/colorizer_model_unet96_best.pth

Visualizing Loss over Training Epochs

You can visualize training and validation loss curves using the provided utilities.

python3 scripts/plot_loss.py --history model/colorizer_training_history_unet96.pkl