Skip to content

Latest commit

 

History

History
110 lines (68 loc) · 6.81 KB

File metadata and controls

110 lines (68 loc) · 6.81 KB

ArtoPIH-Painterly-Image-Harmonization

We release the dataset and code used in the following paper:

Painterly Image Harmonization by Learning from Painterly Objects [arXiv]

Li Niu, Junyan Cao, Yan Hong, Liqing Zhang

Accepted by AAAI 2024

For painterly image harmonization, it is hard to provide pixel-wise supervision based on pairs of painterly images and photographic images. Nevertheless, we can provide style-level supervision based on pairs of painterly objects and photographic objects. Our harmonization results can better maintain the intrinsic information of foreground object and have closer style to painterly objects.

Artistic-Object-Dataset-Arto

Our Arto dataset contains 33,294 artistic/painterly objects in artistic images with accurate object masks. Each painterly object is associated with a set of similar photographic objects.

Painterly Objects

First, we use off-the-shelf object detection model to detect the objects in artistic images from WikiArt training set. We use segment anything model and manual refinement to get accurate masks of painterly objects. We obtain 33,294 painterly objects with accurate object masks. We release the artistic images in WikiArt [Baidu Cloud (access code: sc0c), OneDrive] and the painterly object masks [Baidu Cloud (access code: ait8), OneDrive]. Painterly object mask is named <painterly-image-name>_<id>.png, indicating the painterly object in the artistic image.

Similar Photographic Objects

Then, we train an object retrieval model (see the supplementary in [arXiv]) to retrieve similar photographic objects for each painterly object. Specifically, provided with one painterly object, we retrieve 100 nearest photographic objects from COCO 2014 training set, which have similar color and semantics with the given painterly object. However, the retrieved results are very noisy, so we ask annotators to manually remove those dissimilar photographic objects. Each painterly object has an average of 9.83 similar photographic objects.

We release all the photographic objects with object masks in Baidu Cloud (access code: 3ujl) or OneDrive. Note that the photographic objects are cropped from the original photographic images based on the bounding boxes. Photographic object image is named <photographic-image-name>_<id>.jpg and its object mask is named <photographic-image-name>_<id>.png. For each painterly object, we provide a similar object list named <painterly-image-name>_<id>.txt which records its similar photographic objects. Each line in the list records the information of one similar photographic object. We release the similar object lists in Baidu Cloud (access code: h9n4) or OneDrive.

Examples

An example is shown below. The painterly object in the leftmost column is outlined in yellow. The rest of columns are similar photographic objects. dataset_example

In the painterly object name Art_Nouveau_Modern/andrei-ryabushkin_moscow-street-of-xvii-century_0, Art_Nouveau_Modern/andrei-ryabushkin_moscow-street-of-xvii-century is the name of the painterly image where the painterly object belongs to, and 0 is the annotation id. The similar object list is Art_Nouveau_Modern/andrei-ryabushkin_moscow-street-of-xvii-century_0.txt. In this list, each line records the information of a similar photographic object. From left to right are: painterly object name, category id and name, similar photographic object name. In the photographic object name COCO_train2014_000000021830_481814, COCO_train2014_000000021830 is the name of the photographic image where the photographic object belongs to, and 481814 is the annotation id.

We provide more example pairs of painterly objects and photographic objects below.

dataset_example_pairs

Dataset Statistics

The distribution of painterly object categories:

1706083873335

The distribution of the number of similar photographic objects for each painterly object:

1706084867354

Prerequisites

  • Linux
  • Python 3.9
  • PyTorch 1.10
  • NVIDIA GPU + CUDA

Getting Started

Installation

  • Clone this repo:
git clone https://github.com/bcmi/ArtoPIH-Painterly-Image-Harmonization.git
  • Download the datasets and put them in the corresponding foldes in ./datasets.

  • Install PyTorch and dependencies:

    conda create -n ProPIH python=3.9
    conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge
  • Install python requirements:

pip install -r requirements.txt
  • Download pre-trained VGG19 from Baidu Cloud (access code: pc9y) or OneDrive. Put it in ./<checkpoints_dir>/pretrained.

ArtoPIH train/test

  • Train ArtoPIH:

Modify the content_dir, style_dir, and info_dir to the corresponding path of each dataset in train.sh.

cd scripts
bash train.sh

The trained model would be saved in ./<checkpoints_dir>/<name>/. If you want to load a model and resume training, add --continue_train and set the --epoch XX in train.sh. It would load the model ./<checkpoints_dir>/<name>/<epoch>_net_G.pth. For example, if the model is saved in ./AA/BB/latest_net_G.pth, the checkpoints_dir should be ../AA/, the name should be BB, and the epoch should be latest.

  • Test ArtoPIH:

Our pre-trained model is available on Baidu Cloud (access code: io7s) or OneDrive. Put it in ./<checkpoints_dir>/pretrained. We provide some test examples in ./examples.

cd scripts
bash test.sh

The output results would be saved in ./output. We provide some results below.

harmonization_results