voxel-photo-album

I got lost in the challenge of finding good classes and forgot my main goal. My goal was to quickly find the raw images I wanted to work with for production. This is not "given photos of dogs, which breed is it?" This is "here is a general photo that could be of anything; do I care about it?"

I only need minimal classes for that:

I could simply do - People vs Not people
Or I could do people, animal, plant, landscape, buildings

People means there is a person in the photo. Even if the people are small in the frame, or there is only a single person, it should be considered a people picture.

The second scheme depends more on my deciding what the intended subject of the photo is. For example, a plant with an insect centered on it would count as an insect photo, not a plant photo.

There is an open question about how to handle photos that I am just not interested in, like a picture of a box or a picture taken from a doorway that is just not interesting for development. There are no people in the picture, and it does not show any of the classes I am interested in. I think this is basically teaching the model my concept of uninteresting.

Doing it with the 16 classes led to overlapping categories and unclean labeling. This in turn led to poor model performance, but for "understandable" reasons. This was not a problem with the models but a problem with data prep.

Markus gave the suggestion - which is a really good one - to run this in two stages. First, train a model for people vs not people. Then use the not-people data to train a multi-class model.
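A minimal sketch of the two-stage idea, written as plain Python so it does not depend on any particular model library. The names `two_stage_predict`, `stage_two_training_set`, and the `Classifier` type are hypothetical and only for illustration; each "model" here is assumed to be any callable that takes an image path and returns a label string.

```python
from __future__ import annotations

from typing import Callable

# Hypothetical stand-in: any callable mapping an image path to a label string.
Classifier = Callable[[str], str]


def two_stage_predict(path: str, people_model: Classifier, subject_model: Classifier) -> str:
    """Stage 1 decides people vs not people; stage 2 only sees the rest."""
    if people_model(path) == "people":
        return "people"
    return subject_model(path)


def stage_two_training_set(labeled: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Keep only the not-people samples as training data for the multi-class model."""
    return [(path, label) for path, label in labeled if label != "people"]
```

The point of the split is that the multi-class model never has to learn what a person looks like, so its classes overlap less.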

@@TODO I need to rename the datasets, and their references in code, to make more sense. The way to rename a dataset is:

import fiftyone as fo
dataset = fo.load_dataset("foo")
dataset.name = "footwo"

Here are the general steps we want to accomplish

photos -> dataset
dataset -> dataset views: some for fitting, some for test, and some for validation
dataset -> embeddings (see https://docs.voxel51.com/model_zoo/index.html)
Show how different transformers embed
Talk about how to sample that space and what happens if you exclude an area
Look for out-of-focus images (maybe also low-contrast ones)
Now do the predictions with each embedding and see how they do
Now fine-tune at least one, if not two, and compare them both for ease and for accuracy
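The splitting step above could be sketched like this. It is a standard-library-only illustration, not the code in this repo: the function name `split_samples`, the 70/15/15 fractions, and the seed are all assumptions. It works on any list of sample ids or file paths.

```python
import random


def split_samples(sample_ids, fractions=(0.7, 0.15, 0.15), seed=51):
    """Shuffle the ids once, then cut them into train / validation / test lists."""
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")
    ids = list(sample_ids)
    # A seeded generator makes the split reproducible between runs.
    random.Random(seed).shuffle(ids)
    n_train = int(len(ids) * fractions[0])
    n_val = int(len(ids) * fractions[1])
    return {
        "train": ids[:n_train],
        "validation": ids[n_train:n_train + n_val],
        # Whatever remains goes to test, so no sample is dropped.
        "test": ids[n_train + n_val:],
    }
```

Fixing the seed matters here because the later steps compare several embeddings, and the comparison is only fair if every model sees the same split.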

Order to run things

First run 1_make_dataset.py to make the dataset in FiftyOne
Then run 3_make_embeddings.py to create all the embeddings

The classes I am going for

"boy", "girl", "man", "woman", "people", "dog", "cat", "bird", "insect", "monkey", "crustacean", "fish", "animal", "plant", "flower", "landscape", "architecture", "not an animal, plant, landscape, person, or building"
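One way to collapse this fine-grained list into the smaller schemes (people vs not people, or people / animal / plant / landscape / buildings) is a simple lookup. This is a sketch only: the grouping of each class, the helper names `to_coarse` and `to_binary`, and the catch-all label "uninteresting" are my assumptions, not something the existing scripts define.

```python
# Assumed grouping of the fine-grained classes; adjust to taste.
PEOPLE = {"boy", "girl", "man", "woman", "people"}
ANIMALS = {"dog", "cat", "bird", "insect", "monkey", "crustacean", "fish", "animal"}
PLANTS = {"plant", "flower"}


def to_coarse(label):
    """Map a fine-grained class to the five-class scheme plus a catch-all."""
    if label in PEOPLE:
        return "people"
    if label in ANIMALS:
        return "animal"
    if label in PLANTS:
        return "plant"
    if label == "landscape":
        return "landscape"
    if label == "architecture":
        return "buildings"
    # Everything else, including the explicit "not an animal, ..." class.
    return "uninteresting"


def to_binary(label):
    """Map a fine-grained class to the people vs not people scheme."""
    return "people" if label in PEOPLE else "not people"
```

For example, `to_coarse("monkey")` gives `"animal"` and `to_binary("monkey")` gives `"not people"`.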

thesteve0/voxel-photo-album
