Skip to content

Latest commit

 

History

History
198 lines (147 loc) · 10.3 KB

README.md

File metadata and controls

198 lines (147 loc) · 10.3 KB

iPhone App Icons

Repo for playing around with App Store app icons. Current scripts in repo:

Output producing scripts

Feature generation scripts (output in features_output dir)

  • create_bovw_features.py: use GFTT and RootSIFT to generate keypoint features for each icon, cluster the keypoints into a visual vocabularly, and apply vocab to the icons to create bag of visual word features
  • create_color_features.py: use color histograms to generate color features for each icon
  • create_imagenet_features.py: use pretrained resnet50 model from keras with imagenet weights to generate features for each icon

Dominant color examples

The image at the top of the readme shows apps sorted by their dominant color. A good guess of how to find an image's dominant color might be to take the average of all the pixels' colors in the image. It turns out that this isn't very representative of an image consistenting of more than one hue. To find an image's dominant color we can perform k means to cluster the pixel colors and then take the centroid value of the largest cluster. This technique can be extened to create a color palette with n colors from an image; to do this we just take n of our k centroids.

Note: The right value of k will hinge on the images you're working with, for app icons a setting k = 3 seems to perform well.

Additional note on color: Images are typically stored in the RGB colorspace, but the HSV colorspace relates more to how we perceive color. Because of this difference, all the dominant color calculations in this repo have been done in the HSV color space.


k = 1 (average color)

k = 3


k = 1 (average color)

k = 2

Who cares?

One possible application of this dominant color extraction is for use in plots. A user who is familiar with a popular app will recognize its color in a plot and this could help make the user experience simpler. An example plot with fake data shows this concept below.

The plot avoids the issue of apps having similiar/identical dominant colors. A strategy of using secondary colors and/or adjusting colors to be disimilar could likely a strategy to deal with these issues when they come up.

Note the plot data is just a random walk, it doesn't actually relate to any app metric (on purpose)


Search output highlights

The main query image to be used in search engine examples will be the 'Toca Life: City' icon. There were a lot of apps by Toca Boca AB in the top charts when the icons were scraped, and they all have similar artwork. An ideal search engine would return all Toca Boca apps first when we search for icons similar to 'Toca Life: City'.

Using only keypoint features

The search using only bag of visual word (BOVW) features performs fairly well, we found 2 of the other Toca Boca apps in the top 5 results. The introduction of color features would likely weed out the 'Yu-Gi-Oh! Duel Links' result.

Notice that we also return the query image itself with a dist of 0; this is an uninteresting result, but it reassures us that our distance metric is doing what we mean it to.

Using only color features

The search using only color features performs arguably better than the BOVW only search. Again we found 2 of the other Toca Boca apps, but this time they are the first 2 icons proposed. It is of note that 'Toca Life: Hospital' stayed in our top results; this icon is so similar to our query icon that it is returned no matter which feature set we used. Despite keeping 'Toca Life: Hospital', we lost our other Toca Boca result when we dropped the BOVW features.

Using keypoint and color features

The search using both color and BOVW features performs as well as we could have hoped. All of our top 5 results are apps from the Toca Boca family.

Candy Crush is another good group of apps to test our engine out on. Searching with Candy Crush Jelly Saga shows another great result.

Note: other icon queries produce interesting results using this engine, but these examples were shown since they're rare cases where they have arguably 'right' results due to them being in such popular 'families' of apps at the time the icons were scraped.

    

Using deeplearned features (late addition)

An additional set of features was introduced to evaluate in this search setting. Features were generated using the keras resnet50 model that comes with pretrained weights produced with imagenet. These features did not perform as well as BOVW + color features on the Toca Boca test. However, these features performed exceptionally well in some cases as seen below the Toca Boca result.

There were attempts to use these deeplearned features with the BOVW + color features, but due to the high number of resnet features produced (2048) they dominate the search. Dimension reduction strategies (TSNE & PCA) were looked into to fix this issue, but the reduction in dimensions resulted in a huge loss in perceived performance.

    


Cluster output highlights

Note these results are from various levels of k when applying k means to the icons. The clustering was just for exploration as opposed to optimizing some level of k that yielded the best results.

Using only keypoint features

Below are 2 results from clustering using only the keypoint features (while ignoring color features). You can see the focus on similar shapes/patterns that appear in the icons. On the left we can see the Tidal & Dropbox logo designs focus on repeating diamond patterns. On the left we can see similiarities in the sharp corners, a human-ish body, and the 'stars' in the top of the icons.

    

Using only color features

Below are 2 results from clustering using only the color features (while ignoring keypoint features). Interestingly, the icons in the cluster on the left show a very similar art style (for the most part) in addition to their similar colors that caused them to be grouped together.

    

Using keypoint and color features

Below are 2 results from clustering using both BOVW and color features. The result on the right is especially interesting since our features were able to match two apps showing the word 'SLOTS' without the use of OCR. The result on the left seems to be driven mostly by color features.