Skip to content

An application that predicts post popularity from the features of a given Instagram post.

Notifications You must be signed in to change notification settings

geoffreyangus/likeness

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

likeness

An application that predicts post popularity from the features of a given Instagram post.

Notes:

12/1

  • Command used to scrape Instagram images
instagram-scraper -f ig_users.txt --retain-username --media-types image --maximum 50 --media-metadata --include-location

11/15

  • Now the dataset is cached (so it doesn't have to read in a bunch of different JSON files every time).
  • There is now a consistent train/test data split.
  • The algorithm now runs scikit linear regression model.

11/14

10/26

  • Downloaded OpenCV!
  • Working on the infrastructure to analyze images in bulk.
  • Possible cause of skew: new pictures (< 1 week old). New pictures may not be fully saturated. Their like counts could still be in flux.
  • Baseline algorithm is implemented as follows:
    • Input: a user's profile and a photo.
    • A mean is calculated across the like counts of the 50 most recent photos.
    • A coefficient is calculated based on the features of the photo. For the baseline, this coefficient is either 1.1 (if face is detected) or 0.9 (if face is not detected).
    • Output: coefficient multiplied by the mean.
  • Baseline algorithm has 46.60% average error rate. This means that on average, the algorithm overestimated or underestimated the like count for a given image of a user by 46.60%. The human error rate was ~30%, so there is a large amount of avoidable bias to overcome.
  • Possible ideas:
    • Belly button detection
    • Proportion of photo that is face
    • If user's face is in photo

10/25

  • Scraped the most recent ~50 photos of the top 40 instagram celebrities using instagram-scraper. Ignored @taylorswift because she only had 25 photos. Sad.
  • We say we have approximately 50 photos because we are excluding videos.

About

An application that predicts post popularity from the features of a given Instagram post.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages