Fellowship Week 8: Collection and Classification of Social Media

While I’ve been working on finishing the final data visualizations and interface for the August show at Gray Area, I’ve also been trying to figure out how to find media relevant to the neighborhood culture and the urban environment.

Classification and Relevance Score

I’m doing a bag-of-words analysis and algorithmically scoring each photo and tweet on how relevant or interesting it is to one of the three theme maps: where people are going, what people are doing, and how people are feeling.
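
As a rough illustration of the bag-of-words idea, here is a minimal sketch that counts the words in a post and tallies how many fall into a keyword set for each theme map. The keyword sets and the function names are assumptions made up for this example, not the lists or code used in the project.

```python
import re
from collections import Counter

# Hypothetical keyword sets, one per theme map (the real lists are not given here).
THEME_KEYWORDS = {
    "going": {"bus", "bart", "park", "home"},
    "doing": {"eating", "dancing", "yoga", "surfing"},
    "feeling": {"love", "happy", "sad", "tired"},
}

def bag_of_words(text):
    """Lowercase the text and count each word, ignoring word order."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def theme_scores(text):
    """Count how many keyword occurrences from each theme appear in the text."""
    bag = bag_of_words(text)
    return {
        theme: sum(bag[word] for word in words)
        for theme, words in THEME_KEYWORDS.items()
    }

print(theme_scores("Eating tacos at the park, then yoga"))
```

A post can score on more than one theme at once, which is useful when the same photo or tweet belongs on several maps.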

The algorithm scores each piece of media based upon the following attributes:

  • Number of likes
  • Number of relevant keywords, e.g. airbnb, drought, bus
  • Number of retweets or comments
  • Strongly positive or negative sentiment
  • Occurrence of parts of speech, such as a gerund representing activity, e.g. eating
  • Twitter Relevance score
  • Objects detected in the photo by OpenCV
  • Nearness to other relevant media
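
One simple way to combine attributes like these is a weighted sum. The sketch below assumes each attribute has already been measured and stored on a record; the field names, the weights, and the log damping of likes and shares are all hypothetical choices for illustration, not the actual scoring used here.

```python
import math
from dataclasses import dataclass

@dataclass
class Media:
    """One photo or tweet; all field names are hypothetical."""
    likes: int = 0
    shares: int = 0               # retweets or comments
    keyword_hits: int = 0         # matches against a relevance keyword list
    sentiment: float = 0.0        # assumed range: -1.0 (negative) to 1.0 (positive)
    gerunds: int = 0              # "-ing" activity words such as "eating"
    twitter_relevance: float = 0.0
    objects_detected: int = 0     # e.g. objects found by OpenCV
    nearby_relevant: int = 0      # other relevant media posted close by

# Made-up weights; in practice these would be tuned by hand or learned.
WEIGHTS = {
    "likes": 1.0, "shares": 1.0, "keyword_hits": 2.0, "sentiment": 1.5,
    "gerunds": 1.0, "twitter_relevance": 1.0,
    "objects_detected": 0.5, "nearby_relevant": 0.5,
}

def relevance_score(m: Media) -> float:
    """Weighted sum of the attributes of one piece of media."""
    features = {
        "likes": math.log1p(m.likes),        # dampen very popular posts
        "shares": math.log1p(m.shares),
        "keyword_hits": m.keyword_hits,
        "sentiment": abs(m.sentiment),       # strong feeling counts in either direction
        "gerunds": m.gerunds,
        "twitter_relevance": m.twitter_relevance,
        "objects_detected": m.objects_detected,
        "nearby_relevant": m.nearby_relevant,
    }
    return sum(WEIGHTS[name] * value for name, value in features.items())

print(relevance_score(Media(likes=12, keyword_hits=2, sentiment=-0.8)))
```

Taking the absolute value of the sentiment means a strongly negative post ranks as high as a strongly positive one, which matches the idea that either extreme is interesting.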

I’d also like to include color in this analysis, perhaps by considering color in relation to sentiment. I also want to explore ways to find less prominent voices and media in other languages.

Examples:

Here are some examples of media that score high in my algorithm and also seem interesting:

  • Pamela Ocampo on Twitter: “Mexican polka and a screeching ‘ay yi yi’ blaring out of an open window. Yup, I’m home. The Mission https://t.co/CW5yizJLOF”
  • Current Call and Jon Holm on Instagram: “Yes, the rent in SF is practically the most expensive in the world, but at least my place comes with its own yoga studio.”
  • Addie Hadden on Twitter: “When you come home and see your parking spot is taken http://t.co/LVoFCIuXce”
  • Kevin Atkinson on Instagram: “Just another ‘San Francisco has no summer’ day #NoFilter #SummerInSF #SF #DoloresPark”

Emojis

I wrote some code to capture emoji characters in tweets. These were the most used emoji in the Mission District last week (ordered by usage):

😀 💙 😎 😭 😊 👓 😩 😀 🌊 🎉 👌 💕 🌞

🌲 🌴 📷 😉 🏄 🙏

When the sentiment of the post was negative, these were popular:

😭 💩 😩 😾 😔 😐

And here are the emoji that were popular when the post was positive:

💙 😍 💜 🎉 🌉 😉 😌
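
Emoji counts like the ones above could be produced along these lines. This is a minimal sketch, assuming each post arrives as a (text, sentiment) pair with sentiment between -1 and 1. The regular expression covers only the main emoji code point blocks, so flags and multi-character sequences (skin tones, joined emoji) are not handled properly. All names are hypothetical.

```python
import re
from collections import Counter

# Rough match for single emoji code points: the main pictograph blocks
# plus the older "Miscellaneous Symbols" and "Dingbats" ranges.
EMOJI_PATTERN = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")

def count_emoji(posts):
    """Count emoji overall, in positive posts, and in negative posts.

    posts: iterable of (text, sentiment) pairs; sentiment scale is assumed.
    """
    overall, positive, negative = Counter(), Counter(), Counter()
    for text, sentiment in posts:
        found = EMOJI_PATTERN.findall(text)
        overall.update(found)
        if sentiment > 0:
            positive.update(found)
        elif sentiment < 0:
            negative.update(found)
    return overall, positive, negative

posts = [
    ("Dolores Park 😎🌞", 0.7),
    ("missed the bus again 😭", -0.6),
    ("beach day 😎", 0.4),
]
overall, positive, negative = count_emoji(posts)
print(overall.most_common(3))
```

Calling `most_common()` on any of the three counters gives the emoji ordered by usage.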

Friction

Of particular interest to this analysis are words and sentiments that cut across the theme maps and different topics. Here are some examples of words that match this criterion:

  • Cheap
  • Clean
  • Comfort
  • Crowded
  • Dirty
  • Exclusive
  • Legal
  • Luxury
  • Natural
  • Night
  • Paradise
  • Perfect
  • Rich
  • Slow
  • Summer
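
One way to surface words like these is to look for vocabulary that shows up in posts from more than one theme map. The sketch below assumes each post has already been tagged with a theme. The function name, the stopword list, and the sample posts are made up for illustration.

```python
import re
from collections import defaultdict

# Tiny hypothetical stopword list; a real one would be much longer.
STOPWORDS = {"the", "a", "was", "so", "and", "at", "is"}

def friction_words(tagged_posts, min_themes=2):
    """Return words that appear under at least `min_themes` different themes.

    tagged_posts: iterable of (theme, text) pairs.
    """
    themes_by_word = defaultdict(set)
    for theme, text in tagged_posts:
        for word in set(re.findall(r"[a-z']+", text.lower())):
            if word not in STOPWORDS:
                themes_by_word[word].add(theme)
    return sorted(
        word for word, themes in themes_by_word.items()
        if len(themes) >= min_themes
    )

posts = [
    ("going", "The bus was crowded"),
    ("feeling", "So crowded, so tired"),
    ("doing", "Eating cheap tacos"),
]
print(friction_words(posts))
```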