You aren't signed in     Sign In    Help

Percolation analysis 4 (the network is important)

As defined earlier, N represents the number of contacts of Viewer, who have faved Owner's photo.
50 stands for "50 or more".
Of course this relation cannot continue linearly forever, but in this range, the linear fit is rather good.
The percentage of events in which Viewer faved Owner's photo.
Percolation analysis 4 (the network is important) by GustavoG moved to http://23hq.com/GustavoG.
Now I get to the main question I tried to address in this analysis. Given the basic pattern and the qualifications, what is the probability that the Viewer added the Owner's photo as a favorite? How does this probability depend on whether Viewer's contacts faved that photo?

First, some numbers. I observed in my sample almost 680 million instances of the basic pattern - that is, a Viewer having contacted at least one person who faved the Owner's photo. In almost 20 million cases, the Viewer has added the Owner as a contact. In slightly over 3 million cases, the Viewer has faved the image. Finally, the sample includes 961390 cases of the Viewer having both faved the image and added the Owner as a contact. I mention these numbers just to give an indication that the sample used is large.

(Incidentally, if I don't require even one contact of Viewer to have faved the image, the pattern exists some 2.47 10^10 times in my sample, including 40 million in which Viewer contacted Owner, 4 million in which Viewer faved the photo, and 1.34 million in which Viewer did both.)

Now for the interesting part about how it depends on the number of contacts that faved.

If Viewer has not added Owner as a contact (see blue graph), the probability that Viewer faved Owner's photo depends linearly with N: the more of Viewer's contacts that faved the photo, the more likely it is that Viewer will fave it as well. It is striking that this relationship is so strongly linear, but even more striking that if one extrapolates to N=0... the probability is nearly zero. In other words, if the person who posted the photo isn't in your contact list, and none of your contacts faved the photo, it is very unlikely that you faved it.

What if Viewer did add Owner as a contact? (see red graph) Again, the probability of Viewer having faved Owner's photo grows linearly with N, but it does so twice as fast. What about N=0, in this case? If none of your contacts faved the photo, but Owner is in your contact list, the probability you faved the photo is ~2.3%.

One way to account for all this would be the following scenario.
1) Imagine a Viewer that already has a number of contacts and is used to browsing the contacts' favorites. Now and then, Viewer will find a photo of interest and will fave it. Some photos will not capture Viewer's attention the first time they show up among a contact's favorites, but as they show up again and again in other contacts' favorite galleries... eventually they'll capture Viewer's attention, leading to a fave.
2) Viewer also visits the contacts' photostreams, gets exposed to their new photos, and faves some.
3) When browsing favorites, one sees both their thumbnails, and the photographer's flickr name. If the photographer is among Viewer's contacts, Viewer will recognize the name, and pay more attention to the image... and Viewer might have already seen the image in the photographer's photostream. These effects, combined, make it easier for a photo to capture Viewer's attention while browsing contact favorites, leading to more frequent faves.

Under the previous graph, I wrote about the inability to distinguish between "faving after contacting" and "contacting after faving", both of which happen in flickr. In that case, causality was hard to infer. This new graph shows that there is a strong relation between faving and one's contact network, which is not quite the same as with contacting the Owner. To clarify what I mean by this, consider these two statements:

1) "The more contacts of mine that faved an image, the more likely I am to fave it."
2) "When I fave an image, I tend to add as contacts the people that faved it."

The first statement is very intuitive and will happen if people browse their contacts' favorite galleries - and we know people do this.
The second statement is very counterintuitive. While technically possible, I don't think (based on my experience in flickr) that this is something people do.

When considering these results and scenarios like the one I described above, one has to keep in mind that Viewer is an intelligent human being, with a personality and a personal sense of aesthetics. It may sound very strange to claim that, even having no idea what Viewer's aesthetics and interests are, one can predict (in the sense of giving a probability) whether Viewer will fave a photo, solely based on the behavior of Viewer's contact network. Given enough Owners, photos, Viewers and contacts, though, one can make a statistical statement about the general trend. This is not unlike the axioms of Hari Seldon's Psychohistory: you can't predict the behavior of one person, but you can predict the behavior of many people.

In conclusion, information about photos does appear to percolate through the social network, via favorites.

(See the last slide...) 
This photo has notes. Move your mouse over the photo to see them.

Comments

view profile

Pacdog  Pro User  says:

Is the beer analysis in here also?

*looks*

It seems people seem to be more willing to comment after just one beer!
Posted 14 months ago. ( permalink )

view profile

Pacdog  Pro User  says:

*goes and sits in corner*

=o/
Posted 14 months ago. ( permalink )

view profile

kristina.lerman says:

Did you take into account how many favorites the images received? If it has hundreds of faves, correlation with the number of contacts will arise as a side effect. I encountered that problem in my first Digg paper. The solution was essentially to compute the probability that having received so many faves (or votes in Digg case), a specific number of votes came from contacts purely by chance.

Too bad Flickr does not offer an easy interface to browse Contacts' recent faves. One tool I had seen that approximates that functionality is people using Fd's Flickr Toys to create array of recent favorites. These will then appear in the Contacts list. It feels weird to see your own image in the contacts list.
Posted 14 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

This graph doesn't take that into account, but I did run a control splitting the sample according to number of favorites. I'll post that graph next. The bottom line: this is not a side effect of that.
Posted 14 months ago. ( permalink )

view profile

thomas23  Pro User  says:

...It's greek to me!!
Posted 14 months ago. ( permalink )

view profile

green_kermit  Pro User  says:

How about the time after posting to comment? I think thatwould be interesting.
Posted 14 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

How about the time after posting to comment?

I'm not sure what you mean by that. Care to clarify? :)
Posted 14 months ago. ( permalink )

view profile

mattie_shoes  Pro User  says:

Could this be explained by reasonably tight-knit flickr communities who all view each others photos and have many of the others in the community as a contact too? I'm thinking something like special interest groups. Say you're in an active group of 100 members who like macro pictures of toenails or some crap like that. Then most likely those 100 people will be cross-contact'ing and cross-fave'ing like mad. Since we can't tell the order in which the events occurred... The increasing line may simply be an indication of the quality of said toenail macro photo... Or did I miss something?

Most of the faves on my photos are not from my contacts, but my contacts are far more likely to comment or fave one of my photos than, say, joe flickr user...

--
Seen on my Flickr home page. (?)
Posted 14 months ago. ( permalink )

view profile

mattie_shoes  Pro User  says:

I don't recall what exactly is available in the API, but if it's possible to see the list of people who faved a given photo, and it's possible to see who added person X as a contact when it's not necessarily cross-contacted, then you could probably make a reasonable guess as to what photo caused person Y to add person X as a contact by exhaustively searching person Y's photostream to see which photos person X added. If they've faved a whole bunch of them, I don't suppose you'd know (unless date faved is available), but I think I have over 100 people who've added me as a contact when they faved one or two of my photos.

--
Seen on my Flickr home page. (?)
Posted 14 months ago. ( permalink )

view profile

ardiglione says:

A very interesting analysis of social behaviors. I think that your conclusions could be extended to the interpretation of the effects of advertising or political prefeferences.
Really great!
Posted 14 months ago. ( permalink )

view profile

green_kermit  Pro User  says:

Well how is the time between the posting and the comments affected by the relationships and the quality of the photo. Just looking at my own Recent activity list. So photos have an initial flurry of activity and die off (typically these are photos in groups or mall social networks). Some photos like this one I would class a discussions. Several comments by one person - typically updated every day or so and live for a couple of days maybe a week or so. Some photos achieve a critical mass and are commented on by new people - so I see photos I commented on months ago appear in the recent activity list. So I am thinking that the dispersion of comments over time could be quite important.
Posted 14 months ago. ( permalink )

view profile

robert.jurjevic says:

If Viewer has not added Owner as a contact [...] What if Viewer did add Owner as a contact? [...]
May I ask what do you exactly mean by "Viewer has not added Owner" and "Viewer did add Owner"? I am a bit confused as you marked the graph's abscissa as "contacts who faved (N)" and it would appear as those who faved are necessarily the Contacts of the Owner. Thanks.
Posted 14 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

The definition of the pattern is given in the first slide, the tests (contacted Owner or not, now many contacts) are defined in the second one. :)
Posted 14 months ago. ( permalink )

view profile

robert.jurjevic says:

Basic heterogeneous pattern:

1) A user ("Owner") posted a photo. [red]
2) Another user ("Viewer") added a third user ("Contact") to his/her social network. [green]
3) The third user ("Contact") marked the photo as a favorite. [pink]
Is the "Contact" necessarily the "Owner" or necessarily not the "Owner"?
1) I define a basic heterogeneous pattern (above).
2) I ask whether Viewer contacted Owner. [yes/no]
3) I ask how many people Viewer has contacted, each of which has faved the photo. [N]
4) I ask whether Viewer ended up faving the photo too. [yes/no]
Could you please clarify for me how the above relates to blue and red graphs?

Thanks.
Posted 14 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

Is the "Contact" necessarily the "Owner" or necessarily not the "Owner"?

By definition, "Contact" is never the same as "Owner", since "Contact" has faved "Owner"'s photo.. One cannot fave one's own photos.

Could you please clarify for me how the above relates to blue and red graphs?

Isn't that spelled out in the description of this graph? I'm not sure what part is unclear.
Posted 14 months ago. ( permalink )

view profile

kediwah  Pro User  says:

Group think/mob mentality?

Either way, I appreciate these analysis graphs and thoughts.
Posted 14 months ago. ( permalink )

view profile

robert.jurjevic says:

Basic heterogeneous pattern:

1) A user ("Owner") posted a photo. [red]
2) Another user ("Viewer") added a third user ("Contact") to his/her social network. [green]
3) The third user ("Contact") marked the photo as a favorite. [pink]
Okay, nevertheless I do not see a reason why you study this pattern when it would appear that the fact that "Contact" may have faved the photo has nothing to do with the fact that "Viewer" and "Contact" are contacts (i.e. "Contact" should be unaware of the fact that the "Viewer" viewed the photo); also it would appear that you can only know (not sure though) that "Contact" and "Viewer" are contacts (you do not know who made contact who nor you know if this happened before or after the photo has been viewed).

1) I define a basic heterogeneous pattern (above).
2) I ask whether Viewer contacted Owner. [yes/no]
3) I ask how many people Viewer has contacted, each of which has faved the photo. [N]
4) I ask whether Viewer ended up faving the photo too. [yes/no]
Okay, 2 makes some sense for me as it may be likely that "Viewer" may have contacted the "Owner" as he or she might like the photo, or he or she might find the "Owner" interesting, etc.; 3 does not make sense for me as I've explained earlier (i.e. "Contacts" should be unaware of the fact that the "Viewer" viewed the photo); 4 makes sense to me as the "Viewer" might have faved the photo if he or she liked it.
Posted 14 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

So this is not related to what I posted in this graph, but I thought you all would be interested in seeing this TechCrunch report.
Posted 14 months ago. ( permalink )

view profile

OneEighteen  Pro User  says:

I just ran across this thread. Access to these databases would seem to be a gold mine for social studies.

You probably already thought of this, but I didn't see it. It might be interesting to look for indications that some Flickr members more influential than others. Do the photos they fave tend to get faved/viewed by a lot of other people? Like a girl in school who wears a certain kind of shoes and suddenly everyone just has to have that kind of shoe? I suspect this is very true given the recent work about tipping points.

Nodes based around common preferences/related faves might be divided into two basic groups. The larger group would be centered around people whose aesthetic reflects the group aesthetic. To use a crude example, they like photos of cats and fave a lot of "interesting" cat photos that a lot of other people share an appreciation of.

The more interesting subgroup would be the nodes of Flickr members or small groups of members whose favorites reflect an avant garde sense what is "art" and what is not.

To use another example: I don't look at all my contacts faves. I like photos of maritime themes so I look at some of my contacts photos because we share an interest in photos of a nautical nature. I have other contacts whose faves I look at because I like their taste in photos. They have a mixed group of faves that I find to be striking, thought provoking images without regard to the subject. They are images that will influence my own taste in photos and my efforts in photography.

That second group and the influence the have on the entire community would be worthwhile to identify.
Posted 12 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

I did some yet-unpublished analyses suggesting who the most "central" members are (based on social network), but not of "trendsetting" as you suggest. Could be done now thanks to recent API changes.
Posted 12 months ago. ( permalink )

view profile

OneEighteen  Pro User  says:

I made you a contact so I could follow your work some more. Thanks for posting.
Posted 12 months ago. ( permalink )

view profile

cPutter says:

It makes sense to me that people that are contacts of one another will tend to fav the same photos. The reason they are contacts in the first place is because they find each others photos appealing aesthetically, implying they share a taste in style of photography they like. Thus it shouldn't be strange that they fav the same photos of other people without collaboration beforehand.

How they discover the same photos is obviously in interesting question though. Through explore? Groups? Contacts list? Sharing? Chance? or browsing each others fav lists...



--
Seen on your photo stream. (?)
Posted 12 months ago. ( permalink )

view profile

GustavoG moved to http://23hq.com/GustavoG  Pro User  says:

The reason they are contacts in the first place is because they find each others photos appealing aesthetically

That is just one of several possible reasons. :)
Posted 12 months ago. ( permalink )

Would you like to comment?

Sign up for a free account, or sign in (if you're already a member).

[?]

GustavoG moved to http://23hq.com/Gu stavoG's photostream

FlickrLand analyses, 2008 (Set)

24
items
Part of: Analyses

This photo also belongs to:

Flickr API (Pool)

Flickr Hacks (Pool)

GustavoG's public photos (Pool)

Tags

Click this icon to see all public photos and videos tagged with cgi cgi

Additional Information

All rights reserved Anyone can see this photo

Add to your map