Update: Added 4th assumption, Saturday, June 27
As promised, 48 hours ago I downloaded several Eclipse products across all five published platforms using Bittorrent. Now that two days have passed, and my downloads have been made available for others to upload, I thought I'd consolidate upload bittorrent upload data for the sake of gauging popularity.
Some notes on my bittorrent process:
- I downloaded the torrent files at about 10:30pm, two nights ago. I collected the data at about 11:30pm tonight. I'm calling it 48 hours. Anyone who cares enough, call the data police.
- I did nothing to cap bandwidth for any of these Eclipse distributions.
I collected data on both megabytes uploaded from my bittorrent client, as well as ratio of upload to download. Since all copies of my products were fully downloaded, an upload ratio of 2.5 means that virtually 2.5 copies of that distribution were uploaded from my machine. Of course bittorrent doesn't ship data in full files, it's just pieces of the distrubutions, here and there.
These are my three primary, and therefore, potentially disputable, assumptions:
- More popular products will be uploaded by more people.
- Upload ratios are a better measurement of popularity than megabytes uploaded. If product X is 50% larger than the size of product Y, equal bandwidth dedicated X and Y do not denote equal popularity. I just think people using bittorrent aren't really worried about the size of their Eclipse.
- Similarly, I consider negligible any difference in compression ratios between the win32 zip file format and other gzipped tar files.
- 48 hours of data collection is more than enough time to collect data, and taking more than the first 48-hours of data will not yield significantly different results.
- I presume that files that are not as well seeded as others will take more time to initially download, and as such, will not contribute much to the other uploads during the first part of this process, and so may exaggerate the results slightly. Given that, I suspect the correctness of the 48-hour window will be the most disputed assumption.
- Update: fourth assumption: People don't care about how long it takes, if they're using bittorrent. I assume it's a "set and forget" type of tool.
In retrospect, this turned out some really cool data. I'm very sorry now that I did not download
all distributions. This analysis suddenly suffers from its absence.
Let's start with downloads by product:
Here, the clear winner is the JEE distribution. Modeling, which is heavily discussed on the modeling blog and had a crazy number of talks at EclipseCon, has just under 25% the popularity of JEE. I haven't used
WTP in a while but I hope, if it's this popular, that the developer docs reflect the popularity. (Please?)
Next we move to downloads by operating system:
Wow, Windows, huh? That's a surprise, and also, not really a surprise. I'd love to see how these numbers compare next year. Will we see an complete inversion of Linux 32 and Linux 64 in a year? Two years? I predict four years.
Let's look at all the download ratio data without grouping by products or operating system.
This chart really highlights both the JEE and Win32 popularity. I'm pleased to see the CDT platform is well used by the Linux32 community. I wonder why, on the Classic platform, OSX Carbon is uploaded slightly more than OSX Cocoa? Are variances on that scale negligible?
I hear the Linux community say "Rob,
come on. We just shut down the spin machine from that ludicrous
browser comparison. How about something that reflects reality?" Reality, whatever. You punks are lucky drawing charts is fun, so I'll do one more for ya. Here's the same data, with OSX and Linux products as single pieces of data.
Seriously, this does have something more interesting to say:
- RCP is more popular among the OSX and Linux communities than Win32.
- Without JEE, Win32 is not nearly as popular a platform.
- CDT and modeling are not particularly popular among the OSX community.
- CDT is loved by the Linux users.
I mentioned earlier that downloads by megabyte are not interesting to me. That doesn't stop me from graphing it. Here's a chart:
and heck, here's another:
Do these last two charts tell you anything different from the download ratio charts? I'll leave that up to someone else to discover. Add it in the comments.
Here's the last three pieces of data I want to share tonight:
Total Uploads: 145
Total MB uploaded: 18,781.1 MB
Upload average (assuming 48 hours): 111.295 kbps
You could say, then, that 145 people got their instance of Eclipse from my bittorrent client.
Thank you, Verizon FIOS. Thank you, Eclipse Foundation!