The Cellar

The Cellar (http://cellar.org/index.php)
-   Technology (http://cellar.org/forumdisplay.php?f=7)
-   -   Interesting "Laws" (http://cellar.org/showthread.php?t=26511)

ZenGum 12-20-2011 09:19 PM

Hey, UT, how hard/easy would it be to analyse the cellar threads in terms of the number of posts? Then analyse that data in terms of the first digit? We could check this law on ourselves.

Lamplighter, remember that we are only focusing on the first digit. Lets take the number of posts in a thread as an example. To keep it simple I'll pretend that threads can't have more than 999 posts, but I'll explain later how to deal with the fact that they can.

Any thread with 1, 11, 12, 13, 14, ... 19, 100, 101, 102 ... etc goes in the "starts with a 1" category.

Threads with 2, 20, 21, 22 ... 29, 200, 201 ... etc will go in the "starts with a 2" category.

We could continue this all the way to 9, and the possibility of any starting digit seems equal.

BUT! In reality, many threads have only a single post, or just a handful. Many struggle into the teens or twenies before they die. Fewer make it into the 30s and 40s, still fewer into the 80s and 90s. This means that there will be more thread totals starting with a 1 than any other digit.

The same pattern happens whe we consider the 100s and 200s and so on.

And if we want to go past 999 posts, the same pattern will apply. 1,000 to 1,999 all start with 1, and so on. It is the same pattern as before.

In a sentence: thread post counts will usually start with lower digits because threads die before they can get to the higher digits.


Well, that is how it is for things like thread post counts. Here, they grow from one upwards without missing a step. You have to go through 1 to get to 2, and you might stop along the way, which is why there are more 1s than 2s. You have to go through the teens before you get to the 20s, and you might stop on the way, so again there are more 1s than 2s.

However, the case with things like river lengths is different, or at least it seems different to me. You can have a 2 mile river without there being a 1 mile river, so there is no risk of "stopping along the way". So the frequency of 1s and 2s in things like this is ... umm ... not explained in the way it is for thread post totals.

In fact, I cannot explain it and have never heard of a good explanation. It just is. And you'd think that changing the units of measurement - yards to feet, for example, should shift the results, since a 1 yard river is a 3 foot river ... but it doesn't, since all those 0.34 yard rivers are now 1.1 foot rivers.

It's freaking weird, now that I come to think of it.

tw 12-20-2011 10:54 PM

Nine is a most frequent digit whenever I buy gas. Nine appears more often than any other number in the price. Today, it was $3.299 per gallon. Nine gallons is a typical fillup. When they ask me how much, I say, "Give me the whole nine yards".

Whenever I buy gas, nine times out of ten, even the weather is nice. Change one letter and another nine appears.

Good weather always leaves me feeling on cloud nine. How can this be? Well, I always avoid one - the loneliest number.

ZenGum 12-20-2011 11:17 PM

:lol: are you :rasta:?

Remember, though, it is the first digit the law applies to.

(That makes it sound like a rude hand gesture, doesn't it?)

tw 12-20-2011 11:26 PM

Quote:

Originally Posted by ZenGum (Post 781828)
(That makes it sound like a rude hand gesture, doesn't it?)

Nine.

classicman 12-21-2011 12:24 AM

Nein!

Clodfobble 12-21-2011 08:36 AM

Quote:

Originally Posted by ZenGum
However, the case with things like river lengths is different, or at least it seems different to me. You can have a 2 mile river without there being a 1 mile river, so there is no risk of "stopping along the way". So the frequency of 1s and 2s in things like this is ... umm ... not explained in the way it is for thread post totals.

In fact, I cannot explain it and have never heard of a good explanation. It just is. And you'd think that changing the units of measurement - yards to feet, for example, should shift the results, since a 1 yard river is a 3 foot river ... but it doesn't, since all those 0.34 yard rivers are now 1.1 foot rivers.

I have two things to put forth on this. First, consider that all units of measurement were created by people for their usefulness. Yards vs. feet is not such a big difference after all. Miles is getting closer to causing problems, but still these are units that people chose for a reason. Subconsciously we prefer things that are measured in less than 10, or in multiples of ten, because they are easier for our brains to keep track of, so the units we derive are going to reflect that preference.

Second, it is less likely to have a 3-mile river without stopping at a 2-mile river. Because there is a statistical probability of all the things that cause rivers to be diverted, blocked, or run out of water. Think of it more like a series of coin tosses. The probability of flipping all heads gets less and less the more flips you require (yes I understand each flip is independent, but considering the probability from the beginning before you start flipping.) The probability that wellspring's water will go 10 feet without a problem? Pretty high. The probability that it can go 1 mile without encountering a boulder or a beaver dam? Less likely, but still pretty good. The probability that it can go 3 miles without such a problem? Even less. The problem is that you can't just have the third mile of a river without having the first and second miles. The nature of measurement means you must always start at 1.

And anyway, here is an example that doesn't fit: adult male heights, measured in feet. You're going to have a huge frequency of 5s and 6s, and almost zero prevalence of 1s. While there is a certain probability on any given day of your life that you might be maimed and lose your legs, the chances are small and the majority of individuals make it to the 5-6 foot range. If you were to consider the final height of every person born, not just those that make it to adulthood, then you'd have to count all those short people who die in childhood and you might very well get the same distribution. But only in countries with a reasonably high infant mortality rate, in the US the distribution would still be radically skewed towards 5s and 6s.

ZenGum 12-21-2011 06:36 PM

First two paragraphs have me thinking hard....

Third one ... that is covered in the bit about this law working best for measurements scattered over several orders of magnitude, using power laws. It doesn't work for measurements around a tight bell curve.

Pete Zicato 12-22-2011 09:57 AM

Cole's Law


Quote:

Thinly sliced cabbage.

infinite monkey 12-22-2011 10:04 AM

lol @ pete.

This thread must be where all the smart people hang out. I'm going down the street. My head hurts. ;)

HungLikeJesus 12-22-2011 11:28 AM

1 Attachment(s)
I did a quick analysis of the page views of the Image of the Day forum for the last year. Here is the distribution of first digits. I think the analysis would have been better if I included a longer period of time.

All of the 8s are 800 to 899; all of the nines except two are 900 to 999. It may be too small of a distribution, because 90% of the values are between 500 and 4,000.

Lamplighter 12-22-2011 12:31 PM

Imma gonna guess...

What if views of IOD are bimodally distributed ( popular vs not-so-popular )

If so, the population of IOD's with less than 1000 views might follow Benford's Law
And, the population of IOD's with more than 1000 views might also follow Benford's Law

So if the graph were drawn with 2 cycles (1-100-1000),
there would be two peaks (bimodal) at the 1's,
each falling off and following the Benford distribution after the 1's.

Otherwise, the IOD's would have to be assumed to be equally popular,
and then the distribution doesn't follow the prediction.

HungLikeJesus 12-22-2011 12:38 PM

Maybe roughly the same number of people look at the IOTD each day. I should probably have picked a different forum to get a wider range. Or maybe most of the views are due to spiders and robots.

ZenGum 12-22-2011 05:32 PM

Note this is page views not number of posts. It is still an interesting result.

Clod, your second paragraph has me persuaded. I think. It feels like when you're wrestling with the anthropocentric principle, that ... wait does this really work? moment. I think it does, as of right now. Thank you.

HungLikeJesus 01-19-2012 02:04 PM

Quote:

Originally Posted by HungLikeJesus (Post 780792)

Hey, I just noticed that they left off consideration of the digit 0 (and the digit þ)

Pete Zicato 01-19-2012 02:20 PM

Here they are HLJ!

Half of all people are below average.

Kaa's Law: In any sufficiently large group of people most are idiots.


All times are GMT -5. The time now is 05:04 PM.

Powered by: vBulletin Version 3.8.1
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.