Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Watch Out For Cowboys UDFA Tim Benford

Logarithmic Decay: How the NFL Draft is Like a Sprite

Not a lemon-lime soda.

Anyone familiar with my articles and comments on Niners Nation is well aware of my beliefs about what good teams accomplish in the NFL draft (or what good fantasy football teams accomplish in a fantasy draft; or what good NCAA March Madness pool players accomplish in filling out their brackets). If you're not familiar, my philosophy basically has 2 components:

  1. Good teams avoid picking busts with their more crucial picks (e.g., 1st-rounders, NCAA Final Four teams)
  2. Good teams find diamonds in the rough with their less crucial picks (e.g., 2nd-rounders, NCAA Cinderellas)

As I presented prior to last year's draft, the Patriots exemplify the first component to a tee, whereas Bill Walsh's 49ers exemplified the second component.

Going into this year's draft, I wanted to put my philosophy to the test. After all, I formed these beliefs based on anecdotal evidence from my own fantasy football/NCAA pool experiences, general observations from my NFL fandom, and pattern recognition in last year's pre-draft posts. In other words, there's not really any science behind it. So, the goal this year was to blind myself with science, see whether I've been full of crap all these years, and prepare to apologize to anyone I've ever argued with vis-à-vis my beliefs.

Although a noble cause, one pretty significant problem arose when I sat down to actually do the testing: There don't seem to be any objective definitions of a "bust" or "diamond in the rough" as these terms relate to the NFL draft. Just a light perusing of SB Nation uncovers many subjective definitions of a bust...like this one from our own smileyman, this one from Sklz711 at Windy City Gridiron, and this one from Rafael Vela (via One.Cool.Customer) at Blogging the Boys. And that's just on SB Nation! Imagine how many there must be throughout the internets.

With respect to definitions of "diamond in the rough," there's even less out there from a subjective standpoint, and obviously nothing out there from an objective standpoint. Apparently, a diamond in the rough to NFL fans is what obscenity was to Justice Potter Stewart, we know it when we see it.

So, before I could test my own beliefs about good teams, busts, and diamonds in the rough, I had to kind of invent the wheel so to speak. Namely, I had to develop my own objective definitions of a bust and diamond in the rough. Today, in Part 1 of this 3-part series, I present that analysis, as well as the definitions that arose from it.

After the jump, find out what the heck that title means...

Star-divide

THE PHANTOM SPRITE

If I were to grossly summarize all of the subjective definitions of "bust" and "diamond in the rough," I'd say the basic point is that a bust performs much worse than expected, whereas a diamond in the rough performs much better than expected. So, with that in mind, the question becomes: What's "expected?" That, after all, is the crux of the definitional dilemma. Well, statistical prediction is all about expectations, so this particular question lends itself well to an answer via stats.

To determine the "expected performance" of a draft pick, I looked at all NFL draft picks from 1994-2005. I chose 1994 because that's when the salary cap kicked in, and I chose 2005 because all of the draftees that year have now had the opportunity to play an average-length career. Using pro-football-reference's draft database, I gathered an amazing amount of data related to each draft pick, the most important/relevant of which are below:

  • Round selected
  • Pick selected
  • Position
  • Career length
  • Career approximate value (Career AV)

Of the 5 stats above, Career AV was my measure of performance. Granted, AV is flawed in several ways that I've discussed previously. But, as a very general measure of player value that can be used to compare players across positions, it's about the best/only thing out there these days. One minor modification I made to Career AV was that I divided it by career length to put everyone on a level playing field. Obviously, players with longer careers have more opportunities to accumulate value, so that has the potential to skew the data. Also, as many of the players in the data set are still playing, their "career length" for the purposes of my analysis was cut short through no fault of their own; and that definitely has the potential to skew the data. Therefore, by controlling for career length, I limited these potential biases as much as possible.

So, based on these considerations, the performance measure I'm looking at is technically called "Average Weighted Season AV." It's basically the average seasonal value for a player.* From now on, and for the sake of readability, I'm going to simply refer to this as "performance" because that's basically what it is.

ATTACK OF THE SPRITE

When you take the performance of each player selected from 1994-2005, and average these performances for each specific pick number, you get the following graph (click to enlarge):

Draft_performance_by_pick_medium 

The white trendline is what's called a logarithmic decay curve, which basically represents a situation in which there's initial rapid decline, followed by subsequent gradual decline that slows down even further with time...

Lesson time! Either by hearing about carbon dating or the difficulties of nuclear waste disposal, I'm guessing most of you are familiar with logarithmic decay's mathematical doppelganger, exponential decay. And I'm sure pretty much any of you who have dabbled in the business/banking world or have sat around on the weekends watching mold are familiar with its mathematical cousin, exponential growth.

However, unless you're a meteorologist, geophysicist, or atmospheric scientist, you've almost certainly never heard of a real-world phenomenon that exhibits logarithmic decay. Indeed, as I occupy none of the above careers, I had never encountered one either; that is, until I consulted Google. Come to find out that one of the few examples of logarithmic decay occurs about 60 miles above the surface of the Earth; in what's called a sprite. Apparently, there's a mysterious, once-thought-of-as-imaginary phenomenon above thunderstorms wherein a burst of red light shoots out of the top of the storm. These bursts are called sprites (See picture at top of post). Researchers have found that light emission during a sprite decays logarithmically over time; hence, the title of this post. OK, back to the show!

...What's amazing to me is that this logarithmic trendline explains over 85% of the performance variation between pick numbers (See R-squared). When you consider that we're talking about 262 different pick numbers - and 2,992 individual draft picks - that's stunning.

So, the moral of the graph is that, as the NFL draft proceeds, there's a steep decline from the expected performance of the 1st pick to that of the 65th; but after that there's only a gradual decline over the next 200 picks.

Looking a little deeper, and without getting into too much detail (aka see here for detail in plain English), the logarithmic function displayed on the graph suggests that about two-thirds of expected performance has already "decayed" by Pick 65. This has 2 important implications for the development of my objective definitions:

  • 1. It's kind of silly to call any player a bust if he was selected after Pick 64. If he's only supposed to be - at most - 32% as good as the #1 pick, can a team/fan really be that upset if he doesn't even turn out to be that good?
  • 2. Within each round, the actual pick numbers matter a heck of a lot for expected performance from Picks 1-32, somewhat less for Picks 33-64, and hardly at all for the rest of the draft. This is shown in the graph by the steepness of the drop at various points along the curve. Therefore, when defining a bust or diamond in the rough, there's really no need to distinguish between picks within each round after Pick 64; whereas it is important to distinguish between Picks 1-64.

For the non-logarithm-savvy among us, this latter point is also shown using plain ol' correlations: as the draft proceeds, the correlation between player performance and pick number within each round gets weaker and weaker (Rd 1 = -.281, Rd 2 = -.150, Rd 3 = -.132, etc.).

REVENGE OF THE SPRITE

So we now know that (a) expected performance in the NFL draft follows a pattern of logarithmic decay from pick to pick, but (b) it's not that important to distinguish between different picks after the 2nd round. If (b) is the case, then it's useful to re-analyze the data by averaging the performance of all picks within a round, and seeing how expected performance decays from round to round. Below is a graph showing the result of this re-analysis (click to enlarge):

 Draft_performance_by_round_medium

Amazingly, the logarithmic decay curve (aka the white trendline) explains the expected performance variation between rounds almost perfectly (99.4%; See R-squared)! Again, the fact that any relatively simple equation explains the NFL draft this well is downright stunning to me.

When comparing the 2 graphs, it's pretty easy to see why the round-by-round trendline fits better than the pick-by-pick trendline. Specifically, the former does a much better job of explaining the variation of expected performance at the top of the draft. If you look at the first graph, you can see that the expected performance of the first handful of picks is relatively underestimated, i.e., the peaks are below the trendline. In contrast, the second graph pretty much nails its top-of-the-draft estimation on the number.

Although it's nice in and of itself to discover this consistent logarithmic decay pattern in the draft, the point here was to define "bust" and "diamond in the rough." So what does the round-by-round analysis mean for that purpose? Well, the equation you see in the graph predicts that about 50% of expected performance has already "decayed" by Round 3. This has 2 implications that correspond to the previous pick-by-pick implications:

  • 1. It's kind of silly to call any player a bust if he was selected after Round 2. If he's only supposed to be - at most - 50% as good as a 1st-rounder, can a team/fan really be that upset if he doesn't even turn out to be that good?
  • 2. It's now questionable as to whether it's necessary to distinguish between picks at the top of the draft when looking at expected performance.

A NEW SPRITE

OK, so I think I've made it pretty apparent that the expected performance of NFL draft picks follows a logarithmic decay pattern. Now it's time to look for any factors that might influence that overall pattern. The main one that comes to mind is position. Basically, career expectations are different for different positions, so it stands to reason that the trajectory of expected performance over the course of the draft for players of one position might be different than for players of a different position. At least that's the intuitive way of looking at it.

But does the data support the intuition? Below is a chart showing the average expected performance for each position in the draft except for Ks and Ps (click to enlarge):

 Draft_performance_by_position_medium

Even just from an eyeball test, it doesn't appear like there's much of a difference between the top 5 positions, nor between the next 5 positions; but there's definitely a drop-off after WR. Well, if we compare these averages statistically by using a series of t-tests, it turns out that there's a statistically significant difference between the expected performance of a FB draft pick and that of a DE, T, LB, C, RB, G, DT, or DB draft pick. In addition, there's also a statistically significant difference between the expected performance of a TE draft pick and that of a DE, T, or LB draft pick.

In other words, among the 12 positions shown in the chart, only the last 2 are at a statistically significant disadvantage when taken in the NFL draft; but they're not even at a disadvantage with all of the remaining 10. Indeed, out of 66 possible comparisons, only the 11 I listed above were statistically significant. Considering the robustness of the pick-by-pick and round-by-round findings, this lack of evidence seems to suggest that rounds and picks are far more important than positions when defining a bust or diamond in the rough.

There clearly are more factors to be considered besides positions, but I have limited space and time. Feel free to suggest more factors in the comments section. Also, just because I ruled out positions this time, it doesn't mean that's the end of all debate on the matter. More time and research might prove otherwise. 

THE SPRITE STRIKES BACK

The fruits of my labor have led to 2 relatively simple equations for predicting a draft pick's performance; one according to the specific pick at which the player was selected and the other according to the round he was selected. Reliably knowing what to expect from a given draft pick resolves the vast majority of subjectivity in the quest for an objective definition of "bust" and "diamond in the rough." All that's left is the part about how much above or below expectations a player would have to perform in order to be considered a bust or diamond in the rough, respectively.

The pattern that's emerged in my analyses is that there's somewhat of a cut-off point at the end of the 2nd round. Whether we're talking about specific picks or overall rounds, somewhere between one-half and two-thirds of expected performance has "decayed" by that point in the draft, depending on which equation you use. So, the first part of my definition for a "bust" is that he can only be a 1st- or 2nd-round pick; and the first part of my definition for "diamond in the rough" is that he cannot be a 1st- or 2nd- round pick.

Because of the remaining questions regarding whether specific pick numbers matter in the 1st or 2nd round, the second part of my definition for a "bust" relies on the pick-specific logarithmic decay equation, whereas my definition for a "diamond in the rough" relies on the round-specific equation.

Adding the two parts of each definition together we get the following:

  • An NFL draft bust is a player who was selected in the 1st or 2nd round and played 67% or more below the expected performance of his specific pick number.
  • An NFL draft diamond in the rough is a player who was selected after the 2nd round and played 200% or more above the expected performance of his specific round.

There you have it, objective definitions for both a "bust" and a "diamond in the rough."

RETURN OF THE SPRITE

One last thing I'll bring up about how the NFL draft is a real-world example of logarithmic decay. Here's an article I stumbled across by McDonald Mirabile that was published in The Sports Journal. Mirabile did an analysis showing that the amount of money an NFL team spends on their draft picks each season can be almost perfectly predicted by knowing (a) how many picks they had, and (b) the exact pick number of each of those picks. What struck me about this article was that it has a graph showing how rookie salaries change with each subsequent draft pick number. Here's the graph (reproduced from original article; click to enlarge):

Rookie_salaries_medium 

Look familiar? Looks like logarithmic decay to me. So even with salaries, the NFL draft is like a sprite. It seems like logarithmic decay might come to be known as Danny's First Law of the NFL Draft? OK, so ignoring that bit of narcissism, it still is quite interesting to me that both expected performance and expected salary seem to follow the same pattern. I'll leave it to the comments for you to figure out why that is really interesting vis-à-vis overpaying/underpaying draft picks.

BOTTOM LINE

Based on what I've presented in Part 1 of this series, here's what you should remember for Part 2:

  1. Draft pick performance seems to follow a reliable logarithmic decay pattern across picks and rounds.
  2. An NFL draft bust is a player who was selected in the 1st or 2nd round and played 67% or more below the expected performance of his specific pick number.
  3. An NFL draft diamond in the rough is a player who was selected after the 2nd round and played 200% or more above the expected performance of his specific round.

Tomorrow, Part 2, in which I'll identify the busts and finally test my theory about how good teams avoid them like the plague.

* On pro-football-reference, Career AV is not just a simple sum of a player's Season AVs. Rather, it's a weighted sum such that a player's best season is weighted @ 100%, his 2nd-best season is weighted @ 95%, his 3rd-best season is weighted @ 90%, and so on. That's why I say my performance measure is actually "Averaged Weighted Season AV."

Comment 48 comments  |  3 recs  | 

Do you like this story?

Comments

Display:

Perfect example of writers with too much time on their hands.

Time to get on the clock. “With the 13th selection the 49ers select ______.”

by Jaxson876 on Apr 21, 2010 4:54 PM PDT via mobile reply actions  

You don't have to read it

and you don’t have to make a deragatory comment about it if you don’t understand it.

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 6:00 PM PDT up reply actions  

agreed

cognitive angst

This was some good work. How hard is it to look up correlation on wiki, or anything on wiki for that matter?

I don’t think that at all… and of course this is all speculation on your behalf
by Drew K on Apr 14, 2010 2:05 PM PDT

by goatfather on Apr 21, 2010 6:08 PM PDT up reply actions  

We learned this in 7th grade though?

I’m sure Everyone here is over 12 years old.

What we've got here is a failure to communicate.

by SportsChicken on Apr 21, 2010 6:38 PM PDT up reply actions  

lol

so true, but only chronological age, not intellectual.

I don’t think that at all… and of course this is all speculation on your behalf
by Drew K on Apr 14, 2010 2:05 PM PDT

by goatfather on Apr 21, 2010 6:44 PM PDT up reply actions  

-1

What we've got here is a failure to communicate.

by SportsChicken on Apr 21, 2010 6:25 PM PDT up reply actions  

just think...

if i wasn’t wasting all that time writing this post, i would have had more time to waste poking fun at people who make comments like this. good thing i didn’t have that time. this reply would have been way more combative.

by (Florida) Danny Tuccitto on Apr 21, 2010 6:59 PM PDT up reply actions  

This is quite poetic my friend. Dare I say

“Mathimatico-poetic” (Stanley Salthe)

I quite enjoyed this. You know what is actually interesting and makes some sense – on the similar function between salary and draft selections and the original analysis based on expected performance. These are both energetic-exchange relations. Their might be some thermodynamics at issue here. Amazing that the physio-chemical domain can share similar observational properties with phenomenon that occur at much longer time scales. Physiochemical domain → Biochemical→ Neurophysiological

I also noticed that you have embarked upon new avenues of inquiry and methodology. I admire the swiftness with which you perform and analyze this stuff.

Are you working on a BS, or a Ph. D.?

I don’t think that at all… and of course this is all speculation on your behalf
by Drew K on Apr 14, 2010 2:05 PM PDT

by goatfather on Apr 21, 2010 5:05 PM PDT reply actions  

I'm so glad I kinda understand statistics, and you make it so simple applying it to football!

Excellent job Florida Danny

That’s nice to see the relationship with “performance” related to each draft pick from round 1 to 7, I don’t think I’ve seen that quantified before.

The “expected” performance is based on player values. So the 3rd figure, with performance by position shows the average of all players at that position. Plus the t-test checks the average value against the other position average value. But I wonder what the variability in those position player values are. For example, those top 10 positions are not different, but maybe some of them have few excellent players and many below average, or many players that are all average, etc.

Did you separate out the Positions based on different picks/rounds? (and still see that the positions don’t really matter on expected values…)

I guess draft picks on average are paid what they deserve based on your graph and the salary graph! Nice.

It appears that the pick number and round number show slightly different interpretations of the transition point of bust/not bust, or atleast how much the value drops off after 2nd-3rd round. Might do all of the analysis separately based on these two patterns.

Good read, can’t wait to see what you find

by fortyniners on Apr 21, 2010 5:08 PM PDT reply actions  

I like your passion

and enjoyed this post, good work! Very interesting.

by Steve Young on Apr 21, 2010 6:01 PM PDT reply actions  

Danny does the chart still look the same if you compare within the first round?

I know that the money chart does, was just wondering if the bust chart does.

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 6:01 PM PDT reply actions  

Nice.

I am not a fan of statistics, I am more of an algebra guy, but this was a very interesting and well written article. I really like your definitions.

by jbrown63 on Apr 21, 2010 6:43 PM PDT reply actions  

Hey! I resent that!
However, unless you’re a meteorologist, geophysicist, or atmospheric scientist, you’ve almost certainly never heard of a real-world phenomenon that exhibits logarithmic decay

I’m a Religious Studies major and was well aware of of the concept of ‘nuclear half-life.’

The lesson, as always: marry a physicist.

Jason Hill is turning the corner!

by grantmp on Apr 21, 2010 6:56 PM PDT reply actions  

Strange.

As a physics major, that just seems like an odd marriage to me…

No philisophical disagreements between the two of you?

I was "Deific16"
The cake is a lie.

by Sultan of Seitan on Apr 22, 2010 2:23 AM PDT up reply actions  

Thought

I was the only Religious Studies Major… we are a rare breed. Our department here at VCU will likely be shut down, so my degree is even more rare.

by Steve Young on Apr 22, 2010 11:31 AM PDT up reply actions  

Draft Update

Right now I am standing inline in front of the radio city music hall… I would guess there are about 1000 people here. The NFL is giving out 1500 tickets, so I hope I get one.

Only 2 hours left, more updates to come.

by twolfe2 on Apr 21, 2010 7:04 PM PDT via mobile reply actions  

update

I guess they are starting early as people already have wrist bands. I will let you know if I get one.

by twolfe2 on Apr 21, 2010 7:24 PM PDT via mobile up reply actions  

Sooooo jealous

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 7:56 PM PDT up reply actions  

got em!!

I am now sporting a draft wrist band!!

How much do you think I can get for it on ebay?

by twolfe2 on Apr 21, 2010 8:32 PM PDT via mobile up reply actions  

lol

I don’t think that at all… and of course this is all speculation on your behalf
by Drew K on Apr 14, 2010 2:05 PM PDT

by goatfather on Apr 22, 2010 8:31 AM PDT up reply actions  

Will you be able

to take pics inside? If so you should post a bunch of crazed Niner fan reactions after the pics!

I don’t think that at all… and of course this is all speculation on your behalf
by Drew K on Apr 14, 2010 2:05 PM PDT

by goatfather on Apr 22, 2010 8:31 AM PDT up reply actions  

I love your posts even though I either read them after practice or early in the morning so most of it goes above my head. But when I do get them they are an excellent read.

by manraj7 on Apr 21, 2010 8:24 PM PDT via mobile reply actions  

Brilliant

The fact the rookie pay scale chart matches the performance chart leads me to believe teams have known this for a long time,

by bignerd on Apr 21, 2010 8:25 PM PDT reply actions  

Yeah I tend to agree

I think that the big problem with the rookie pay scale is just in the top 10 picks, because that’s where the huge investment comes in. If Bradford goes number 1 to the Rams he’ll likely get somewhere close to $50 million guaranteed, without taking a single snap.

That kind of expenditure can cripple a team, especially one that obviously needs to fill other roster spots.

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 8:32 PM PDT up reply actions  

Seems like teams overpay the first few picks

Am I reading your graph right by seeing that the actual performance is a bit below the trend line for the top 5 or so picks? But that the pay scale does the opposite? In other words, teams overpay the top few picks. The number one makes about 5 times as much as the number 32, but by the graph is expected to be about twice as good.

If I was a mediocre team with a lot of holes to fill, which I probably would be if I had the number 1-5 picks, I would trade out of the top few picks every time. Because if the value on the trade chart matches the salary graph, you can end up with a lot more for your money.

by ljl on Apr 21, 2010 8:45 PM PDT reply actions  

I believe that is why the owners are pushing for changes to the rookie wage scale. A few years ago picks in the Top 5 became an issue, now I would say the problem has extended to the Top 10 picks.

by bignerd on Apr 21, 2010 8:53 PM PDT up reply actions  

Teams try to trade out all the time

but the rest of the league doesn’t want to be stuck with the huge salaries that a #1 overall pick gets. This is why I think a top 10 rookie pay scale will be really important in new negotiations

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 8:53 PM PDT up reply actions  

I think

one thing a rookie pay scale would encourage is that a team take the best player available, and not the best position suited to the salary attached to the draft pick.

by Andrew Davidson on Apr 21, 2010 8:56 PM PDT up reply actions  

That's another good point

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 8:57 PM PDT up reply actions  

Even more sense for the 49ers

to keep # 13 and 17 rather than trading them up to get inside the top 10

by fortyniners on Apr 21, 2010 10:39 PM PDT up reply actions  

Danny I think you're going to want to read this study

Cade Massey from Yale’s Management School and Richard Thaler from the Chicago School of Business wrote a paper on the value attached to first round picks. They spefically went after the idea that the first overall pick is worth twice as much as the 7th overall pick and four times as much as the 20th overall pick (according to the accepted draft chart). It makes for some interesting reading. The original was done in 2006. They’ve since come out with an updated one

Massey-Thayer 2006

Massey-Thaler 2010

Advanced NFL Stats took a brief look at the study and had a nice article about it in April

Re-Thinking Massey-Thaler

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 9:04 PM PDT reply actions  

A better fit and some criticism

Interestingly, Massey-Thaler use a Weibull distribution type function to fit in into the expected performance by pick number data. It should be better for the Danny’s data too – partially because in this case it is a three parameter function: AV = a exp(b(PICK-1)^c).

My larger criticism is the definitions of “bust” and “diamond in a rough”. First, why limit the definitions by the rounds? Why the limits can’t be generalized to all rounds? Second, how the percentage limits 67% below and 200% above for “bust” and “diamond in a rough” are obtained? Are these determined by some (which quantiles?) quantiles of a distribution that AV/E{AV} falls into?

by VH on Apr 23, 2010 5:46 AM PDT up reply actions  

Danny,

What’s the y-axis in relation to? 9.00 to 0.00? I understand that it is to show a relation between draft picks, but what caused that scale? sorry if i missed this during the reading.

by shulkdog on Apr 21, 2010 9:47 PM PDT reply actions  

Not to sound like a [site decorum]

…but isn’t the theory that teams that do well in the draft (defined as picking well in both the first and later rounds) will see success fairly self-evident?

by tedler on Apr 21, 2010 10:13 PM PDT reply actions  

That's not what this article is about at all.

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 10:32 PM PDT up reply actions  

This was directed at the philosophy he brings up at the beginning, rather than the article as a whole

by tedler on Apr 21, 2010 10:41 PM PDT up reply actions  

Gotcha

but you have to define what drafting well means and that’s what he was saying.

Avoid the busts and find diamonds in the rough

Member of the legendary David Carr thread, 6 March 2010

by smileyman on Apr 21, 2010 10:58 PM PDT up reply actions  

The problem is that the philosophy is fairly self-evident, but the terms of the philosophy are completely undefined – which means there is no good way to evaluate success (there are a lot of bad ways, certainly). What Danny is trying to do is to define those terms so that we can actually begin to evaluate success trends within the original conceit.

"I just struck out looking three times, but in any other ballpark those would have been home runs." - Aubrey Huff

by howtheyscored on Apr 22, 2010 11:37 AM PDT up reply actions  

Danny, this is awesome. Totally appreciate it, have been waiting for something like this.

I am extremely intrigued and really want to see some more,

However, can you elaborate on this please?

…What’s amazing to me is that this logarithmic trendline explains over 85% of the performance variation between pick numbers (See R-squared). When you consider that we’re talking about 262 different pick numbers – and 2,992 individual draft picks – that’s stunning.

I am having a hard time understanding where this R^2 is coming from and how it is being displayed on the graph.

If I am understanding the graph corectly (expected performance by pick number);
The Red Portion = The range is the average AV of all players picked at each pick # from 1994-2005.

So, Y=((94pick1AV+95p1AV+96p1AV+97p1AV+98p1AV+99p1AV+00p1AV+01p1AV+02p1AV+03p1AV+04p1AV+05p1AV)/11) = Average AV for pick 1?
And so on for each individual pick? (Awesome job by the way, if so, that is a lot of plugging #’s.)

And the white line is a fit of ln(x) so that it reflects data set given by the Average AV’s by pick #?

What is (R^2=.858) in this graph? I am guessing that I am not understanding something with the math…

I was "Deific16"
The cake is a lie.

by Sultan of Seitan on Apr 22, 2010 2:55 AM PDT reply actions  

Ah, never mind with the r^2 question.

A quick google search told me to quit taking calc classes and take stats if I want to know about r^2.

I was "Deific16"
The cake is a lie.

by Sultan of Seitan on Apr 22, 2010 3:13 AM PDT up reply actions  

Caution: More Math Below

An interesting exercise is to take the inverse of your logarithmic decay function. This inversion produces an equation that relates a player’s draft position to a function that decays exponentially as his performance increases (remembering that a low draft position is better). Now this equation may seem weird at first glance, but it represents a very important function: a cumulative distrbution function (CDF). Basically if you want to know how many players will have a career AV of x or beter, a CDF is designed to give you this information. So, if you put in the average performance of a 1st overall pick, the CDF will tell you that one player per draft is expected to perform at or above this level. If you put in the average performance of the 10th overall pick, the CDF will tell you 10 players are expected to perform at this level or better, etc. Now, the CDF for player performances is useful to know because taking the derivative of the CDF gives you a function describing the distribution of talent within the draft class. For this dataset, the distribution function we obtain is an exponential decay. Somewhat as expected, the function says that there are a lot of players who are not so talented and few players who are very talented.

Now, while an expoential distribution of talent in the draft class is not so unexpected, one would instead expect something more like a power-law distribution which tends to pop up in situations like this (this type of distribution is useful for these situation because it displays a property called scale-invariance, meaning that if you zoom in or out on the function, the function basically maintains the same shape). Now, assuming a power-law distribution of talent, we can derive the form of an equation to fit the performance vs draft position graph: y = ax^-b (where a and b are the free parameters). It would be intersting to see how fitting your dataset to this power-law function fares compared to the logarithmic function.

by Yggdrasil on Apr 22, 2010 9:37 PM PDT reply actions  

Thanks for the props Danny.

Smileyman pointed me this way. What a great post, loved every second of it. For those of you who are interested in further applications of this data, I have two very recent posts on Blogging The Boys (here and here) also dealing with Career AV.

And congrats on Davis and Iupati. Would have loved to get our hands on any of the two.

by One.Cool.Customer on Apr 23, 2010 5:21 AM PDT reply actions  

Comments For This Post Are Closed


User Tools

Media Requests please email ninersnation@gmail.com

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Small
Site Decorum: Remember, We Are ALL 49er Fans
Steve_young_small
Game Day Food

Recent FanPosts

Small
Concussions...
Small
Is Harbaugh lying or does he mean what he says?
872_small
Where have you seen 49er players?
Download2_small
Can the 49'ers Maintain their Turnover Differential in 2012?
Sfak_small
Why are you a 49er fan?
6a00e5500c77218833011168f234b4970c_small
FOX: "How To Save The Sport"
Small
Old Spice Patrick Willis Football ProCamp
Dave_small
Call For Moderators

+ New FanPost All FanPosts >


Head Ball Coach

Dave_small David Fucillo

Howtheyscoredcat_small howtheyscored

313483_2054510893373_1562580382_31984672_1965025_n_small James Brady

Coordinator

Pirates_small smileyman

Bowman_avi_sm_small Tre9er

Assistant Coach

Pixies_logo_small (Florida) Danny Tuccitto

Memento-lies_small urnext

Me_on_beach_small WesHanson

Dylan_cannes_small Dylan DeSimone

Officiating Crew

Jackalope_card_small wjackalope

These3words_small these3words

Joe_and_bill_small twolfe2

428030_10150598134996875_112852666874_9167376_1157036734_n_small mikeinsp