Using Hollinger's Draft Rater to Make a Point About False Positives and Negatives and the Klay vs. Kawhi Question

Which one of these guys has a better chance of being a star?

That title was a mouthful, but hopefully I can make this post a bit more digestable. I should also note before getting into this, that ostensibly, this post is about Hollinger's draft rater and some statistical mumbo jumbo, but really, that's all a prelude to the following conclusion: The Warriors likely made a mistake by passing on Kawhi Leonard in favor of Klay Thompson. I should also note that you can easily find all the statistical mumbo jumbo on Type I and II errors on Wikipedia.

The point of a statistical test is to identify *real* trends, groups, effects, etc. For example, in medicine one wants to know whether a drug has a real effect. The results of a medical diagnostic test are supposed to tell us whether a patient really has *something* or really doesn't have *something*, with *something* being the thing that is being tested for obviously. If a patient really has something, an ideal test would tell us she definitely has that something we're testing for. Conversely, if she really doesn't have the something, then hopefully, our test will tell us she really doesn't have something.

But in the real world, diagnostic tests are not perfect. Sometimes, the patient really does have something, but the test says she doesn't. That is known as a "false negative" or a "Type II error". Of course, this is bad, because the patient might really have cancer, for example, but according to the test, she is perfectly fine. Conversely, the patient might be perfectly healthy, but the test result suggests she really has something. This is known as a "false positive" or a "Type I error". If you were a patient, even though this would be a scary initial result, more careful follow-up diagnostic would hopefully contradict the false positive.

We need a couple more pieces of jargon before continuing. Doctors want to avoid false negatives, because as I said above, if you have cancer and the test says you don't, that's a scary situation. Better to not have cancer but think you do (initially, anyway), than the other way. Or as Clarence famously said in True Romance, "It's better to have a gun and not need it, than to need a gun and not have it." (Is the gun the cancer or the test?) So, we'd probably rather our test give us a false positive than a false negative, and we'd definitely want it to give as many true positives as possible. There is a numerical way to define this, and it is called "sensitivity":

text{sensitivity} = frac{text{true~positives}}{text{true~positives+false~negatives}}

A test that is 100% sensitive finds all true positives and has zero false negatives. In other words, if a test is 100% sensitive and you really have cancer, the test will say you have it. If a test is 50% sensitive, then it will only give you the correct diagnosis half the time.

What about detecting true negatives? If you don't have cancer, a good test will say you are healthy, and not put you through needless further emotional turmoil (and further medical costs). That is called the "specificity" of the test:

text{specificity} = frac{text{true~negatives}}{text{true~negatives+false~positives}}

Note how that definition is very similar to the previous one, but simply reverses the positives and negatives. If you really don't have cancer, a test that is 100% specific will tell you that you really don't have it, while a test that is only 50% specific will incorrectly diagnose you half the time.

Let's bring it back to basketball now. I'm going to make the argument that Hollinger's Draft Rating system is very similar to a medical diagnostic, or at least, should be treated as such. In this case the *something* we are looking for in a player is *NBA talent*. In that context, I would argue that Hollinger's rating system appears to be quite sensitive, but not extremely specific.

Given the definitions just presented, what that means is that his "test" usually identifies the players who are going to be stars, but also mis-identifies many players as stars, who turn out to be non-stars (or even non-starters or even non-NBA players in the case of Joe Alexander, for example). Putting this another, perhaps, more useful way, if a player is *really* going to be a star, Hollinger's test is almost always going to show that, and if he is definitely not going to be a star, Hollinger's test will almost always show that, too. There are certain caveats, like one and done players, but even there, it appears to be useful most of the time.

So, what does that have to do with Kawhi Leonard and Klay Thompson you ask? Here's a list that Hollinger presented of the wings with the highest ratings since 2002:

  1. Kevin Durant 17.67
  2. Dwyane Wade 17.05
  3. Carmelo Anthony 16.63
  4. Danny Granger 15.43
  5. Rudy Gay 15.10
  6. Luol Deng 14.46
  7. Josh Childress 13.37
  8. Kawhi Leonard 13.21
  9. Mike Dunleavy 12.95
  10. Dajuan Wagner 12.72

That's a pretty good list to be on, don't you think? Now, here's the ratings for perimeter players in the 2011 draft class:

  1. Kyrie Irving 15.14
  2. Kawhi Leonard 13.21
  3. Kemba Walker 12.75
  4. Tyler Honeycutt 12.56
  5. Jordan Hamilton 11.90
  6. Alec Burks 11.87
  7. Klay Thompson 10.88
  8. Norris Cole 10.85
  9. Jimmer Fredette 10.45
  10. Chris Singleton 10.15
  11. Brandon Knight 10.02
  12. Darius Morris 9.57
  13. Brad Wanamaker 9.57
  14. Reggie Jackson 9.45
  15. Damian Saunders 9.20

Klay Thompson comes in at 10.88. So, the question is how many players with a rating this low become stars (or even good starters)? If there are plenty of those, than my claim that the test is very sensitive would be wrong.

Going back to 3 previous editions of his draft rater, here are the best guards/wings I could find with ratings lower than 11.0 (oh, and btw, Hollinger specifically states his system actually works best for wings):

That's it. For reference, Reggie Williams had a 12.22 rating. Stephen Curry had a 14.18 rating. 

Now, you might say, well, Kawhi Leonard may turn out to be Josh Childress or Dajuan Wagner. True. But using this rating system, I would argue his ceiling is closer to Mike Dunleavy (easily better than anyone on the other list) or Luol Deng, whereas the best that we can hope for Klay Thompson is someone who is better than DeMar DeRozan or Jodie Meeks.

Maybe that is good enough for you. But I would have preferred to take a player who has a chance to be very good rather than a player who — at his best — is likely to be a fringe starter or a starter that nobody is really all that excited about having on their team.

I should end by saying, I truly hope that Klay Thompson will be the player that forces Hollinger to significantly overhaul his rating system and figure out how he missed so badly. But *that* would be a significant surprise, and should not be anywhere close to the expectation, based on the objective statistical analysis.

Post comment as twitter logo facebook logo
Sort: Newest | Oldest

I'm pretty sure that the first 14 teams (and even Indiana, really) saw that Kawhi was a guy who didn't have SF skills and was far too small to be a PF (he's Klay Thompson's size). He'll likely be a good energy guy off the bench, but you don't out him out there against starters for an extended amount of time because he won't be able to go around them or shoot over them and because they'll be able to shoot over him or dribble around him. Remember, Kawhi is a guy that averaged 15.5/10.5/44%fg in the Mountain West division...not exactly a great league (3 teams in the NCAA tourney). He's a guy that got overblown as an NBA prospect for some reason, likely because his team had a great record. Its good to give credit to guys that have team success, but that does not mean they will make it in the NBA. It happens every year with guys that can play on the college level and not the NBA level. The guy ranked above and below Leonard on that list prove this (and they were considered better NBA prospects than Leonard).

Thanks for the response, EvanZ, and by the way I really enjoy your stuff. I have no idea whether Hollinger uses out-of-sample validation, but I am always suspicious of proprietary models that don't give me any reason to believe they are well-constructed. From my perspective Hollinger uses a lot of hand waving. And when I have revisited his models after the fact I have always been struck by what a poor guide to the draft class they would have been. But you may be right that Hollinger has done a pretty good job of ruling out non-stars in the past. Obviously, I would be thrilled if Thompson turned out to be another Russell Westbrook or Eric Gordon, but if those are the only two he's missed that's still pretty impressive. Of course, he used different models to make those predictions. I suppose this latest iteration of the model is based in some way on previous versions, but we have no way of really knowing. So I have my doubts. My doubts are probably fed by my general frustration with PER. PER does a good job telling us LeBron James and Dwight Howard are pretty good. But six-year old niece can tell us that. PER doesn't necessarily do a good job of identifying good role players or telling us how much players contribute to team offense and team defense, which is what would make it useful. Of course, it doesn't really matter that his draft rater is built on PER, since I don't need PER to tell me that the players on the top list are better than the players on his bottom list. Not that I have any reason to believe Hollinger is wrong in this particular case, outside of the fact that Jerry West knows a lot about basketball. So, fingers crossed. Still, I wonder why Leonard dropped. Teams didn't seem to value him as highly as we were told they should. What do they know that we don't? Which raises the question, is Hollinger's record actually better than simple draft position for predicting player success? Maybe the best we can do is say Thompson has about as much chance succeeding as an 11th pick does.

I have no idea if Kawhi Leonard or Klay Thompson will be any good. But I'm not entirely sure that Hollinger does either. That's because I'm not sure that his elaborate multivariable analysis is at all statistically significant. Hollinger never gives us reason to think it is. As he does every year, he rebuilt his model from the ground because his predictions weren't very good last year. In other words, hiss model is entirely retrodictive. He has managed, in hindsight, to predict the past. So he can't point to any confirming successes his model has had of accurate prediction. Maybe this time he has actually hit on a formula he will be able to use to separate stars from scrubs. Or maybe he has just hit on a way sorting past stars from scrubs into separate categories, which we have little reason to think will continue to work in the future. In other words, his analysis may be the equivalent of saying that the Raiders have never won a Thursday night game in a dome in October, when they have only played ever played one or two such games. That's my guess. Because Hollinger's track record is pretty poor. Didn't he have Michael Beasley as one of the best prospects ever just a couple of years ago?

Robert, thanks for the comment. A couple of points. "As he does every year, he rebuilt his model from the ground because his predictions weren’t very good last year. In other words, hiss model is entirely retrodictive." The truth is Hollinger doesn't reveal enough of his method to know how he puts it together. Your assumption is that he does not use out of sample validation, but do you know that for sure? "Didn’t he have Michael Beasley as one of the best prospects ever just a couple of years ago?" Sure, but that's one of the main points of the post. Sensitivity vs. specificity. The fact that he identifies players that don't eventually become stars means the model is not very specific. But what about players are not identified that *do* go on to be stars? As far as I can tell, Westbrook and Eric Gordon are the two most prominent players that the model "missed" (and even they had ratings higher than 11). Anyway, if Hollinger is wrong this year, and Klay Thompson does go on to become a star, I'll have to re-visit my claim! (And will be happy to do so.)

You're a badass. I want you to be wrong, but I can't argue with this.

Thanks! I really hope I'm wrong, too.