AHA Data Oddities #9 & #10

AHAers,

I find these ‘data oddities’ while going through the data and seeing something that catches my eye. Typically my first instinct is that I have a typo in one of my formulas causing the model to pull the wrong number. Almost every time it ends up being a ‘data oddity’ where it’s either a weird ‘glitch’ in the AHA.com data or it’s simply just a unique, record-setting, stat. Below I have examples of each.

Continue reading

AHA Data Oddities #7 & #8

AHAers,

Glad everyone is enjoying the new Player Statistics spreadsheet. Over 300 people have downloaded it in just over 24 hours! Thanks for all the positive feedback as well.

Below I have some more oddities to share with you. Both examples surround subgoalies, which I have deemed as the “bane of my existence.” Subgoalie names appear to be manually typed in by the score keepers and don’t always follow the pattern of “#00 J. Doe”. Typos, misspellings, wrong names and odd formatting can really degrade the quality of the data. It was to the point where I thought it might be cleaner to just default every Sub Goalie’s name to “Sub Goalie,” but you would miss a lot of information. In the end, it was best to convert it to the “Doe, J.” format as best I can, and leave any typos alone. When searching for players, goalies especially, it’s best to be vague with the name to try to capture all of their stats.

Continue reading

AHA Data Oddities #1 & #2

Fellow AHA Data Dorks,

As I’ve been working on the historical AHA stats project, I’ve noticed some oddities in AHA’s stat tracking. Some are just truly odd, some are not odd, but just an annoyance to deal with in the data. I thought it would be good to share some of these examples because:

  1. My own spreadsheets can only be as accurate as to the data that feeds it, and some of these examples will help explain issues you will see yourself; and,
  2. Some of you might just be curious.

I’ll plan to post a couple of examples here and there, so stay tuned for more. I’ve noticed a lot of oddities from the Player-side stats, but I’ll start with some oddities on the Team-side stats:

Continue reading