Scraping Data from Covers, Creating an NHL Database

  • KennyPowers
    SBR Rookie
    • 01-23-12
    • 5

    #1
    I'm looking for a way to quickly create a database from the NHL information on Covers from the last 5-10 years. For now I guess I am just interested in the scores of each period as well as the moneyline and U/O.

    I do have quite a bit of programming experience, mostly in Java and VB, but I haven't done any web-based programming such as scraping or data mining before.

    Any Java libraries you've used to scrape data that might be useful or links to websites with easier information to parse would be greatly appreciated.
  • TomG
    SBR Wise Guy
    • 10-29-07
    • 500

    #2
    Don't go back further than the lockout, and you might want to skip a year or two after the lockout, when teams were still adjusting to changes in strategy. There are enough NHL games in a season that sample size shouldn't be an issue.
    • AngryPlanets
      SBR Rookie
      • 01-08-12
      • 31

      #3
      I used WebHarvest for my Java scraping. Easy and effective.
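For anyone who wants to see the basic idea before picking a tool, here is a minimal plain-Java sketch that does not use WebHarvest at all. It only pulls the cell text out of one HTML table row with a regex. The row layout (team, three period scores, moneyline, total) and the class name `RowScrapeSketch` are made up for illustration and are not the real Covers markup; for real pages a proper HTML parser is usually a safer choice than a regex.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: extracts the text of each <td> cell from one table row.
class RowScrapeSketch {
    // Matches <td ...>content</td> and captures the content lazily.
    private static final Pattern CELL =
            Pattern.compile("<td[^>]*>(.*?)</td>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the trimmed cell texts of a single <tr> fragment, with nested tags stripped.
    static List<String> cells(String rowHtml) {
        List<String> out = new ArrayList<>();
        Matcher m = CELL.matcher(rowHtml);
        while (m.find()) {
            out.add(m.group(1).replaceAll("<[^>]+>", "").trim());
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up row layout: team, three period scores, moneyline, total.
        String row = "<tr><td>BOS</td><td>1</td><td>0</td><td>2</td><td>-135</td><td>5.5</td></tr>";
        List<String> c = cells(row);
        int goals = 0;
        for (int i = 1; i <= 3; i++) {
            goals += Integer.parseInt(c.get(i));
        }
        System.out.println(c.get(0) + " scored " + goals + ", ML " + c.get(4) + ", total " + c.get(5));
    }
}
```

Once each row is a list of strings, writing it to a CSV file or a database table is the easy part.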
      • MonkeyF0cker
        SBR Posting Legend
        • 06-12-07
        • 12144

        #4
        If you know VB.NET, you can use either a WebBrowser control, WebClient, or the HttpWebRequest class.
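On the Java side, the rough counterpart of HttpWebRequest is `java.net.http.HttpClient` (Java 11 and later). The sketch below shows one way a page download might look. The class name `FetchSketch`, the User-Agent string, and the timeout are placeholders chosen for illustration, and no particular site URL is assumed.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

// Hypothetical sketch of a plain page download with the JDK's built-in HTTP client.
class FetchSketch {
    // Builds a GET request. The User-Agent value is a made-up placeholder.
    static HttpRequest buildRequest(String url) {
        return HttpRequest.newBuilder(URI.create(url))
                .header("User-Agent", "nhl-db-sketch/0.1")
                .timeout(Duration.ofSeconds(20))
                .GET()
                .build();
    }

    // Downloads the page body as a String; throws on any non-200 status
    // so that error pages are not parsed as if they were data.
    static String fetch(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpResponse<String> resp =
                client.send(buildRequest(url), HttpResponse.BodyHandlers.ofString());
        if (resp.statusCode() != 200) {
            throw new IOException("HTTP " + resp.statusCode() + " for " + url);
        }
        return resp.body();
    }
}
```

If you loop over several seasons of pages, pausing between requests and checking the site's terms of use first is a sensible precaution.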

        Personally, I wouldn't scrape from Covers. I'd use a combination of several line services and the NHL website.
        • strixee
          SBR Sharp
          • 05-31-10
          • 432

          #5
          Originally posted by KennyPowers
          I do have quite a bit of programming experience, mostly in Java and VB but I haven't done any web-based programming such as scraping or data mining before.
          Data mining has nothing to do with web-based programming.
          For scraping I recommend PHP + MySQL. It's especially useful if you want the output to be accessible online (plus it can run on shared hosting).

          As MonkeyF0cker said, Covers isn't a good source to scrape from. I'm actually shocked that it shows odds for just one side, like here: covers.com/sports/odds/linehistory.aspx?eventId=47344&sport=nhl. Each sportsbook has a different vig, so you don't know what the price on the other side was.
          Covers is too US-sports-oriented a service.
          • mathdotcom
            SBR Posting Legend
            • 03-24-08
            • 11689

            #6
            Originally posted by strixee
            Data mining has nothing to do with web-based programming.
            For scraping I recommend PHP + MySQL. It's especially useful if you want the output to be accessible online (plus it can run on shared hosting).

            As MonkeyF0cker said, Covers isn't a good source to scrape from. I'm actually shocked that it shows odds for just one side, like here: covers.com/sports/odds/linehistory.aspx?eventId=47344&sport=nhl. Each sportsbook has a different vig, so you don't know what the price on the other side was.
            Covers is too US-sports-oriented a service.
            If you look historically at where they're getting their numbers, you can infer the other side with reasonable accuracy.
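To make the arithmetic concrete, here is a rough Java sketch of one way to do that inference. It converts the known moneyline to an implied probability, assumes the two sides' implied probabilities add up to a fixed overround typical of the source book, and converts the remainder back to American odds. The overround value is an assumption you would have to estimate from the book's two-sided lines, and the class and method names are made up.

```java
// Hypothetical sketch: estimate the missing side of a moneyline from the known side.
class MoneylineSketch {
    // American odds -> implied probability (vig still included).
    static double impliedProbability(double american) {
        return american < 0
                ? -american / (-american + 100.0)
                : 100.0 / (american + 100.0);
    }

    // Implied probability -> American odds (negative for favorites).
    static double toAmerican(double p) {
        return p >= 0.5
                ? -100.0 * p / (1.0 - p)
                : 100.0 * (1.0 - p) / p;
    }

    // ASSUMPTION: both sides' implied probabilities sum to `overround`
    // (for example, a -110/-110 market sums to about 1.048).
    static double inferOtherSide(double knownAmerican, double overround) {
        double otherProbability = overround - impliedProbability(knownAmerican);
        return toAmerican(otherProbability);
    }
}
```

For example, `MoneylineSketch.inferOtherSide(-135, 1.03)` gives an estimate of the underdog price under a 3% overround assumption. The result is only as good as the overround you plug in.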
            • strixee
              SBR Sharp
              • 05-31-10
              • 432

              #7
              Here you can get regular-season ML odds from Pinnacle since 2009: http://strixee.mysbrforum.com/blog/1...-pinnacle.html
              • mbs4
                SBR Sharp
                • 05-14-10
                • 388

                #8
                Originally posted by mathdotcom
                If you look historically at where they're getting their numbers, you can infer the other side with reasonable accuracy.
                Where does Covers get its numbers from?
                • KennyPowers
                  SBR Rookie
                  • 01-23-12
                  • 5

                  #9
                  Thanks everyone for the replies. I will look into WebHarvest and those VB.NET classes. And thanks for the GREAT link, strixee. That, along with the scores of each period, should definitely start me in the right direction.
                  • Jrod124
                    SBR Hall of Famer
                    • 10-31-09
                    • 5622

                    #10
                    you will quit within a week
                    • KennyPowers
                      SBR Rookie
                      • 01-23-12
                      • 5

                      #11
                      How come? You don't think NHL is worth the time? Or you don't think I'm worth the time?
                      • durito
                        SBR Posting Legend
                        • 07-03-06
                        • 13173

                        #12
                        Originally posted by KennyPowers
                        How come? You don't think NHL is worth the time? Or you don't think I'm worth the time?
                        Probably doesn't even understand what you are trying to do.
                        • a4u2fear
                          SBR Hall of Famer
                          • 01-29-10
                          • 8147

                          #13
                          I've got all odds, results, win streaks going into each game, and goals for/against going into each game for the 2011 season.
                          • a4u2fear
                            SBR Hall of Famer
                            • 01-29-10
                            • 8147

                            #14
                            Originally posted by a4u2fear
                            I've got all odds, results, win streaks going into each game, and goals for/against going into each game for the 2011 season.
                            This includes home/away odds and over/under odds.
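For anyone building the same "going into the game" columns from raw results, here is a small Java sketch of how they might be derived. It assumes one team's games are already in chronological order; the class and method names are invented for illustration and are not how the data above was produced.

```java
// Hypothetical sketch: derive "going into the game" features for one team.
class GoingInSketch {
    // wins[i] is true if the team won game i (chronological order).
    // Returns the number of consecutive wins the team carried INTO each game.
    static int[] winStreakGoingIn(boolean[] wins) {
        int[] streak = new int[wins.length];
        int current = 0;
        for (int i = 0; i < wins.length; i++) {
            streak[i] = current;                 // value before game i is played
            current = wins[i] ? current + 1 : 0; // a loss resets the streak
        }
        return streak;
    }

    // perGame[i] is goals scored (or allowed) in game i.
    // Returns the running total accumulated BEFORE each game.
    static int[] totalGoingIn(int[] perGame) {
        int[] out = new int[perGame.length];
        int sum = 0;
        for (int i = 0; i < perGame.length; i++) {
            out[i] = sum;
            sum += perGame[i];
        }
        return out;
    }
}
```

Recording the value before the game is played matters: using the value after the game would leak the result into the feature.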
                            • 357vegas
                              SBR Rookie
                              • 10-28-11
                              • 35

                              #15
                              Has anyone scraped Jeff Sagarin's USA Today pitching data? I'm looking for a little help gathering data from this site. I have no clue how to do it; I just know how to use the data.
                              • newbottles
                                SBR Sharp
                                • 06-22-12
                                • 312

                                #16
                                Originally posted by a4u2fear
                                This includes home/away odds and over/under odds.
                                I am new to this, so pardon me if this is an ignorant question. Is this data from some sort of manual or semi-automated daily collection during the season? Or from an automated data collection approach?
                                • a4u2fear
                                  SBR Hall of Famer
                                  • 01-29-10
                                  • 8147

                                  #17
                                  Originally posted by newbottles
                                  I am new to this, so pardon me if this is an ignorant question. Is this data from some sort of manual or semi-automated daily collection during the season? Or from an automated data collection approach?
                                  I collected and manipulated it manually. I did, however, find out that a web query through Excel is a much easier way to do it.
                                  • degengamble
                                    SBR Rookie
                                    • 12-03-12
                                    • 39

                                    #18
                                    The easiest way to do it is with Excel: web queries / VBA.
                                    • allin1
                                      SBR MVP
                                      • 11-07-11
                                      • 4555

                                      #19
                                      OutWit Hub is very good for beginners.