K
KG
I am attempting to import a large amount of webpages containing historical
odds on NBA games (about 4 years of data). The data is available on webpages
similar to this example:
http://archive.scoresandodds.com/grid_20070418.html
As you can see, the date is used in the URL, meaning it should be possible
to make a macro to automatically import each page from then until now into
Excel.
The problem is, however, complicated further due to these factors:
1) NBA odds aren't posted daily meaning some dates/URLs wont have any data
to import (for instance the next two pages/days in the sequence, that is
20070419.html and 20070420.html doesn’t contain NBA data, while
20070421.html once again has the NBA table)
2) Each page also contains tables with data for others sports such as
baseball, but I only need data from the NBA table to be imported.
What is the easiest way to achieve this goal? Is it possible to make a
macro/VBA that loops through all dates since april 2007 and imports only the
NBA table and discards pages where no NBA table is present?
I am new to Excel and would appreciate any help and solutions you can
provide.
Thank you!
odds on NBA games (about 4 years of data). The data is available on webpages
similar to this example:
http://archive.scoresandodds.com/grid_20070418.html
As you can see, the date is used in the URL, meaning it should be possible
to make a macro to automatically import each page from then until now into
Excel.
The problem is, however, complicated further due to these factors:
1) NBA odds aren't posted daily meaning some dates/URLs wont have any data
to import (for instance the next two pages/days in the sequence, that is
20070419.html and 20070420.html doesn’t contain NBA data, while
20070421.html once again has the NBA table)
2) Each page also contains tables with data for others sports such as
baseball, but I only need data from the NBA table to be imported.
What is the easiest way to achieve this goal? Is it possible to make a
macro/VBA that loops through all dates since april 2007 and imports only the
NBA table and discards pages where no NBA table is present?
I am new to Excel and would appreciate any help and solutions you can
provide.
Thank you!