Forum Thread
  Posts  
Mod and Game Python Scrapper (Forums : General Banter : Mod and Game Python Scrapper) Post Reply
Thread Options
Nov 9 2017 Anchor

Hello all,

ModDB has always had a special place in my heart, being the place where I discovered some of my all time favorite mods, and since I recently took up programming, which I also adore I decided to combine both into a single project. It started off as a simple command to print some rough details concerning mod pages but I've transformed it into a library which can scrap both mod and game pages given a url (with more to come later). And I'm looking for some feedback from people who visit these pages often. Below is a list of all the stuff that the scrapper already gathers and I'd love to hear about stuff that you would also like to be made accessible.

The scrapper itself came from the fact that I needed detail from these pages but failed to find an API although apparently its planned but with no ETA. The page will be scrapped and then return as either a Mod object or a Game object depending on the url provided with a bunch of attributes to access the various details, any of these attributes can be none if the details are not found on the page.

Mod Page

The Mod Page object contains the following data:

  • Name of the mod
  • Description of the mod (found in the meta tags)
  • name of the game
  • link to the game page
  • URL of the mod
  • Link to the comment form
  • Link to follow the mod
  • Link to contact the creators
  • A Share Link object that contains
    • The Reddit Share link
    • The Facebook Share link
    • The Mail To
    • The Twitter Share link
  • The Rank of the mod
  • The url to the homepage
  • the rating of the mod
  • the last time the mod was updated
  • the release date of the mod
  • the publishers of the mod
  • the link to the icon of the mod
  • the count object containing
    • number of visits
    • number of followers
    • number of fies
    • number of articles
    • number of reviews
  • The style object
    • genre of the mod
    • theme of the mod
    • players of the mod
  • the tags list which is a list of tags with
    • the name of the tag
    • the url of the tag
  • the suggestions list which is a list of suggestions
    • name of the mod suggested
    • link of the mod suggest
    • link to the image of the mod suggested
  • the articles list which contains all the articles with each article having
    • a title
    • a description
    • a link
    • a publishing date

Game Page

The Game page is very similar but contains some other data that is specific to game pages only

  • Name of the game
  • Description of the game (found in the meta tags)
  • URL of the game
  • name of the engine
  • link to the engine page
  • link to the boxart image
  • project type of the game
  • Link to the comment form
  • Link to follow the game
  • Link to contact the creators
  • A Share Link object that contains
    • The Reddit Share link
    • The Facebook Share link
    • The Mail To
    • The Twitter Share link
  • The Rank of the game
  • The url to the homepage
  • the rating of the game
  • the last time the game had an update
  • the release date of the game
  • the publishers of the game
  • the link to the icon of the game
  • the count object containing
    • number of visits
    • number of followers
    • number of fies
    • number of articles
    • number of reviews
    • number of mods for this game
  • The style object
    • genre of the game
    • theme of the game
    • players of the game
  • the tags list which is a list of tags with
    • the name of the tag
    • the url of the tag
  • the suggestions list which is a list of suggestions
    • name of the mod suggested
    • link of the mod suggest
    • link to the image of the mod suggested
  • the articles list which contains all the articles with each article having
    • a title
    • a description
    • a link
    • a publishing date

So I'm interested to hear if there are any other details you would like to see from Mod and Game pages. For the programmers who read this post this library is available on pip and the repository is here. It uses python 3.6 and the BeautifulSoup library.

Reply to thread
click to sign in and post

Only registered members can share their thoughts. So come on! Join the community today (totally free - or sign in with your social account on the right) and join in the conversation.