On the 1st of March, 2020, Riot Games announced that they would be:
- removing the League of Legends boards
- not providing an official archive
- not providing an online-accessible read-only copy
This was met with a not-insignificant amount of backlash; the boards have been the place where the community has engaged for several years (since at least 2013). There are several years of history in the boards, including memes that are long forgotten and some that are perhaps still around.
I am a software engineer in my day job. I took a few hours to write a script that scrapes all of the posts from the boards and downloads them into an archive, so that it can be redistributed across the internet for free. What you do with this data is up to you. Perhaps you will recreate the boards?
The archives enclosed are for the European and North American English boards. I did not back up the non-English speaking boards as the resources required for me to back up the two largest boards were expensive enough. I apologize to any non-English speakers out there.
The European archive contains approximately 480,000 discussions and the North American archive approximately 1,300,000 discussions. In this context, a discussion is an entire thread: the original post plus all comments.
The European archive (eu.tar.gz) is a standard tarball, compressed with the highest level of GZip compression. Inside this tarball is a folder containing a set of files, with each file corresponding to a single board (category) and discussion:
Each of these files is compressed further with Brotli compression. Brotli is more CPU intensive than GZip but results in smaller file sizes.
This archive is approximately 725MB in size and 2GB when uncompressed.
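For anyone who wants to read the European archive programmatically, here is a minimal Python sketch of the two layers described above: a GZip-compressed tarball on the outside and one compressed file per discussion on the inside. The function name `iter_eu_files` and the `decompress` parameter are made up for illustration, and the format of the decompressed contents is not assumed here, so the sketch just yields raw bytes.

```python
import tarfile
from typing import Callable, Iterator, Tuple


def iter_eu_files(
    archive_path: str,
    decompress: Callable[[bytes], bytes],
) -> Iterator[Tuple[str, bytes]]:
    """Yield (member name, decompressed bytes) for each file in the tarball.

    `decompress` is whatever function undoes the per-file compression;
    it is passed in so this sketch has no hard third-party dependency.
    """
    # "r:gz" opens a GZip-compressed tarball for reading.
    with tarfile.open(archive_path, mode="r:gz") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip the containing folder entry
            handle = tar.extractfile(member)
            if handle is None:
                continue
            try:
                yield member.name, decompress(handle.read())
            except Exception as exc:
                # One damaged file should not stop the whole run.
                print(f"skipping {member.name}: {exc}")
```

For the real archive you would most likely pass `brotli.decompress` from the third-party `brotli` package (`pip install brotli`), for example `for name, data in iter_eu_files("eu.tar.gz", brotli.decompress): ...`. Because the tarball is read member by member, nothing needs to be unpacked to disk first.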
The North American archive is formatted differently. Due to the vastly increased number of discussions and what I learned from attempting to scrape the European boards, I had to adopt a different format to be able to scrape 1.3 million discussions in any reasonable amount of time.
Within the North American archive (na.txt.gz) is a single text file which is compressed with the highest level of GZip compression. Each line in that text file is a JSON blob corresponding to a single discussion. Unfortunately, due to what appears to be a bug in my code, some lines may be corrupted.
While there are only about 100 discussions missing from this archive in total when line counts are compared, you may encounter issues when attempting to parse the file line by line. Please make sure that the parser you write is forgiving.
This archive is approximately 2.1GB in size when compressed and 12GB in size uncompressed.
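As an illustration of what a "forgiving" parser for the North American archive could look like, here is a minimal Python sketch that streams the GZip file line by line and skips anything that does not parse as JSON. The function name `iter_na_discussions` is made up for this example, and nothing is assumed about the fields inside each JSON blob.

```python
import gzip
import json
from typing import Any, Iterator


def iter_na_discussions(path: str) -> Iterator[Any]:
    """Yield one parsed JSON value per readable line, skipping bad lines."""
    skipped = 0
    # Text mode with errors="replace" keeps a stray invalid byte from
    # aborting the read; the file is streamed, never loaded whole.
    with gzip.open(path, mode="rt", encoding="utf-8", errors="replace") as lines:
        for line in lines:
            line = line.strip()
            if not line:
                continue  # ignore blank lines
            try:
                yield json.loads(line)
            except json.JSONDecodeError:
                skipped += 1  # corrupted line: count it and move on
    print(f"skipped {skipped} unparseable lines")
```

Streaming matters here because the file is about 12GB uncompressed; a loop such as `for discussion in iter_na_discussions("na.txt.gz"): ...` only ever holds one line in memory at a time.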
At the request of the mods, I have removed the link to the torrent and included a MEGA link. Unfortunately, there's a cap on the amount of transfers from MEGA, so I can't guarantee uptime.
Source: Original link
© Post "I have created a backup archive of all* of the LoL boards posts" for game League of Legends.