
Originally Posted by
mrgrimezHD
Many things can be factored into why servers crash, Developers can rent out huge sections known as 'Server Farms' and still encounter errors which can well and truely bite the bullet and as the phrase goes.. ''**** up''.
Server Load - It can be one of the main reasons why Servers crash out, hence why Betas are released pre launch in order to 'Stress Test' the servers allowing people to join in mass quantities. The servers are monitored during this time and analyzed to gain information for future tweaks / updates.
Networking - The infrastructure of a game is a complex wall of code, in which it talks to servers and gives out hundreds of thousands of command lines in order for a game to run smoothly online, every time you connect to a server, lobby or even log online the servers have to handle all this information in a split second, which can sometimes bog down and become static, thus rendering it to a 'crashed' state.
Configuration Errors - As mentioned above, everything you see visually is basically an image map or modelling, but beneath all of that is complex code which is used for everything to run smoothly. However if a single piece of code gets mis typed whether it be an individual letter or symbol then it can be catastrophic.
Server Hardware Issues - Servers are massive machines, which in order to handle thousands of packets (information to me and you) it needs to be pretty beefy, especially when Online Gaming is concerned. What some people don't understand is the Servers are doing all of the work behind the scenes and if Hardware Failure happens then it can have a domino effect if that particular server is linked to another.
Back up Systems - Think of these like the 'Restore Point' for windows, only these are designed to replace EVERYTHING which was stored on a server in the first place, everything down to for example Program scripts / C++ Coding files / .SYS files and everything else which makes a game run, it can take MASSIVE amounts of CPU resources and if these Back Up Systems kick in and a problem occurs, then it can take a while to sort the problem out.
Basically it's like looking for a needle in a haystack.
There's many more reasons why a Server can be fine one minute and be rendered useless the next, all that I know is to keep a 'farm' of these servers alive can be very expensive, let alone having a dedicated team to correct the problems if they arise.