While googling for devops case studies, i found this very interesting blog post by Doug Seven on how Knight Capital Group (an American Global Finance) failed code deployment on one of the eight production instances lead to the bankruptcy of KCG in August 2012.
At the first read I felt it may be a hypothetical case or something that is shown too aggravated. But with minimal digging I could correlate some of the facts even thought I am not totally sure about the the whole story. Anyways here the point is not about the story but about Dev & Ops nightmare.
Ignorance is not a bliss
Blog may be talking about 2012, but even today we just have a below 40% adoption of devops which seriously means lots of organization out there still doing manual heavy lifting of syncing of code between the dev/qa/prod environment. With manual intervention there is always a possibility of mistakes. If not today… tomorrow. Now the question to be asked is When or Why? Yes, with so much at stake organization cannot leave it to chance because things are fine at this point.
Human mistakes are inevitable. It may be in any form on the PDCA (Plan Do Check Act) cycle. So better to alway automate the processes which is repeatable. Definitely high risk processes like software deployments so that we have more reliability with each repetition.