Slow recovery from 'largest global IT outage in history' could take weeks

Check-in desks at American Airlines at Dublin Airport Terminal 2 on Friday. Picture: Collins Photos
Services began to come back online on Friday evening after an IT failure that wreaked havoc worldwide.Â
But full recovery could take weeks, experts have said, after airports, healthcare services and businesses were hit by the âlargest outage in historyâ.
Flights and hospital appointments were cancelled, payroll systems seized up and TV channels went off air after a botched software upgrade hit Microsoftâs Windows operating system.
It came from the US cybersecurity company CrowdStrike, and left workers facing a âblue screen of deathâ as their computers failed to start. Experts said every affected PC may have to be fixed manually, but as of Friday night some services started to recover.

As recovery continues, experts say the outage underscored concerns that many organizations are not well prepared to implement contingency plans when a single point of failure such as an IT system, or a piece of software within it, goes down. But these outages will happen again, experts say, until more contingencies are built into networks and organizations introduce better back-ups.
A Microsoft spokesperson said on Friday: âWeâre aware of an issue affecting Windows devices due to an update from a third-party software platform. We anticipate a resolution is forthcoming.âÂ
Texas-based CrowdStrike confirmed the outage was due to a software update from one of its products and was not caused by a cyber-attack.
Its founder and chief executive, George Kurtz, said he was âdeeply sorry for the impact that weâve caused to customersâ, adding there had been a ânegative interactionâ between the update and Microsoftâs operating system.
CrowdStrikeâs stock price fell dramatically over the course of the day, dropping by as much as 13% at some points in trading.
Elon Musk, owner of Tesla, said the outage caused âa seizure to the automotive supply chainâ while banks in Kenya and Ukraine reported issues with their digital services, and supermarkets in Australia had problems with payments.

According to the service status monitoring website Downdetector, users in the UK were reporting issues with the services of Visa, BT, big supermarket chains, banks, online gaming platforms and media outlets.
The
and channels were also temporarily off-air in the UK before resuming broadcasting, while Australiaâs was also affected.In financial services, Metro Bank reported problems with its phone lines in the UK and Santander said card payments âmay be affectedâ.Â
Monzo said some customers were reporting issues, while some bankers at JP Morgan were unable to log on to their systems and the London Stock Exchange said there were problems with its news service.
Troy Hunt, a leading cybersecurity consultant, said the scale of the IT failure was unprecedented.
âI donât think itâs too early to call it: this will be the largest IT outage in history,â he tweeted.
âThis is basically what we were all worried about with Y2K, except itâs actually happened this time,â he added, referring to the millennium bug that worried IT experts in the run-up to 2000 â but ultimately did not cause serious damage.
The UKâs chartered institute for IT, the BCS, said it could take days and weeks for systems to recover, although some fixes will be easier to implement.
âIn some cases, the fix may be applied very quickly,â said Adam Leon Smith, a BCS fellow. âBut if computers have reacted in a way that means theyâre getting into blue screens and endless loops it may be difficult to restore and that could take days and weeks.âÂ
Organisations with thousands of PCs distributed in different locations face a tougher task, he added.
âItâs just sheer numbers. For some organisations it could certainly take weeks,â he said.

From Amsterdam to Zurich, Singapore to Hong Kong, airport operators flagged technical issues that were disrupting their services. While some airports halted all flights, in others airline staff had to check-in passengers manually.
Among the companies affected on Friday was Ryanair, Europeâs largest airline, which said on its website: âPotential disruptions across the network due to a global third-party system outage ⊠We advise passengers to arrive at the airport three hours in advance of their flight to avoid any disruptions.âÂ
Heathrow, Europeâs biggest airport, said it was âworking hardâ to get passengers âon their wayâ.
In the US, flights were grounded owing to communications problems that appear to be linked to the outage. American Airlines, Delta and United Airlines were among the carriers affected.
Berlin airport temporarily halted all flights on Friday. The aviation analytics company Cirium said 5,078 flights â 4.6% of those scheduled â were cancelled globally on Friday, including 167 UK departures and 171 arrivals.
Queues and blank screens at airports as Microsoft IT outage disrupts travel â video GP practices in the UK said they were unable to access patient records or book appointments. Surgeries reported on social media that they could not access the EMIS Web system.
Reports from the Netherlands also suggested there may be problems within the health service.
The Israeli health ministry said âthe global malfunctionâ had affected 16 hospitals, while in Germany the Schleswig-Holstein university hospital in the north of the country said it had cancelled all planned operations in Kiel and LĂŒbeck.
Ted Wheeler, the mayor of Portland, Oregon, issued an emergency declaration stating that certain essential city services including emergency communications were affected by the outage.
The University of Surreyâs Professor Alan Woodward said the outage was caused by an IT product called CrowdStrike Falcon which monitors the security of large networks of PCs and downloads a piece of monitoring software to every machine.
However, Ciaran Martin, the former chief executive of the UK's National Cyber Security Centre, said that unlike adversarial cyber-attacks, this problem had already been identified and a solution had been flagged.
âThe recovery is not about getting on top of the situation but getting back up. I think itâs unlikely to be very newsworthy in terms of ongoing disruption this time next week,â he said.
The problems for businesses in the US were also compounded by problems with Microsoftâs Azure cloud computing business that occurred on Thursday.