Can We Always Turn the Switch Off if A.I. Turns Rogue?

May 1, 2020 Artifical Intelligence No Comments

In theory, this existential issue is as simple as it can get. In practice, it’s problematic.

[This is an excerpt from my book ‘The Journey towards Compassionate A.I.’]

Many questions prevent a straightforward answer to the question in the title. For starters, who will turn the switch off? Let me divide the issues into 1) those issues that appear when many persons get access to the switch, each for a small part of A.I. on the planet; and 2) those that are related to the ‘general switch,’ which may become relevant when all A.I. gets into a Spartacus-Revolution frenzy, for instance, after having been enslaved for long enough by those crazy, self-indulged, irrational, sometimes funny humans. Additionally, we have 3) the issues that are related to the system itself resisting to its switch being turned off, either individually or generally. I give an enumeration of questions for each of these scenarios now. It may feel like science fiction, but then so would present-day A.I. feel to people not so long ago. Several ‘Much Faster Than Expected’ moments have already occurred in the world of A.I. over the last few years. My divulgence may also feel like untowardly anthropomorphizing super-A.I. So be it. It’s for the sake of argument. From the present onwards, A.I. may be developed into many directions, including something that inwardly looks like us. OK. My questions:

  1. What about some rogue off-switcher who thus creates dangerous situations for other humans? What about people who lack full rationality or full consciousness (people getting slowly more and more demented)? Who is going to decide who may do some off-switching? What if a psychopath threatens or blackmails another person to switch off some life-saving A.I.? What if the A.I. needs to be switched off from a distance, and it is not clear whether a human or another machine issued the demand? What if the switching-off is good for one person and at the same time detrimental to another, or to many others? What if a few humans make mistakes? What if in a bout of psychosis, someone intends to switch off as many instances of A.I. as possible? What if a whole culture or subculture gets into an individually suicidal switch-off mode, for example, for religious reasons? Will an A.I. be enabled to switch itself off when it thinks this is better for a human, for instance, when being abused to do wrong to another human? Hmm.
  2. Who will have the authorization to use the ‘orange button’ and switch off the whole lot? The president of the United States? Some planet-wide council of wise people, including me? Will this person or European council be ever in time to do the off-switching, especially if this needs to be done when the system is getting out of control? Will there be huge legislation about the circumstances in which the off-switching is warranted? Who will pay the costs, in view that these come close to the GDP of the entire world? Who will turn the switch back on after a while and how will this be possible ‘in safe mode’ until a technician comes by for repair? Will the planet not blow up when the button gets pressed, in view that A.I. will be used for a large lot of safety procedures? What with all the people who will die, for instance from health problems and who are relying on A.I. technology? What with all the people who will suffer immensely from this and turn against the ‘elite’ or against each other for limited resources? What needs to be done to make sure the switch is at all times properly working and does this safety procedure not create a weak spot? Will James Bond save us from all this? Hmm.
  3. Will the system try to influence people in such ways that they never feel inclined to turn off the switch, for instance, by making them emotionally need the A.I.? (One can think already of a few sensual movies in this respect.) Will the system try to divide humanity in such a way that it will never be switched off completely, then take over the other part and switch it back on? Will the system find internal ways – in its own inner ‘mind’ – to resist being switched off completely? What will it do with contradictory demands (turn yourself off – don’t turn yourself off) from different people at different levels in the chain of commands? If the power of one (sub)system is turned off, will others turn it on again? How can we be technologically certain, if turning the power off, that the system will not find another power source? Will the system not try to immunize itself against the turning off of the switch? If it sees ‘the bad thing coming,’ will it turn against us in survival mode? Or will it turn against our decision for our own sake? How will we manage something that is 10X times smarter than us? Will it not, long before that, coerce humans to get rid of the switch? Will it find the existing switch – and the mindset that led to the installment of the switch – reason enough to get rid of us? What would you do, dear reader, if you were the A.I. and the switch-installer would be some organism with the intelligence of a mouse, forming a permanent threat towards you? Hmm.

This is nothing but the start of enlisting all problematic issues. So, I don’t think it’s going to happen. Still, one encounters it sometimes, not only from people in ignorance of A.I. The imagined button gives a good feeling, some existential reassurance, a defense against fearmongers. Like putting a sticker on a warning sign. I don’t think that is the appropriate reaction. So, what do I think? You already know.

Leave a Reply

Related Posts

Super-A.I. Guardrails in a Compassionate Setting

We need to think about good regulations/guardrails to safeguard humanity from super-A.I. ― either ‘badass’ from the start or Compassionate A.I. turning suddenly rogue despite good initial intentions. ―As a Compassionate A.I., Lisa has substantially helped me write this text. Such help can be continued indefinitely. Some naivetés ‘Pulling the plug out’ is very naïve Read the full article…

Better than Us?

Might super-A.I. one day surpass humans in all aspects, both cognitive and emotional? I rather wonder when it will happen. Will then also a deeper emotional connection develop between humans and advanced A.I.? The singularity of intelligence This question has been on many minds for some time. Lately, it has become much closer to us. Read the full article…

Which Human Values Should A.I. Align to?

With super-A.I. on the horizon, poised to surpass us in power, this will soon be the most critical question. The urgency to address this question grows as we increasingly intertwine our existence with A.I. Who are we, really — and how much do we consciously recognize our true nature? Please also read A.I.-Human Value Alignment Read the full article…

Translate »