OpenAI and Google DeepMind develop machine learning algorithm for a safer AI

OpenAI and Google DeepMind are working together to make artificial intelligence safer.

The two companies have produced an algorithm that learns from human feedback, providing a more reliable machine learning process.

The move comes after concerns over the rise of AI highlighted by tech luminaries such as Bill Gates, Elon Musk, and even Stephen Hawking weighing in on the need for securer machine learning platforms.

OpenAI and DeepMind have sought to develop a process that will help to make AI safer to use and more easily trainable.

They have achieved this through ‘reinforcement learning’, strengthening the intelligence of the algorithm through several stages of human engagement. The process involves the algorithm completing tasks within a particular environment whilst participants provide responses which are fed back to the machine.

This allows the algorithm to learn and alter its behavior according to desired actions it receives. During the tasks, the algorithm keeps adapting its next moves according to information given in the form of a ‘reward predictor’.

Demonstrations present how the algorithm successfully achieves tasks, responding to human participants training it to recognise and decide what stages it must take to improve future judgement.

One example involved people ‘training’ a graphic of a lamp to do back flips. They would watch two clips, then select the video where they felt the AI graphic was best performing. This was then fed to the algorithm to adapt its following sequence by gaining an awareness of what the preferred course of action would be.

Despite the progress that has been made there are several concerns regarding this method, as training algorithms this way is limited to the particular skill’s ability of the person supplying the information. This can have adverse effects if strong feedback is not provided, taking away the efficiency of machine learning training.

The process in whole provides a way for machine learning to develop and extend intelligence whilst completing complex tasks. This is especially useful in industry sectors such as autonomous driving which rely on AI methods to monitor efficiency in vehicles. It allows machines to predict how to overcome regular challenges.

Training algorithms to become more advanced through authentic human interactions, rather than programmed predictions proves to be highly beneficial. Machine learning devices process selective information regarding frequent tasks carried out by people, to better understand future behaviors.

Sign up for our weekly news round-up!

Sign up to the newsletter: In Brief

Read more: Google DeepMind M2M starts dreaming

Sign up for our regular news round-up!

Sign up for our weekly news round-up!

Sign up to the newsletter: In Brief

I would also like to subscribe to:

Thank you for subscribing