OpenAI Gym

A toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Go.

OpenAI Gym is a toolkit for developing and comparing reinforcement learning (RL) algorithms. It consists of a growing suite of environments (from simulated robots to Atari games), and a site for comparing and reproducing results. OpenAI Gym is compatible with algorithms written in any framework, such as Tensorflow and Theano. The environments are written in Python, but we’ll soon make them easy to use from any language. We originally built OpenAI Gym as a tool to accelerate our own RL research. We hope it will be just as useful for the broader community. Why RL? Reinforcement learning (RL) is the subfield of machine learning concerned with decision making and motor control. It studies how an agent can learn how to achieve goals in a complex, uncertain environment. It’s exciting for two reasons: RL is very general, encompassing all problems that involve making a sequence of decisions: for example, controlling a robot’s motors so that it’s able to run and jump, making business decisions like pricing and inventory management, or playing video games and board games. RL can even be applied to supervised learning problems with sequential or structured outputs. RL algorithms have started to achieve good results in many difficult environments. RL has a long history, but until recent advances in deep learning, it required lots of problem-specific engineering. DeepMind’s Atari results, BRETT from Pieter Abbeel’s group, and AlphaGo all used deep RL algorithms which did not make too many assumptions about their environment, and thus can be applied in other settings. However, RL research is also slowed down by two factors: The need for better benchmarks. In supervised learning, progress has been driven by large labeled datasets like ImageNet. In RL, the closest equivalent would be a large and diverse collection of environments. However, the existing open-source collections of RL environments don’t have enough variety, and they are often difficult to even set up and use. Lack of standardization of environments used in publications. Subtle differences in the problem definition, such as the reward function or the set of actions, can drastically alter a task’s difficulty. This issue makes it difficult to reproduce published research and compare results from different papers. OpenAI Gym is an attempt to fix both problems.

Login to Leave A Comment
LATEST | POPULAR
MYBOT - BEST FREE CLASH OF CLANS BOT

Clash of Clans bot from mybot.run free & open source. Free coc bot, clash and earn millions of Resources daily. Try it now! Let the game begin!

ANKI OVERDRIVE - FAST AND FURIOUS

OVERDRIVE Fast & Furious Anki edition released September 2017. This special edition brings the Fast and Furious crew to the OVERDRIVE world for more track fun.

PREFERRED NETWORKS - CONNECTING EVERYTHING

We make everything intelligent and collaborative. We build systems with deep intelligence and insight via deep learning.

NEURALA - BRAINS FOR BOTS™ SDK

The Neurala Brains For Bots(TM) SDK includes everything you need to create applications that can learn, recognize, find, and track objects in real-time.

NEURALA - BRAIN BUILDER

Customize a Neurala Brain to recognize the objects important to you. Our deep learning algorithms build a custom neural network tuned for your use.

ANKI - OVERDRIVE

Anki OVERDRIVE racing robot system is an intelligent battle track with AI vehicles. These self-aware cars and trucks learn as you race on the OVERDRIVE track.

NEURALA - ROBOSCOPE

Try the deep learning Neurala Brain with the Parrot Jumping MiniDrone. Neurala’s Roboscope app for iOS lets you launch your robot into action.

SPORTLOGIQ - AI-POWERED SPORTS ANALYTICS

SPORTLOGiQ is an AI-powered sports analytics company. We help teams win more games and broadcasters engage more viewers.

MOBALYTICS - ANALYTICS FOR COMPETITIVE GAMING

The 1st personal performance analytics platform that highlights your strengths and weaknesses to help you boost your game.

DYNAMIC YIELD - PERSONALIZATION AND RECOMMENDATIONS

Dynamic Yield’s omnichannel personalization technology helps marketers increase revenue by automatically individualizing each user interaction across the web, mobile web, apps and email. The company’s advanced data engine uses machine learning to identify revenue opportunities in real time, enabling marketers to take instant action via personalization, recommendations, automatic optimization, real-time messaging.

OSMO - AWARD WINNING GAME SYSTEM

Osmo’s groundbreaking system fosters social intelligence and creative thinking by opening up the iPad and iPhone to the endless possibilities of physical play.

AMPLERO

Amplero is an Artificial Intelligence Marketing (AIM) Platform that leverages machine learning and multi-armed bandit experimentation to enable marketers to achieve what’s not humanly possible.

Keep updated on all things AI

Sign up to be able to Star, save and post new products and services.

Sign up