Releases: openai/gym
0.26.2
Release notes
This is another very minor bug release.
Bugs Fixes
- As
reset
now returns(obs, info)
then in the vector environments, this caused the finalstep
's info to be overwritten. Now, the final observation and info are contained within the info as "final_observation" and "final_info" @pseudo-rnd-thoughts - Adds warnings when trying to render without specifying the
render_mode
@younik - Updates Atari Preprocessing such that the wrapper can be pickled @vermouth1992
- Github CI was hardened to such that the CI just has read permissions @sashashura
- Clarify and fix typo in
GraphInstance
@ekalosak
0.26.1
Release Notes
This is a very minor bug fix release for 0.26.0
Bug Fixes
- #3072 - Previously
mujoco
was a necessary module even if onlymujoco-py
was used. This has been fixed to allow onlymujoco-py
to be installed and used. @YouJiacheng - #3076 -
PixelObservationWrapper
raises an exception if theenv.render_mode
is not specified. @vmoens - #3080 - Fixed bug in
CarRacing
where the colour of the wheels were not correct @foxik - #3083 - Fixed BipedalWalker where if the agent moved backwards then the rendered arrays would be a different size. @younik
Spelling
0.26.0
Release notes for v0.26.0
This release is aimed to be the last of the major API changes to the core API. All of the previously "turned off" changes of the base API (step termination / truncation, reset info, no seed function, render mode determined by initialization) are now expected by default. We still plan to make breaking changes to Gym itself, but to things that are very easy to upgrade (environments and wrappers), and things that aren't super commonly used (the vector API). Once those aspects are stabilized, we'll do a proper 1.0 release and follow semantic versioning. Additionally, unless something goes terribly wrong with this release and we have to release a patched version, this will be the last release of Gym for a while.
If you've been waiting for a "stable" release of Gym to upgrade your project given all the changes that have been going on, this is the one.
We also just wanted to say that we tremendously appreciate the communities patience with us as we've gone on this journey taking over the maintenance of Gym and making all of these huge changes to the core API. We appreciate your patience and support, but hopefully, all the changes from here on out will be much more minor.
Breaking backward compatibility
These changes are true of all gym's internal wrappers and environments but for environments not updated, we provide the EnvCompatibility
wrapper for users to convert old gym v21 / 22 environments to the new core API. This wrapper can be easily applied in gym.make
and gym.register
through the apply_api_compatibility
parameters.
Step
Termination / truncation - TheEnv.step
function returns 5 values instead of 4 previously (observations, reward, termination, truncation, info
). A blog with more details will be released soon to explain this decision. @arjun-kg- Reset info - The
Env.reset
function returns two values (obs
andinfo
) with noreturn_info
parameter for gym wrappers and environments. This is important for some environments that provided action masking information for each actions which was not possible for resets. @balisujohn - No
Seed
function - WhileEnv.seed
was a helpful function, this was almost solely used for the beginning of the episode and is added togym.reset(seed=...)
. In addition, for several environments like Atari that utilise external random number generators, it was not possible to set the seed at any time other thanreset
. Therefore,seed
is no longer expected to function within gym environments and is removed from all gym environments @balisujohn - Rendering - It is normal to only use a single render mode and to help open and close the rendering window, we have changed
Env.render
to not take any arguments and so all render arguments can be part of the environment's constructor i.e.,gym.make("CartPole-v1", render_mode="human")
. For more detail on the new API, see blog post @younik
Major changes
- Render modes - In
v25
, there was a change in the meaning of render modes, i.e. "rgb_array" returned a list of rendered frames with "single_rgb_array" returned a single frame. This has been reverted in this release with "rgb_array" having the same meaning as previously to return a single frame with a new mode "rgb_array_list" returning a list of RGB arrays. The capability to return a list of rendering observations achieved through a wrapper applied duringgym.make
. #3040 @pseudo-rnd-thoughts @younik - Added
save_video
that usesmoviepy
to render a list of RGB frames and updatedRecordVideo
to use this function. This removes support for recordingansi
outputs. #3016 @younik RandomNumberGenerator
functions:rand
,randn
,randint
,get_state
,set_state
,hash_seed
,create_seed
,_bigint_from_bytes
and_int_list_from_bigint
have been removed. @balisujohn- Bump
ale-py
to0.8.0
which is compatibility with the new core API - Added
EnvAPICompatibility
wrapper @RedTachyon
Minor changes
- Added improved
Sequence
,Graph
andText
sample masking @pseudo-rnd-thoughts - Improved the gym
make
andregister
type hinting withentry_point
being a necessary parameter ofregister
. #3041 @pseudo-rnd-thoughts - Changed all URL to the new gym website https://www.gymlibrary.dev/ @FieteO
- Fixed mujoco offscreen rendering with weight and height value > 500 #3044 @YouJiacheng
- Allowed toy_text environment to render on headless machines #3037 @RedTachyon
- Renamed the motors in the mujoco swimmer envs #3036 @lin826
0.25.2
Release notes for v0.25.2
This is a fairly minor bug fix release.
Bug Fixes
- Removes requirements for
_TimeLimit.truncated
in info for step compatibility functions. This makes the step compatible with Envpool @arjun-kg - As the ordering of
Dict
spaces matters when flattening spaces, updated the__eq__
to account for the.keys()
ordering. @XuehaiPan - Allows
CarRacing
environment to be pickled. Updated all gym environments to be correctly pickled. @RedTachyon - seeding
Dict
andTuple
spaces with integers can cause lower-specification computers to hang due to requiring 8Gb memory. Updated the seeding with integers to not require unique subseeds (subseed collisions are rare). For users that require unique subseeds for all subspaces, we recommend using a dictionary or tuple with the subseeds. @olipinski - Fixed the metaclass implementation for the new render api to allow custom environments to use metaclasses as well. @YouJiacheng
Updates
- Simplifies the step compatibility functions to make them easier to debug. Time limit wrapper with the old step API favours terminated over truncated if both are true. This is as the old done step API can only encode 3 states (cannot encode
terminated=True
andtruncated=True
) therefore we must encode to onlyterminated=True
ortruncated=True
. @pseudo-rnd-thoughts - Add Swig as a dependency @kir0ul
- Add type annotation for
render_mode
andmetadata
@bkrl
0.25.1
Release notes
- Added rendering for CliffWalking environment @younik
PixelObservationWrapper
only supports the new render API due to difficulty in supporting both old and new APIs. A warning is raised if the user is using the old API @vmoens
Bug fix
- Revert an incorrect edition on wrapper.FrameStack @ZhiqingXiao
- Fix reset bounds for mountain car @psc-g
- Removed skipped tests causing bugs not to be caught @pseudo-rnd-thoughts
- Added backward compatibility for environments without metadata @pseudo-rnd-thoughts
- Fixed
BipedalWalker
rendering for RGB arrays @1b15 - Fixed bug in
PixelObsWrapper
for using the new rendering @younik
Typos
- Rephrase observations' definition in Lunar Lander Environment @EvanMath
- Top-docstring in
gym/spaces/dict.py
@Ice1187 - Several typos in
humanoidstandup_v4.py
,mujoco_env.py
, andvector_list_info.py
@timgates42 - Typos in passive environment checker @pseudo-rnd-thoughts
- Typos in Swimmer rotations @lin826
0.25.0
Release notes
This release finally introduces all new API changes that have been planned for the past year or more, all of which will be turned on by default in a subsequent release. After this point, Gym development should get massively smoother. This release also fixes large bugs present in 0.24.0 and 0.24.1, and we highly discourage using those releases.
API Changes
Step
- A majority of deep reinforcement learning algorithm implementations are incorrect due to an important difference in theory and practice asdone
is not equivalent totermination
. As a result, we have modified thestep
function to return five values,obs, reward, termination, truncation, info
. The full theoretical and practical reason (along with example code changes) for these changes will be explained in a soon-to-be-released blog post. The aim for the change to be backward compatible (for now), for issues, please put report the issue on github or the discord. @arjun-kgRender
- The render API is changed such that the mode has to be specified duringgym.make
with the keywordrender_mode
, after which, the render mode is fixed. For further details see https://younis.dev/blog/2022/render-api/ and #2671. This has the additional changes- with
render_mode="human"
you don't need to call .render(), rendering will happen automatically onenv.step()
- with
render_mode="rgb_array"
,.render()
pops the list of frames rendered since the last.reset()
- with
render_mode="single_rgb_array"
,.render()
returns a single frame, like before.
- with
Space.sample(mask=...)
allows a mask when sampling actions to enable/disable certain actions from being randomly sampled. We recommend developers add this to theinfo
parameter returned byreset(return_info=True)
andstep
. See #2906 for example implementations of the masks or the individual spaces. We have added an example version of this in the taxi environment. @pseudo-rnd-thoughts- Add
Graph
for environments that use graph style observation or action spaces. Currently, the node and edge spaces can only beBox
orDiscrete
spaces. @jjshoots - Add
Text
space for Reinforcement Learning that involves communication between agents and have dynamic length messages (otherwiseMultiDiscrete
can be used). @ryanrudes @pseudo-rnd-thoughts
Bug fixes
- Fixed car racing termination where if the agent finishes the final lap, then the environment ends through truncation not termination. This added a version bump to Car racing to v2 and removed Car racing discrete in favour of
gym.make("CarRacing-v2", continuous=False)
@araffin - In
v0.24.0
,opencv-python
was an accidental requirement for the project. This has been reverted. @KexianShen @pseudo-rnd-thoughts - Updated
utils.play
such that if the environment specifieskeys_to_action
, the function will automatically use that data. @Markus28 - When rendering the blackjack environment, fixed bug where rendering would change the dealers top car. @balisujohn
- Updated mujoco docstring to reflect changes that were accidently overwritten. @Markus28
Misc
- The whole project is partially type hinted using pyright (none of the project files is ignored by the type hinter). @RedTachyon @pseudo-rnd-thoughts (Future work will add strict type hinting to the core API)
- Action masking added to the taxi environment (no version bump due to being backwards compatible) @pseudo-rnd-thoughts
- The
Box
space shape inference is allowshigh
andlow
scalars to be automatically set to(1,)
shape. Minor changes to identifying scalars. @pseudo-rnd-thoughts - Added option support in classic control environment to modify the bounds on the initial random state of the environment @psc-g
- The
RecordVideo
wrapper is becoming deprecated with no support forTextEncoder
with the new render API. The plan is to replaceRecordVideo
with a single function that will receive a list of frames from an environment and automatically render them as a video usingMoviePy
. @johnMinelli - The gym
py.Dockerfile
is optimised from 2Gb to 1.5Gb through a number of optimisations @TheDen
0.24.1
This is a bug fix release for version 0.24.0
Bugs fixed:
- Replaced the environment checker introduced in V24, such that the environment checker will not call
step
andreset
during make. This new version is a wrapper that will observe the data thatstep
andreset
returns on their first call and check the data against the environment checker. @pseudo-rnd-thoughts - Fixed MuJoCo v4 arguments key callback, closing the environment in renderer and the mujoco_rendering close method. @rodrigodelazcano
- Removed redundant warning in registration @RedTachyon
- Removed maths operations from MuJoCo xml files @quagla
- Added support for unpickling legacy
spaces.Box
@pseudo-rnd-thoughts - Fixed mujoco environment action and observation space docstring tables @pseudo-rnd-thoughts
- Disable wrappers from accessing
_np_random
property andnp_random
is now forwarded to environments @pseudo-rnd-thoughts - Rewrite setup.py to add a "testing" meta dependency group @pseudo-rnd-thoughts
- Fixed docstring in
rescale_action
wrapper @gianlucadecola
0.24.0
Major changes
- Added v4 mujoco environments that use the new deepmind mujoco 2.2.0 module.
This can be installed throughpip install gym[mujoco]
with the old bindings still being
available using thev3
environments andpip install gym[mujoco-py]
.
These newv4
environment should have the same training curves asv3
. For the Ant, we found that there was a
contact parameter that was not applied inv3
that can enabled inv4
however was found to produce significantly
worse performance see comment for more details. @rodrigodelazcano - The vector environment step
info
API has been changes to allow hardware acceleration in the future.
See this PR for the modifiedinfo
style that now uses dictionaries instead of a list of environment info.
If you still wish to use the list info style, then use theVectorListInfo
wrapper. @gianlucadecola - On
gym.make
, the gymenv_checker
is run that includes calling the environmentreset
andstep
to check if the
environment is compliant to the gym API. To disable this feature, rungym.make(..., disable_env_checker=True)
. @RedTachyon - Re-added
gym.make("MODULE:ENV")
import style that was accidentally removed in v0.22 @arjun-kg Env.render
is now order enforced such thatEnv.reset
is required beforeEnv.render
is called. If this a required
feature then set theOrderEnforcer
wrapperdisable_render_order_enforcing=True
. @pseudo-rnd-thoughts- Added wind and turbulence to the Lunar Lander environment, this is by default turned off,
use thewind_power
andturbulence
parameter. @virgilt - Improved the
play
function to allows multiple keyboard letter to pass instead of ascii value @Markus28 - Added google style pydoc strings for most of the repositories @pseudo-rnd-thoughts @Markus28
- Added discrete car racing environment version through
gym.make("CarRacing-v1", continuous=False)
- Pygame is now an optional module for box2d and classic control environments that is only necessary for rendering.
Therefore, install pygame usingpip install gym[box2d]
orpip install gym[classic_control]
@gianlucadecola @RedTachyon - Fixed bug in batch spaces (used in VectorEnv) such that the original space's seed was ignored @pseudo-rnd-thoughts
- Added
AutoResetWrapper
that automatically callsEnv.reset
whenEnv.step
done is True @balisujohn
Minor changes
- BipedalWalker and LunarLander's observation spaces have non-infinite upper and lower bounds. @jjshoots
- Bumped the ALE-py version to
0.7.5
- Improved the performance of car racing through not rendering polygons off screen @andrewtanJS
- Fixed turn indicators that were black not red/white in Car racing @jjshoots
- Bug fixes for
VecEnvWrapper
to forward method calls to the environment @arjun-kg - Removed unnecessary try except on Box2d such that if
Box2d
is not installed correctly then a more helpful error is show @pseudo-rnd-thoughts - Simplified the
gym.registry
backend @RedTachyon - Re-added python 3.6 support through backports of python 3.7+ modules. This is not tested or compatible with the mujoco environments. @pseudo-rnd-thoughts
0.23.1
This release contains a few small bug fixes and no breaking changes.
- Make
VideoRecorder
backward-compatible togym<0.23
by @vwxyzjn in #2678 - Fix issues with pygame event handling (which should fix support on windows and in jupyter notebooks) by @andrewtanJS in #2684
- Add py.typed to package_data by @micimize in https://github.com/openai/gym/p
- Fixes around 1500 warnings in CI @pseudo-rnd-thoughts
- Deprecation warnings correctly display now @vwxyzjn
- Fix removing striker and thrower @RushivArora
- Fix small dependency warning errorr @ZhiqingXiao
0.23.0
This release contains many bug fixes and a few small changes.
Breaking changes:
- Standardized render metadata variables ahead of render breaking change @trigaten
- Removed deprecated monitor wrapper and associated dead code @gianlucadecola
- Unused striker and thrower MuJoCo envs moved to https://github.com/RushivArora/Gym-Mujoco-Archive @RushivArora
Many minor bug fixes (@vwxyzjn , @RedTachyon , @rusu24edward , @Markus28 , @dsctt , @andrewtanJS , @tristandeleu , @duburcqa)