Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Tasks
1
Sizes
Sub-tasks
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
30
new
Full-text search
Edit filters
Sort: Trending
Active filters:
reinforcement-learning
Clear all
jat-project/jat-dataset
Viewer
•
Updated
Feb 16
•
617
•
26
DIBT/aya_dutch_dpo
Viewer
•
Updated
3 days ago
•
1
•
2
stonet2000/robot_demos_with_state_reset
Updated
Oct 24, 2023
•
1
•
2
wdcqc/starcraft-remastered-melee-maps
Updated
Jan 6, 2023
•
2
jrahn/yolochess_deepblue
Viewer
•
Updated
Feb 3, 2023
jrahn/yolochess_lichess-elite_2211
Viewer
•
Updated
Feb 8, 2023
•
4
sunzeyeah/chinese_chatgpt_corpus
Updated
Mar 23, 2023
•
4
•
79
crystalai/autotrain-data-crystal_alchemist-vision
Updated
Aug 25, 2023
•
1
PetraAI/PetraAI
Updated
Sep 14, 2023
•
47
•
16
haosulab/ManiSkill2
Updated
Jan 26
•
2
•
4
aarontung/test
Viewer
•
Updated
Sep 7, 2023
•
1
•
1
imone/D4RL
Updated
Aug 30, 2023
•
2
neovalle/H4rmony
Viewer
•
Updated
6 days ago
•
1
•
15
im-Kitsch/minari_d4rl
Updated
Sep 13, 2023
erhwenkuo/hh_rlhf-chinese-zhtw
Viewer
•
Updated
Oct 4, 2023
•
6
jxu124/OpenX-Embodiment
Updated
Nov 1, 2023
•
9.17k
•
21
nmd2k/apps_rlaif
Viewer
•
Updated
Nov 27, 2023
deepghs/quality_rlhf
Viewer
•
Updated
Nov 24, 2023
Trofish/Korean-RLHF-Full-process
Preview
•
Updated
Jan 11
•
4
davanstrien/haiku_dpo
Updated
Mar 13
•
6
•
39
neovalle/H4rmony_dpo
Viewer
•
Updated
Feb 5
•
177
•
10
fblgit/simple-math-DPO
Viewer
•
Updated
Jan 27
•
4
•
12
puyuan1996/pong_muzero_2episodes_gsl400_v0.0.4
Updated
Mar 7
dynamicslab/KoopmanRL
Updated
Feb 29
DIBT/10k_prompts_ranked
Viewer
•
Updated
Mar 7
•
1.51k
•
122
swaroop-nath/opin-pref
Viewer
•
Updated
Feb 23
Felladrin/ChatML-H4rmony_dpo
Viewer
•
Updated
Feb 23
gallantVN/en_vi_DPO
Viewer
•
Updated
Mar 3
•
1
haosulab/ManiSkill
Updated
Mar 15
•
2
gbenson/webui-dom-snapshots
Viewer
•
Updated
1 day ago
•
64