Hugging Face
Datasets: 400
Sort: Trending
Active filters: image-to-text
RekaAI/VibeEval • Viewer • Updated 4 days ago • 23 • 27
google/docci • Updated 4 days ago • 124 • 15
visual_genome • Preview • Updated Jun 29, 2023 • 2.5k • 59
MMInstruction/M3IT • Updated Nov 24, 2023 • 7.05k • 96
McGill-NLP/WebLINX • Viewer • Updated Mar 29 • 690 • 45
CaptionEmporium/anime-caption-danbooru-2021-sfw-5m-hq • Viewer • Updated 1 day ago • 4
huggan/wikiart • Viewer • Updated Mar 22, 2023 • 644 • 82
silk-road/MMC4-130k-chinese-image • Viewer • Updated May 16, 2023 • 6
pixparse/idl-wds • Viewer • Updated Mar 29 • 5.29k • 104
conceptual_captions • Viewer • Updated Jan 18 • 14.9k • 54
wikimedia/wit_base • Viewer • Updated Nov 4, 2022 • 177 • 34
gigant/oldbookillustrations • Viewer • Updated Dec 18, 2023 • 46 • 25
kakaobrain/coyo-700m • Viewer • Updated Aug 30, 2022 • 240 • 110
jainr3/diffusiondb-pixelart • Viewer • Updated May 11, 2023 • 127 • 15
dinhanhx/crossmodal-3600 • Viewer • Updated Jun 6, 2023 • 3 • 2
wendlerc/RenderedText • Updated Jul 12, 2023 • 4 • 14
zzliang/GRIT • Viewer • Updated Jul 4, 2023 • 13 • 102
DBQ/Net.a.Porter.Product.prices.Tunisia • Viewer • Updated Nov 19, 2023 • 1
phiyodr/InpaintCOCO • Viewer • Updated 6 days ago • 9 • 1
pixparse/pdfa-eng-wds • Viewer • Updated Mar 29 • 4.56k • 83
CaptionEmporium/furry-e621-sfw-7m-hq • Viewer • Updated Mar 21 • 2
csebuetnlp/illusionVQA-Comprehension • Viewer • Updated 4 days ago • 26 • 1
Afeng-x/Draw-and-Understand • Preview • Updated Apr 1 • 3
Xiao215/pixiv-image-with-caption • Viewer • Updated 17 days ago • 5 • 1
rootsautomation/RICO-Screen2Words • Viewer • Updated 20 days ago • 33 • 2
FreedomIntelligence/MileBench • Viewer • Updated 7 days ago • 1 • 1
Voxel51/Total-Text-Dataset • Viewer • Updated 5 days ago • 1
Hamdy20002/COCO_Person • Viewer • Updated 3 days ago • 14 • 1
red_caps • Updated Jan 18 • 102k • 54
facebook/winoground • Updated 21 days ago • 13.3k • 75