JoyCaptionBETA
Advanced Open-Source Image Captioning Tool
An uncensored Visual Language Model (VLM) designed for generating detailed image captions across multiple styles and formats.
Artistic Styles
JoyCaption covers a diverse range of artistic styles, ensuring comprehensive caption generation for any type of image.
Prompt Modes
Choose from 10 different prompting modes including descriptive captions, Stable Diffusion prompts, MidJourney and more.
Training Images
Over 50,000 images used in model training, creating a robust foundation for accurate and diverse captioning.
JoyCaption Live Demo
Upload an image and see JoyCaption in action. Choose from different caption styles and options.
Loading...
JoyCaption Key Features
Open & Uncensored
Released with open weights and no restrictions, JoyCaption provides equal coverage for both SFW and NSFW content.
Open Source
Fully open model with available training scripts that encourages community contributions and transparent development.
Multiple Models
Supports various complexity levels from 17GB bfloat16 to 4-bit quantized versions for lower VRAM requirements.
Wide Content Support
Covers digital art, photorealistic images, anime, furry art, and diverse styles, ethnicities, and orientations.
JoyCaption Prompt Modes
JoyCaption offers 10 different prompting modes to suit various use cases and preferences.
Descriptive Caption
General purpose detailed description of the image contents.
Stable Diffusion
Optimized for text-to-image generation with Stable Diffusion models.
MidJourney
Prompt style optimized for the MidJourney image generation model.
Danbooru
Tagging style reminiscent of anime/manga image boorus.
JoyCaption Use Cases
AI Art Training
Generate high-quality captions for training or fine-tuning text-to-image diffusion models with appropriate descriptions.
Dataset Captioning
Quickly and efficiently caption large datasets of images for machine learning training purposes.
Content Marketing
Accelerate the creation of image descriptions for content marketing, social media, and SEO optimization.
Developer Integration
Integrate into your workflows with tools like ComfyUI, batch processing, or via vLLM with an OpenAI-compatible API.
JoyCaption Caption Gallery
See examples of JoyCaption outputs in different styles and for various image types.

Art Portrait
A painted portrait of a woman with flowing red hair, green eyes, and delicate features, digital art style

Landscape Photography
Mountains reflected in a still lake at sunset, with dramatic clouds and golden lighting

Anime Character
1girl, blue_hair, red_eyes, school_uniform, smile, looking_at_viewer, headphones, outdoors

3D Render
A futuristic sci-fi city with tall spires, flying vehicles, and holographic displays, rendered in 3D with ray tracing
JoyCaption How It Works
JoyCaption's image analysis and caption generation process explained.
Image Analysis
The model analyzes your uploaded image, identifying objects, people, styles, colors, and composition.
Context Processing
The identified elements are processed through the language model to create contextually relevant descriptions.
Caption Generation
Based on your selected prompting mode, the final caption is generated with appropriate styling and formatting.
JoyCaption Resources
Hugging Face Demo
Try JoyCaption directly in your browser with the official Hugging Face Space demo.
GitHub Repository
Access the source code, contribute to development, or download for local implementation.
Community Models
Explore related models, forks, and community implementations on Civitai.
Start Captioning Your Images Today
Experience the power of open-source, uncensored image captioning with JoyCaption Beta One.