visual-storytelling

Star

Here are 64 public repositories matching this topic...

UCSC-VLAA / story-iter

Star

[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization

storytelling generative-model generative-art image-generation visual-storytelling diffusion-models

Updated Apr 2, 2026
Python

haoningwu3639 / StoryGen

Star

[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

video-generation visual-storytelling diffusion-models

Updated Dec 2, 2024
Python

eric-xw / AREL

Star

Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"

rl inverse-reinforcement-learning adversarial-learning vision-and-language visual-storytelling adversarial-reward-learning

Updated Jan 19, 2021
Python

Pendulibrium / ai-visual-storytelling-seq2seq

Star

Implementation of seq2seq model for Visual Storytelling Challenge (VIST) http://visionandlanguage.net/VIST/index.html

keras recurrent-neural-networks seq2seq encoder-decoder visual-storytelling

Updated Aug 17, 2018
Python

DuNGEOnmassster / VideoGen-of-Thought

Star

[Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

video video-generation visual-storytelling diffusion-models multishot-video-generation

Updated Sep 22, 2025
Python

dianaglzrico / neural-visual-storyteller

Star

An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-located with NAACL 2018.

natural-language-processing deep-neural-networks computer-vision natural-language-generation visual-storytelling

Updated Mar 10, 2019
Python

wanshuiyin / ARIS-Movie-Director

Star

Agentic, long-horizon visual generation: a fuzzy story → a cross-model-audited image-based movie. Brings ARIS's research-wiki + multi-agent debate to multimodal generation (intelligence lives in the agent; the diffusion model just renders). Image-based today, video next.

image-generation ai-agents claude multimodal visual-storytelling aris llm generative-ai anthropic ai-filmmaking agent-skills comic-generation gpt-image cross-model-verification

Updated Jun 23, 2026
Python

Autfy / StoryBoard-AI

Star

将故事和角色图像转化为电影分镜脚本，生成专业的画面提示词和图生视频提示词。

gemini videos gemini-api visual-storytelling animatic gemini-pro veo3 veo-3-1

Updated Feb 8, 2026
TypeScript

ydsgangge-ux / StoryCanvas

Star

自由画布叙事创作系统 — AI-powered story canvas for novels, screenplays & world-building. Supports multi-timeline narratives, 35 block types, 35 story cards. 在画布上摆放、连接、组织叙事元素，AI 协助将结构转化为高质量正文。支持小说、剧本、史诗级世界构建，对标魔兽世界·冰与火之歌·从零开始的异世界生活的创作体量。

canvas worldbuilding screenplay creative-writing visual-storytelling narrative-design novel-writing ai-writing llm story-canvas

Updated Jun 23, 2026
TypeScript

passerby233 / VSCMR-Visual-Storytelling-with-Corss-Modal-Rules

Star

Visual Storytelling with Cross-Modal Rules

vision-and-language visual-storytelling multi-modal-rule-mining

Updated Feb 26, 2020
Jupyter Notebook

marialymperaiou / knowledge-enhanced-multimodal-learning

Star

A list of research papers on knowledge-enhanced multimodal learning

Updated Dec 8, 2022

Stry233 / Visual-Story-Generation-Based-on-Emotional-and-Keyword-Scheme

Star

We designed an end-to-end framework that encourage interactive narrative experience based on keyword control and image generation.

storytelling language-model emotion-detection visual-storytelling

Updated Mar 5, 2024
Jupyter Notebook

microsoft / Kahani

Star

Official Codebase for "Kahani: Culturally Nuanced Visual Storytelling Pipeline for Non-Western Cultures"

culture visual-storytelling

Updated Feb 25, 2025
Python

vwprompt / vwp

Star

TACL 23: Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences

storytelling dataset story characters coherence nlg multimodality nlg-dataset story-generation multimodal-learning vision-and-language multimodal-deep-learning visual-storytelling vision-language vision-language-model

Updated Sep 10, 2025
Python

ehcastroh / intro_DATAVIZ

Star

DATA-X: m130 - Introduction to Visual Principles Using Matplotlib and Seaborn. Provides users with the necessary foundations for building and understanding current state of the art visualizations. An additional aim is to provide users with an understanding of both the theory and techniques of various visualization paradigms. Finally, this series…

data-visualization seaborn matplotlib visualizations matplotlib-tutorial visual-storytelling seaborn-tutorial data-x uc-berkeley-engineering

Updated Feb 23, 2021
Jupyter Notebook

chandan047 / GCN-GLAC

Star

Graph convolution-based visual storytelling

image-captioning image-to-text encoder-decoder bilstm visual-storytelling graph-convolution text-generation-using-rnn gcn-glac glac-net

Updated Feb 28, 2021
Jupyter Notebook

emineugurlu / ASIA

Star

A high-impact landing page design exploring advanced CSS compositing. Features dynamic video masking, interactive 'mix-blend-mode' typography, and a hardware-accelerated cinematic UI architecture.

video-background frontend-engineering visual-storytelling advanced-css css-blend-mode ui-ux-design landing-page-design web-motion