MovieActions-1K: Multimodal Video-Text Collection from Hollywood Films

MIT

Description

Overview

This dataset features 1,000 short movie clips, each paired with detailed text descriptions in a video-text modality. Sourced from the Hollywood2 dataset (as introduced in the paper below), it focuses on diverse human actions in realistic movie scenes. The 1,000 clips were curated through a structured process to ensure relevance and clarity. This collection is ideal for Generative AI (GenAI) applications like video-to-text generation, text-to-video synthesis, and multimodal content creation, while also supporting action recognition, video analysis, and related tasks in computer vision and NLP.

Dataset Details

  • Video Specifications:
    • Duration: Around 20 seconds on average for efficient processing.
    • Quality: Realistic movie quality with natural settings, no artificial filters, capturing authentic human movements.
    • Camera Angle: Varied angles typical of film footage to highlight actions and contexts.
  • Annotations: Text descriptions detailing actions, movements, and scene context for paired learning.
  • Size: 1,000 videos selected from the original Hollywood2 collection.
  • Format: Standard formats for ML pipeline integration.

Potential Applications

  • Generative AI Focus:
    • Fine-tuning models for video captioning.
    • Generating narrative descriptions from visuals.
    • Text-based video retrieval.
    • Creating synthetic movie content via multimodal learning.
  • Other Uses:
    • Action detection in computer vision.
    • Scene understanding and behavior analysis in video analytics.
    • Benchmarking on realistic movie data for research.

Inspired by the Hollywood2 dataset, this resource empowers GenAI innovation in movie-related AI.


Citation

If using this dataset, please cite: @InProceedings{marszalek09, author = "Marcin Marsza{\l}ek and Ivan Laptev and Cordelia Schmid", title = "Actions in Context", booktitle = "IEEE Conference on Computer Vision & Pattern Recognition", year = "2009" }

Dataset Information

Number of Datapoints

1000

Created By

daksh

Timestamps

Created At

August 19th, 2025 9:13 AM

Last Updated

August 20th, 2025 3:45 PM

Sr. No.VideoDescription
1

The video unfolds in a dimly lit interrogation room, setting a dramatic and suspenseful tone. The scene features two women as the main characters, eng...

2

In this dramatic scene set in a dimly lit room, the atmosphere is charged with tension as two women engage in a heated argument while a man observes, ...

3

The video unfolds within the confines of a dimly lit prison, establishing a tense and eerie atmosphere. Two main characters are seated behind bars, en...

4

The video unfolds in a dimly lit room, creating an intimate and suspenseful atmosphere characteristic of the noir aesthetic, yet the scene is set with...

5

The video presents a drama scene set in a dimly lit room, where the suspenseful atmosphere is crafted through the use of low-angle shots and soft ligh...

6

In this intense drama scene set in a dimly lit room, the atmosphere is charged with tension as a man, a woman, and a girl are the central figures. The...

7

In this gripping thriller scene, the setting is a dimly lit interrogation room, where the atmosphere is charged with tension and suspense. The room's ...

8

In this gripping thriller scene, the setting is a dark, possibly abandoned location that heightens the suspenseful atmosphere. The main character is p...

9

In this noir-themed scene, the setting is a dark, urban environment, captured through stark black-and-white cinematography that emphasizes high contra...

10

The video unfolds in a noir drama setting, specifically within a dimly lit restaurant or lounge, where the atmosphere is thick with tension and suspen...

PRODUCTS

AI Engine

Enterprise solutions

De-AI Workflows

Akai Marketplace

Akailon

COMPANY

About Us

Careers

Security

Privacy

RESOURCES

Partners

Contact

Blog

GUIDE

Akailon Guide

Get started with labeling

Partner with us

Marketplace

Onboarding Enterprise

SOCIALS

Copyright © 2025 Akai Space Labs. All rights reserved.

Terms & Privacy policies