Get in Touch

Course Outline

Hunyuan Multimodal Foundations and Lab Setup

  • Grasping Hunyuan's multimodal capabilities for image, 3D, and video use cases.
  • Identifying practical business scenarios for creative, product, and content teams.
  • Preparing the lab environment, sample assets, and model access.
  • Executing initial generation tasks and reviewing outputs.

Prompt Design and Workflow Patterns

  • Structuring prompts to ensure consistent multimodal results.
  • Utilizing text prompts, reference images, and fundamental input settings.
  • Selecting appropriate workflows for image, video, or 3D generation.
  • Iterating on prompts based on output quality and business objectives.

Image Generation and Review Labs

  • Producing marketing, product, and concept images from prompts.
  • Refining visual style, composition, and content consistency.
  • Evaluating outputs for utility, quality, and brand alignment.
  • Organizing image outputs for approval and subsequent use.

Video Generation Labs

  • Creating short video outputs from prompts and prepared inputs.
  • Managing style, scene intent, and output variation.
  • Assessing videos for clarity, continuity, and practical applicability.
  • Preparing video outputs for demonstrations or content workflows.

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs.
  • Inspecting geometry, texture quality, and asset usability.
  • Exporting assets for visualization, prototyping, or content pipelines.
  • Comparing scenarios where 3D generation is suitable versus image or video workflows.

Integration, Governance, and Next Steps

  • Delivering generated assets via simple applications, services, or APIs.
  • Linking multimodal outputs to product, content, and review workflows.
  • Applying practical checks for quality, brand safety, copyright compliance, and responsible usage.
  • Planning pilot use cases and subsequent steps for internal adoption.

Requirements

  • Foundational understanding of AI and generative AI concepts.
  • Experience using web applications, APIs, or standard developer tools.
  • Basic proficiency in Python or scripting languages.

Target Audience

  • Developers creating AI-enhanced product features.
  • Technical product managers and solution architects.
  • Innovation, media, and digital teams managing image, video, or 3D content.
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories