Final Year Project

AniAd

End-to-End AI-Powered Text-to-Animation Pipeline

Transform written scripts into fully animated videos using AI techniques spanning NLP, computer vision, deep learning, and generative models.

8 AI Modules
30% Completed
4 AI Domains
Overview

What is AniAd?

AniAd is an AI system designed to automate the entire animation production pipeline, from a plain text script to a fully rendered animated video with characters, voices, music, and synchronized animations.

The project combines several AI domains in a single system:

Natural Language Processing, Deep Learning, Computer Vision, Generative AI, Speech Synthesis, 3D Reconstruction
Script → AI Processing → Animation
System Architecture

Modular AI Pipeline

Eight interconnected modules that run as a single pipeline

01

Script Processing

NLP-powered analysis to extract dialogues, scenes, and emotions from raw scripts.

NLP, Text Analysis, Emotion Detection
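To make the extraction step concrete, here is a minimal sketch of splitting a raw script into scenes and dialogue lines. It assumes a hypothetical script format (`SCENE: ...` headings and `NAME (emotion): text` lines) that is not taken from the project, and it reads the emotion from an explicit annotation rather than predicting it with an NLP model, as the real module would.

```python
import re
from dataclasses import dataclass, field

# Hypothetical script format, assumed for illustration only:
#   SCENE: <description>
#   NAME (emotion): spoken line        (the emotion tag is optional)
SCENE_RE = re.compile(r"^SCENE:\s*(?P<desc>.+)$")
LINE_RE = re.compile(
    r"^(?P<speaker>[A-Za-z ]+?)\s*(?:\((?P<emotion>[a-z]+)\))?:\s*(?P<text>.+)$"
)


@dataclass
class Dialogue:
    speaker: str
    text: str
    emotion: str = "neutral"


@dataclass
class Scene:
    description: str
    dialogues: list = field(default_factory=list)


def parse_script(raw: str) -> list:
    """Group dialogue lines under the most recent scene heading."""
    scenes = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        scene_match = SCENE_RE.match(line)
        if scene_match:
            scenes.append(Scene(scene_match.group("desc")))
            continue
        line_match = LINE_RE.match(line)
        # Dialogue that appears before any scene heading is ignored.
        if line_match and scenes:
            scenes[-1].dialogues.append(
                Dialogue(
                    speaker=line_match.group("speaker").strip(),
                    text=line_match.group("text"),
                    emotion=line_match.group("emotion") or "neutral",
                )
            )
    return scenes
```

Calling `parse_script` on a short script returns a list of `Scene` objects whose `dialogues` carry the speaker, the spoken text, and an emotion label that later modules could consume.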
02

Voice-over & Dialogue

Text-to-speech generation with natural voice synthesis and lip-sync capabilities.

TTS, Speech Synthesis, Lip Sync
03

Background Music & Sound

AI-generated background music and contextual sound effects matching scene mood.

Audio Gen, Music AI, SFX
04

Emotion Recognition & Animation

Detects emotions and maps them to character expressions and body language.

Emotion AI, Expression Mapping, Animation
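One simple way to picture the mapping from a detected emotion to a character pose is a lookup table scaled by the detector's confidence. The sketch below is an assumption for illustration: the emotion labels, pose names, and intensity values are placeholders, not the project's actual animation presets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Expression:
    face: str         # facial pose name (placeholder)
    gesture: str      # body-language pose name (placeholder)
    intensity: float  # 0.0 (subtle) to 1.0 (exaggerated)


# Hypothetical lookup table; labels and poses are illustrative only.
EXPRESSION_MAP = {
    "happy": Expression("smile", "open_arms", 0.8),
    "sad": Expression("frown", "slumped_shoulders", 0.6),
    "angry": Expression("furrowed_brow", "clenched_fists", 0.9),
    "neutral": Expression("relaxed", "idle", 0.2),
}


def expression_for(emotion: str, confidence: float = 1.0) -> Expression:
    """Pick a pose for the emotion and scale its intensity by confidence."""
    base = EXPRESSION_MAP.get(emotion.lower(), EXPRESSION_MAP["neutral"])
    clamped = max(0.0, min(1.0, confidence))
    return Expression(base.face, base.gesture, round(base.intensity * clamped, 2))
```

Unknown labels fall back to the neutral pose, so an unexpected label from the detector still yields a usable pose.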
05

Character & Environment

Generates 2D/3D characters and environments from templates or descriptions.

Generative AI, Asset Creation, Templates
06

2D-to-3D Conversion

Converts single 2D images into 3D models using TripoSR, a transformer-based single-image 3D reconstruction model.

TripoSR, 3D Reconstruction, Transformers
07

Film/Ad Rendering

Merges and synchronizes all elements to produce professional-quality video output.

Video Rendering, Synchronization, Export
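Synchronization can be thought of as placing every clip on a shared timeline before rendering. The sketch below assumes the simplest case, where dialogue clips play back to back with a fixed pause between them. The `Clip` structure, the 0.25-second gap, and the 24 fps default are illustrative assumptions, not details from the project.

```python
import math
from dataclasses import dataclass


@dataclass
class Clip:
    label: str
    duration: float  # seconds


@dataclass
class TimedClip:
    label: str
    start: float  # seconds from the start of the video
    end: float


def build_timeline(clips, gap=0.25):
    """Lay clips out sequentially, leaving `gap` seconds between them."""
    timeline = []
    cursor = 0.0
    for clip in clips:
        timeline.append(TimedClip(clip.label, cursor, cursor + clip.duration))
        cursor += clip.duration + gap
    return timeline


def total_frames(timeline, fps=24):
    """Number of video frames needed to cover the last clip's end time."""
    if not timeline:
        return 0
    return math.ceil(timeline[-1].end * fps)
```

Animation, lip-sync, and music tracks could then be aligned against the same `start` and `end` values so that every element shares one clock.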
08

Platform & UI

User-friendly interface for script upload, preview, and final video download.

Frontend, Backend, Integration
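The modular design above can be sketched as a chain of stages that each read and extend a shared context. This is only an illustration of the idea: the stage bodies are stand-ins, only three of the eight modules are shown, and the function names and the `Context` dictionary are hypothetical rather than the project's actual interfaces.

```python
from typing import Any, Callable, Dict, List

Context = Dict[str, Any]
Stage = Callable[[Context], Context]


# Placeholder stages: names follow the module list, bodies are stand-ins.
def script_processing(ctx: Context) -> Context:
    # Treat blank-line-separated blocks as scenes (a simplifying assumption).
    scenes = [s.strip() for s in ctx["script"].split("\n\n") if s.strip()]
    return {**ctx, "scenes": scenes}


def voice_over(ctx: Context) -> Context:
    # Stand-in for TTS: one placeholder audio track name per scene.
    tracks = [f"scene_{i}.wav" for i, _ in enumerate(ctx["scenes"], start=1)]
    return {**ctx, "audio_tracks": tracks}


def rendering(ctx: Context) -> Context:
    # Stand-in for the final render step.
    return {**ctx, "video": f"{len(ctx['scenes'])} scenes rendered"}


def run_pipeline(script: str, stages: List[Stage]) -> Context:
    """Run each stage in order, recording which stages have completed."""
    ctx: Context = {"script": script, "completed": []}
    for stage in stages:
        ctx = stage(ctx)
        ctx["completed"].append(stage.__name__)
    return ctx
```

Because every stage has the same signature, a module can be developed, replaced, or skipped without changing the rest of the chain.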
Technology

Tech Stack

AI/ML Frameworks

PyTorch, TensorFlow, Hugging Face, OpenCV

NLP & LLMs

LangChain, Transformers, spaCy, NLTK

3D & Vision

TripoSR, Open3D, PyTorch3D, Blender API

Backend & Tools

Python, FastAPI, Docker, Git
Gallery

Project Screenshots

TripoSR-powered 2D-to-3D Conversion in action

[Screenshot: 2D to 3D conversion output]
[Screenshot: Admin panel interface]
[Screenshot: Dashboard interface]

Interested in AniAd?

This project is part of my Final Year Project at COMSATS University and demonstrates my expertise in building end-to-end AI systems.