AI Fundamentals: Master the Core of Artificial Intelligence

2.5. Generative AI

Tk 99

Buy Now

Already purchased? To view Sign In

AI Fundamentals Crash Course একটি সম্পূর্ণ বেসিক-টু-ফাউন্ডেশন লেভেলের কোর্স, যেখানে আপনি শিখবেন কীভাবে Artificial Intelligence (AI) কাজ করে এবং আধুনিক AI টুল ও সিস্টেমের মূল কনসেপ্টগুলো পরিষ্কারভাবে বুঝে নিতে হয়—একেবারে শূন্য থেকে।

Definition

Generative AI is a branch of artificial intelligence that creates new data or content, instead of just analyzing existing data.

Key Feature: It generates novel outputs — text, images, video, audio, or 3D models.
Examples:
- ChatGPT → Generates text and conversations.
- DALL·E → Creates images from text descriptions.

Techniques in Generative AI

1. Large Language Models (LLMs)

Neural networks trained on vast text datasets.
Learn word relationships and context to predict and generate coherent text.
Foundation for applications like ChatGPT.

2. Diffusion Models

Primarily used for image and video generation.
Start with random noise and refine it step by step into a realistic image.
Popular in tools like Stable Diffusion.

3. Generative Adversarial Networks (GANs)

Introduced in 2014.
Consist of two parts:
- Generator: Creates synthetic content.
- Discriminator: Judges realism.
Both improve together, producing high-quality, lifelike outputs.

4. Neural Radiance Fields (NeRFs)

Specialized for 3D modeling.
Generate highly realistic 3D scenes and environments.

5. Hybrid Models

Combine multiple approaches (e.g., LLMs + GANs).
Enhance capabilities by leveraging strengths of different techniques.

Impact and Applications

Industry Revolution:

Transforming entertainment, media, architecture, and healthcare by enabling realistic models, creative content, and simulations.
Corporate Influence:

Big Tech companies are investing heavily in Generative AI, expanding its use across text, images, audio, video, and 3D.
Future Potential:

Generative AI is set to reshape business operations, offering powerful ways to create, customize, and manipulate digital content.

Generative AI models

Text Generation Models (LLMs)

GPT (OpenAI) → Powers ChatGPT.
Bard / Gemini (Google DeepMind) → Google’s conversational AI.
Claude (Anthropic) → Known for safe, helpful dialogue.
LLaMA (Meta / Facebook) → Open-source large language model.
Mistral → Lightweight, high-performance LLMs.

Image Generation Models

DALL·E (OpenAI) → Text-to-image generation.
Stable Diffusion (Stability AI) → Open-source image generator.
MidJourney → High-quality, artistic image generation.
Imagen (Google DeepMind) → Text-to-image model.

Video Generation Models

Sora (OpenAI) → Text-to-video generation.
Pika Labs → Creative video generation.
Runway Gen-2 → Video from text or images.
Make-A-Video (Meta) → Research-based text-to-video.

Audio & Speech Generation

VALL-E (Microsoft) → Voice cloning and speech synthesis.
AudioLM (Google) → Natural, high-quality speech & music.
MusicLM (Google) → AI music composition.
ElevenLabs → Popular for realistic text-to-speech.

3D / Specialized Models

Neural Radiance Fields (NeRFs) → 3D scene generation.
Point-E (OpenAI) → 3D model generation from text.
DreamFusion (Google) → 3D generation using diffusion models.

resently

Instructor

Pijush Saha

Pijush Saha is the Digital Marketing Consultant, Coach and Ex Google Employee. He has been working for 12 years in the digital marketing sector involving predominantly in Performance Marketing including SEO, Media Buying, & Web Analytics.

Syllabus

Introduction To AI

6 Lessons

2.5. Generative AI

Tk 99

Definition

Techniques in Generative AI

1. Large Language Models (LLMs)

2. Diffusion Models

3. Generative Adversarial Networks (GANs)

4. Neural Radiance Fields (NeRFs)

5. Hybrid Models

Impact and Applications

Generative AI models

Text Generation Models (LLMs)

Image Generation Models

Video Generation Models

Audio & Speech Generation

3D / Specialized Models

Instructor

Pijush Saha

Syllabus

Introduction To AI

1.1. What is Artificial Intelligence (AI)

1.2. Types of AI

1.3. History of Artificial Intelligence Development

1.4. Data Science, AI, ML & DL

1.5. Data in AI

1.6. Cost of AI Management

Core AI Jargons

2.1. Machine Learning

2.2. Deep Learning

2.3. Computer Vision

2.4. Robotics

2.5. Generative AI

2.6. LLM

2.7. Prompt Engineering

2.8. RAG

2.9. MCP Server

2.10. Big Data

2.11. Context Engineering