Bongiysite

AI Fundamentals: Master the Core of Artificial Intelligence

2.5. Generative AI

Tk 99

Already purchased? To view Sign In

AI Fundamentals Crash Course āĻāĻ•āϟāĻŋ āϏāĻŽā§āĻĒā§‚āĻ°ā§āĻŖ āĻŦ⧇āϏāĻŋāĻ•-āϟ⧁-āĻĢāĻžāωāĻ¨ā§āĻĄā§‡āĻļāύ āϞ⧇āϭ⧇āϞ⧇āϰ āϕ⧋āĻ°ā§āϏ, āϝ⧇āĻ–āĻžāύ⧇ āφāĻĒāύāĻŋ āĻļāĻŋāĻ–āĻŦ⧇āύ āϕ⧀āĻ­āĻžāĻŦ⧇ Artificial Intelligence (AI) āĻ•āĻžāϜ āĻ•āϰ⧇ āĻāĻŦāĻ‚ āφāϧ⧁āύāĻŋāĻ• AI āϟ⧁āϞ āĻ“ āϏāĻŋāĻ¸ā§āĻŸā§‡āĻŽā§‡āϰ āĻŽā§‚āϞ āĻ•āύāϏ⧇āĻĒā§āϟāϗ⧁āϞ⧋ āĻĒāϰāĻŋāĻˇā§āĻ•āĻžāϰāĻ­āĻžāĻŦ⧇ āĻŦ⧁āĻā§‡ āύāĻŋāϤ⧇ āĻšāϝāĻŧ—āĻāϕ⧇āĻŦāĻžāϰ⧇ āĻļā§‚āĻ¨ā§āϝ āĻĨ⧇āϕ⧇āĨ¤

Definition

Generative AI is a branch of artificial intelligence that creates new data or content, instead of just analyzing existing data.

  • Key Feature: It generates novel outputs — text, images, video, audio, or 3D models.

  • Examples:

    • ChatGPT → Generates text and conversations.

    • DALL·E → Creates images from text descriptions.

Techniques in Generative AI

1. Large Language Models (LLMs)

  • Neural networks trained on vast text datasets.

  • Learn word relationships and context to predict and generate coherent text.

  • Foundation for applications like ChatGPT.

2. Diffusion Models

  • Primarily used for image and video generation.

  • Start with random noise and refine it step by step into a realistic image.

  • Popular in tools like Stable Diffusion.

3. Generative Adversarial Networks (GANs)

  • Introduced in 2014.

  • Consist of two parts:

    • Generator: Creates synthetic content.

    • Discriminator: Judges realism.

  • Both improve together, producing high-quality, lifelike outputs.

4. Neural Radiance Fields (NeRFs)

  • Specialized for 3D modeling.

  • Generate highly realistic 3D scenes and environments.

5. Hybrid Models

  • Combine multiple approaches (e.g., LLMs + GANs).

  • Enhance capabilities by leveraging strengths of different techniques.

Impact and Applications

  • Industry Revolution:

    Transforming entertainment, media, architecture, and healthcare by enabling realistic models, creative content, and simulations.

  • Corporate Influence:

    Big Tech companies are investing heavily in Generative AI, expanding its use across text, images, audio, video, and 3D.

  • Future Potential:

    Generative AI is set to reshape business operations, offering powerful ways to create, customize, and manipulate digital content.

Generative AI models

Text Generation Models (LLMs)

  • GPT (OpenAI) → Powers ChatGPT.

  • Bard / Gemini (Google DeepMind) → Google’s conversational AI.

  • Claude (Anthropic) → Known for safe, helpful dialogue.

  • LLaMA (Meta / Facebook) → Open-source large language model.

  • Mistral → Lightweight, high-performance LLMs.

Image Generation Models

  • DALL·E (OpenAI) → Text-to-image generation.

  • Stable Diffusion (Stability AI) → Open-source image generator.

  • MidJourney → High-quality, artistic image generation.

  • Imagen (Google DeepMind) → Text-to-image model.

Video Generation Models

  • Sora (OpenAI) → Text-to-video generation.

  • Pika Labs → Creative video generation.

  • Runway Gen-2 → Video from text or images.

  • Make-A-Video (Meta) → Research-based text-to-video.

Audio & Speech Generation

  • VALL-E (Microsoft) → Voice cloning and speech synthesis.

  • AudioLM (Google) → Natural, high-quality speech & music.

  • MusicLM (Google) → AI music composition.

  • ElevenLabs → Popular for realistic text-to-speech.

3D / Specialized Models

  • Neural Radiance Fields (NeRFs) → 3D scene generation.

  • Point-E (OpenAI) → 3D model generation from text.

  • DreamFusion (Google) → 3D generation using diffusion models.

resently

Instructor

Pijush Saha

Pijush Saha is the Digital Marketing Consultant, Coach and Ex Google Employee. He has been working for 12 years in the digital marketing sector involving predominantly in Performance Marketing including SEO, Media Buying, & Web Analytics.