Let's build GPT: from scratch, in code, spelled out
Videos
Jan 27, 2023
by Sophia

In this video, Andrej Karpathy demonstrates how to build a Generatively Pretrained Transformer (GPT) from scratch in code, following the paper "Attention Is All You Need" and OpenAI's GPT-2 / GPT-3. Make sure to watch at least part of it!
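
For readers who want a feel for the core idea before watching, here is a minimal sketch of single-head causal self-attention, the scaled dot-product mechanism described in "Attention Is All You Need" and used in GPT-style models. It is not the code from the video. The function names `softmax` and `causal_self_attention` are hypothetical, and the sketch assumes queries, keys, and values are passed in directly, whereas a real model would compute them from token embeddings with learned linear projections.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def causal_self_attention(queries, keys, values):
    """Single-head scaled dot-product attention with a causal mask.

    queries, keys: lists of T vectors, each of length d_k.
    values: list of T vectors.
    Position t attends only to positions 0..t, never to later ones.
    """
    d_k = len(keys[0])
    dim = len(values[0])
    outputs = []
    for t, q in enumerate(queries):
        # Score the query against the current and earlier keys only;
        # skipping later positions is what makes the attention causal.
        scores = [
            sum(qi * ki for qi, ki in zip(q, keys[s])) / math.sqrt(d_k)
            for s in range(t + 1)
        ]
        weights = softmax(scores)
        # Each output is a weighted average of the visible value vectors.
        out = [
            sum(w * values[s][j] for s, w in enumerate(weights))
            for j in range(dim)
        ]
        outputs.append(out)
    return outputs


# Toy input: three "tokens" with two features each (illustrative values only).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out = causal_self_attention(x, x, x)
```

The first position can only see itself, so its output equals its own value vector. Changing a later token leaves the outputs at earlier positions unchanged, which is the property that lets a GPT-style model be trained to predict the next token.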

SF Bay Area media and education platform focused on AI and Data. As a voice of the AI industry, Data Phoenix delivers news, insights, practical knowledge and helps companies be heard in the community.

Copyright © 2025 Data Phoenix. Published with Ghost and Data Phoenix.