工gin師 | Poke

34:05

OpenAI GPT o1: New Model Uses CoT, Is Optimized for STEM, and Is as Capable as a PhD Student (OpenAI o1 System Card Walkthrough)

19:38

[Paper Walkthrough] Segment Anything Model 2 (SAM 2)

22:35

[Paper Walkthrough] PromptBreeder: Combining Genetic Algorithms with Prompt Engineering

19:01

[Paper Walkthrough] Why Does AI Voice Chat Lack a Human Touch? How to Get Turn Taking and Backchanneling Right in AI Voice Conversation

22:02

[Paper Walkthrough] Scaling Laws of RoPE-based Extrapolation: A Study of LLM Long-Context Extrapolation

22:02

[Paper Walkthrough] GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection (Reducing Training Hardware Requirements with Low-Rank Gradient Approximation)

16:09

[Casual Chat] On Sora, OpenAI's Strongest (2024) Video Generation Model

18:41

[Paper Walkthrough] Lumiere: A Space-Time Diffusion Model for Video Generation, the Strongest (?) (2024) Video Generation Model

11:02

[Paper Walkthrough] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data, a New Toy for Depth-Conditioned Image Generation with ControlNet

10:19

[Paper Walkthrough] LLaMA Beyond English: How Can an English Language Model Learn Chinese Efficiently?

08:34

[Paper Walkthrough] Efficient, Low-Cost Fine-Tuning of Large Language Models: Tuning Language Models by Proxy

16:32

[Paper Walkthrough] Mamba: A Rising Challenger to the Transformer, EP2: Linear-Time Sequence Modeling with Selective State Spaces

26:18

[Paper Walkthrough] Mamba: A Rising Challenger to the Transformer, EP1: Structured State Space Sequence Model (S4)

17:58

[Paper Walkthrough] How Do You Combine Large and Small Language Models? How Should Resources Be Split Between Finetuning and Pretraining? An Emulator for Fine-Tuning Large Language

08:58

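Did Academia Sinica's AI Really Crash and Burn? Taiwan's AI Predicament: You Can Step Up and Make an Important Contribution to Taiwan's AI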

04:30

[Paper Walkthrough] Llama 2 Long: The Official llama2 with 32k Input Support

24:01

[Paper Walkthrough] Who's Harry Potter? Approximate Unlearning in LLMs: How Do You Make a Language Model Forget Harry Potter?

30:32

[Taiwan's Language Models] Taiwan LLama2 vs. the MediaTek Research Model

20:16

[Paper Walkthrough] 16x Faster Diffusion Pretraining, Wuerstchen: Efficient Pretraining of Text-to-Image Models

02:49

What Is Speculative Sampling? How Does Speculative Sampling Accelerate LLM Inference?

21:33

[Paper Walkthrough] Explaining Ensembles from a Multi-View Perspective (ICLR 2023 Honorable Mention)

22:29

[Paper Walkthrough] How Do Large Models Memorize Samples by Rote? Can Neural Network Memorization Be Localized?

18:19

[Paper Walkthrough] SeamlessM4T: MetaAI's Newly Released, Strongest Multilingual Text/Speech "Translation Konjac"

23:32

[Paper Walkthrough] Is Taiwan's llama2 Here? A Chat About Fine-Tuning Language Models

16:52

[Topic Explainer] AI That Can Interact with 3D Scenes? Multimodal Language Models from CLIP and BLIP to 3D-LLM

08:21

[Paper Quick Read] AnimateDiff: A Fast Custom Video Generator?

10:59

[Paper Walkthrough] Task-Specific Skill Localization in Fine-tuned Language Models

14:33

[Paper Walkthrough] SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis, the Latest Highly Photorealistic SD Model

31:26

[Paper Walkthrough] Task Arithmetic in the Tangent Space: Let's Tune Models in the Linear Regime

25:33

Why Didn't Apple AR Amaze You? On the Technical Challenges of AR Glasses

32:40

[Paper Walkthrough] DragGAN: A Few Mouse Clicks and the AI Retouches Your Image for You

12:25

[Paper Walkthrough] Emergent Correspondence from Image Diffusion

30:59

[Paper Walkthrough] A Mathematical Framework for Transformer Circuits

25:06

[Topic Explainer] An Introduction to DDIM and DDPM from Two Perspectives (DDIM Tutorial)

22:35

[Paper Walkthrough] Shap-E: Generating Conditional 3D Implicit Functions

15:00

deep-floyd/IF: The Latest Open-Source Image Generation Model and the Strongest at "Text Generation"

26:38

[Paper Walkthrough] Segment Anything: MetaAI Releases the Strongest Image Segmentation Model

25:29

[Paper Walkthrough] E4T: Stable Diffusion Customization Tens of Times Faster than Dreambooth

22:20

[Paper Walkthrough] Anti-Dreambooth: How to Protect Your Photos from Malicious Generation

19:20

[Paper Walkthrough] Which Jobs Does AI Affect?

20:43

[Databases] What Is a Data Warehouse? Key Takeaways from The Analytics Setup Guidebook (Part 1)

13:10

[Stable Diffusion Notes] Lycoris: A More Flexible LoRA Variant

16:52

[Stable Diffusion Notes] Stable Diffusion DPM-Solver++

27:34

[Stable Diffusion Notes] The Basic Principles of Stable Diffusion and CLIP

19:41

[Paper Walkthrough] ControlNet: Give the AI a Sketch and Let It Finish the Drawing (Stable diffusion)

27:53

[Stable Diffusion Notes] How to Fine-Tune Your Stable Diffusion Model

22:29

[Paper Walkthrough] Muse: Text-to-Image Generation via Masked Generative Transformers, Transformer-Based AI Text-to-Image Generation

08:55

[10-Minute Paper Chat] Dreamix: Video Diffusion Models are General Video Editors, Yet Another AI Video Editing Paper

20:15

[Paper Walkthrough] Text-To-4D Dynamic Scene Generation (Text-to-3D-Animation AI)

27:15

[Paper Walkthrough] Tune A Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation (Generating Video from Text)

22:23

Getting Started with Langevin Dynamics (an Important Algorithm for Diffusion Models)

01:29

Shadowing Practice Using Python | YouTube Shadowing Practice Assistant [Adjustable Speed]

27:31

[Topic Explainer] Introduction to Diffusion Model: Control Diffusion with Condition [Code Included] Tutorial

45:08

[Topic Explainer] Introduction to Diffusion Model [Code Included] Tutorial

32:42

[Paper Walkthrough] DiffusionDet: Diffusion Model for Object Detection

19:09

[Paper Walkthrough] Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

26:08

[Paper Walkthrough] Deep Equilibrium Approaches to Diffusion Models (Diffusion Models Using DEQ)

24:40

[Paper Walkthrough] Diffusion models as plug-and-play priors

19:11

[Paper Walkthrough] Self-conditioned Embedding Diffusion for Text Generation (Word-Embedding Diffusion Models for Text Generation)

18:03

[Paper Walkthrough] Understanding a Deep Neural Network Based on Neural-Path Coding