Trelis Research @UCruC3Lkt_-StdHlPiyWbPSg@youtube.com

18K subscribers

Trelis LTD is a research company based in Dublin, founded by


15:49
Create a Python Sandbox for Agents to Run Code
10:06
April 2025 Channel Update - Repos, Grants, Collabs, and ARC AGI Team
46:46
Build Custom LLM Benchmarks for your Application
18:08
Is it safe to use Cursor or Windsurf?
55:43
How to Build and Publish an MCP Server - A Detailed Guide
30:05
Qwen 2.5 Omni - The Most Multi-modal
55:39
How does MCP work? How to use MCP?
15:30
Price Comparison: OpenAI vs ElevenLabs vs DeepGram for TTS and STT
02:23:10
Fine-tune Text to Speech Models in 2025: CSM-1B and Orpheus TTS
27:54
PREVIEW: A Prelude to Llama 4 - Token-Based Audio Models Orpheus, CSM 1B and Moshi
02:23:10
Diarization, Voice and Turn Detection
21:43
Gemma 3
01:02:57
Create and Fine-tune AI Avatar Videos
42:55
Document-Reading Agents with Read-Write Memory
30:20
Why use Keyword vs Vector Search?
48:46
Memory Management in LLMs and Agents
22:58
Train an LLM to Self-Correct with Verifiable Backtracking
55:15
SFT vs GRPO
32:44
How does GRPO work?
01:18:19
Reinforcement Learning for LLMs in 2025
11:27
The Best LLM? Google vs OpenAI, Anthropic and DeepSeek
01:11:20
Top Vision Models 2025: Qwen 2.5 VL, Moondream, & SmolVLM (Fine-Tuning & Benchmarks)
15:56
Run Distilled DeepSeek v3 Reasoning Models on Any Laptop with LMStudio
54:16
How to Boost RAG Accuracy with SmolAgents & BM25
14:08
OpenAI’s $500B ‘Stargate’ Project, GPU Export Bans, Musk’s Critique, and DeepSeek’s Reasoning Model
17:07
Channel Update - Playlists, Repos, Collabs, Grants, Memberships
49:45
Advanced Embedding Models and Techniques for RAG
33:21
Reasoning Models and Chinese Models
17:37
LiteLLM - One Unified API for all LLMs
12:19
Nvidia RTX 5090 vs 4090, Project Digits & GB NVLink 72 at CES 2025
34:47
LLM Evals - Part 2: Improving Performance
01:01:57
How Deepseek v3 made Compute and Export Controls Less Relevant
34:23
LLM Evals - Part 1: Evaluating Performance
46:34
I Tested Every GPU
57:02
Serve Multiple LoRA Adapters on a Single GPU
01:17:56
Why Build Enterprise RAG with Postgres?
56:31
Multi modal Audio + Text Fine tuning and Inference with Qwen
01:00:38
How to Build an Inference Service
50:36
How to Fine-tune Florence 2: The Best Small Vision Model
24:23
Output Predictions - Faster Inference with OpenAI or vLLM
03:59
Coding Assistant for Jupyter Lab
25:09
Predicting Events with Large Language Models
34:44
Fine tune and Serve Faster Whisper Turbo
22:37
OpenAI Fine-tuning vs Distillation - Free Colab Notebook
01:17:59
Synthetic Data Generation and Fine tuning (OpenAI GPT4o or Llama 3)
01:00:00
Test Time Compute, Part 2: Verifiers
54:59
Test Time Compute, Part 1: Sampling and Chain of Thought
01:20:38
Distillation of Transformer Models
55:22
Fine tuning Pixtral - Multi-modal Vision and Text Model
39:40
Powering Gigawatt Data Centers
01:03:42
Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor
03:53
Make Cursor Understand Folder Structure - Coding with LLMs
45:52
Automated Prompt Engineering with DSPy
51:57
Fine Tune Flux Diffusion Models with Your Photos
40:40
How to use LLMs for Fact Checking
35:26
CONTEXT CACHING for Faster and Cheaper Inference
37:03
Run Speech-to-Speech Models on Mac or GPU
01:27:15
LLM Security 101: Jailbreaks, Prompt Injection Attacks, and Building Guards
02:56
Create an AI Assistant or Endpoint for your Documents
33:30
RAG - but with Verified Citations!