
Arize AI @UCrVHzD-psX5IMCGoEWHXmGw@youtube.com

4.2K subscribers

Arize AI is an AI observability and evaluation platform.


Phoenix: Guardrails AI Integration Walkthrough (05:47)
Phoenix: Function Call Evaluations (05:48)
Phoenix: Experiments Walkthrough (07:31)
Phoenix: Datasets (06:44)
Introducing: Hosted Phoenix (04:12)
Introducing: Arize Copilot (06:39)
Platform demo 2024 (04:01)
Guardrails for AI Applications (04:48)
Text To SQL: Datasets and Experiments (09:12)
LLM Evaluation: Getting Started (10:49)
Tracing a Multimodal Query Application (03:47)
Running Custom LLM Evaluations ⚙️: Function Calling Agent (11:01)
LLM App AB Testing Using Projects (05:12)
Community Paper Reading: RAFT – Adapting Language Model to Domain Specific RAG (44:22)
LLM Evals and LLM as a Judge: Fundamentals (09:54)
LLM Token Counting (03:06)
LLM Tracing: Getting Started (03:38)
Debugging Sessions in LLM Chatbot Applications with Tracing and Evals (09:10)
LLM Interpretability: Exploring the Latest Research from OpenAI and Anthropic (45:55)
Understanding Embeddings Similarity Search In Production Workflows (11:08)
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment (49:14)
Improving & Evaluating Your RAG Application with MongoDB and Arize Phoenix (56:28)
LLM Evals for Router Based Architectures (47:09)
SQL Generation Evals: LLMs-as-a-Judge (46:10)
Breaking Down EvalGen: Who Validates the Validators? (44:31)
Optimizing AI Apps for Real-World Performance with OpenAI and Flipkart (32:30)
Keys to Understanding ReAct: Synergizing Reasoning and Acting in Language Models (43:15)
LLM Evaluation In Practice: Timeseries Evals (44:14)
LLM Evals In Practice: Creating Custom Task Evals (46:02)
Towards A Production-Ready Customer Feedback LLM: Leveraging Evals for Advanced Feedback Analysis (47:19)
Optimizing LLM Retrieval Strategies with Arize AI & Qdrant (58:43)
Chronos: Learning the Language of Time Series (44:34)
LLM Evaluation Essentials: Benchmarking and Analyzing Retrieval Approaches (53:47)
LLM Evaluation Essentials: Statistical Analysis of Summarization LLM Evaluations (46:46)
Arize Onboarding: Dashboards for LLM Use Case (02:12)
Arize Onboarding: Generative Embedding (04:28)
Arize Onboarding: LLM Data Ingestion (01:10)
Arize Onboarding: LLM Tracing (05:47)
Arize Onboarding: Prompt Playground (04:32)
Arize Onboarding: Data Ingestion (02:02)
Arize Onboarding: Exploring Dashboards (04:55)
Arize Onboarding: Performance Tracing (06:29)
Arize Onboarding: Setting Monitors for Unstructured Use Case (04:10)
Arize Onboarding: NLP Embeddings (07:07)
Arize Onboarding: Uploading Models and Data (02:26)
Anthropic Claude 3: A GPT-4 Competitor Has Arrived (42:57)
Reinforcement Learning in the Era of LLMs (44:51)
Understanding OpenAI's Sora & Evaluating Large Video Model Generation (45:16)
RAG vs Fine-tuning (39:53)
RAG Time! Evaluate RAG with LLM Evals and Benchmarking (44:22)
Phi-2 Model (44:35)
Arize Product Demo (14:06)
Phoenix OSS: AI Observability & Evaluation in a Notebook (05:07)
LangChain RAG featuring Shopify's Madhav Thaker (29:23)
Vibe-Based Prompt Engineering with PromptLayer's Jared Zoneraich (35:10)
Troubleshooting and Evaluating LLMs with Amber Roberts (35:26)
Constructing an Evaluation Approach for Generative AI Models with Hugging Face's Rajiv Shah (47:47)
Mistral AI (Mixtral-8x7B): Performance, Benchmarks (47:56)
Fine Tuning and Evaluating LLMs with Anyscale and Arize (01:16:47)
LLM Evaluation Essentials: Statistical Analysis of Hallucination LLM Evaluations (50:54)