Argilla @UCAIz8TmvQQrLqbD7sd-5S2A@youtube.com

523 subscribers

Argilla is a collaboration platform for AI engineers and dom


51:39 · Start a token classification project on the Hugging Face Hub with Argilla, GliNER and NuExtract LLM
49:15 · Start a text classification project on the Hugging Face Hub with Argilla and SetFit
44:03 · Argilla Community Everything image: from fine-tuning CLIP models to synthetic image datasets
00:59 · What is distilabel? A brief feature overview.
31:19 · Generating and cleaning a preference dataset for DPO / ORPO with LLMs and distilabel
45:54 · Optimizing RAG Pipelines by fine-tuning custom embedding models on synthetic data with ZenML
00:32 · ZenML: a way to streamline your complex projects with ease
00:32 · Cosine similarity as a proxy for the quality of sentence pair data
00:39 · Optimizing RAG by choosing the right model
00:36 · Model pooling for diverse synthetic data generation
30:12 · Ellamind on synthetic data generation with distilabel for pipelining and LLM fine-tuning
04:45 · Scaling Synthetic Data Creation with 1 Billion Personas | PersonaHub Dataset Explained
50:02 · Javier Alonso on lead optimisation at Idealista
06:29 · Exploring the PRISM Dataset: Conversations, Insights, and Model Performance
39:06 · Ben Burtenshaw on the Argilla 2.0 SDK refactor
01:02:59 · Louis Guitton on NER with Argilla
54:39 · Weights & Biases on Wandbot
32:18 · Datamaran on using Argilla in MLOps workflows for ESG governance
39:00 · Understanding and reproducing DEITA with MantisNLP using distilabel 1.0.0
38:11 · Elad Levi on AutoPrompt, intent-based prompt calibration, and prompt engineering
55:51 · Daniel van Strien on the Hugging Face Hub and synthetic creation of a DPO dataset for Haiku
51:10 · Seth Levine on using SetFit and BERTopic for unsupervised clustering
45:40 · Red Cross 510 on NLP for good with SetFit for chat message classification
49:29 · Prolific on workload distribution, LLM preference data annotation, and a Phi-2 fine-tune Colab
01:03:58 · Pitching AI to your boss, SLMs vs LLMs, and contributing to open source projects
43:59 · Kickstart NLP with synthetic data and running LLMs on Google Colab using vLLM
36:03 · How we cleaned OpenBMB UltraFeedback and Notus
36:40 · An introduction to distilabel for AI feedback and synthetic data generation
46:16 · Deploy Argilla on a private Hugging Face Space and how to contribute to open source
45:29 · Deploy Argilla on a public Hugging Face Space and create multi-modal datasets
02:17 · Meet Argilla
00:20 · Collect human feedback for evaluating fine-tuned LLMs
00:20 · Collect human feedback for fine-tuning ChatGPT models
07:27 · Tutorial: Deploy Argilla on Hugging Face and create your first FeedbackDataset
21:16 · Tutorial: Curate an instruction dataset for supervised fine-tuning
11:45 · Practical guide: How to improve the Alpaca data to train an LLM in Spanish
03:00 · Argilla: 3 min walkthrough