Understanding IPOs: A Comprehensive Guide
Top 5 Webflow Agencies Focused On Building Brands For The Future
SeeMoE: Implementing a MoE Vision Language Model from scratch
🧑‍⚖️ "Replacing Judges with Juries" using distilabel
Fish Speech V1 - New Multilingual Open Source TTS Model
A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI
seemore: Implement a Vision Language Model from Scratch
Google Search with LLM
Token Merging for fast LLM inference: Background and first trials with Mistral
⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together
Expanding Model Context and Creating Chat Models with a Single Click
Mergoo: Efficiently Build Your Own MoE LLM
🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets
Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+
Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM
Can We Train Chat Models with Raw Data?
RealWorldQA, What's New?
How to Finetune phi-3 on MacBook Pro
Fine Tuning an LLM Using Kubernetes with Intel® Xeon® Scalable Processors
LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)
Outpainting III - Inpaint Model
Outpainting II - Differential Diffusion
Outpainting I - Controlnet version
Exploring Emotionally Intelligent AI with HelpingAI
Fine-tune Llama 3 with ORPO
RAG chatbot using llama3
Starting Tiny with Protein LLaMA
Mixture of Depth is Vibe
Custom architectures with HuggingFace 🤗
Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!
On Coding Your First Attention
SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model
Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data
Gradio integrated with FastAPI and with simple authentication
Design choices for Vision Language Models in 2024
Orchestration of Experts: The First-Principle Multi-Model System
Diffusion Models
RFDiffusion Potentials
It's raining diffusion personalization techniques☔️🎭🖼️
Building a Neural Network Classifier from the Ground Up: A Step-by-Step Guide
History of State Space Models (SSM) in 2022
What Historical AI Breakthroughs Have Been Unappreciated by The Mainstream Media?
Analysis on evaluating 7-billion-parameter Italian LLMs
DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive
Introduction to State Space Models (SSM)
RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled
🐦 The IBIS Challenge
Transformers
The LASER technique: Evaluating SVD compression
Open Source All About Data Processing, Dataverse
Many-shot jailbreaking
Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model
Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Finetune Mixtral 8x7B with AutoTrain
How do Textual Inversion tokens destroy prompts?
Experiments with Bitnet 1.5 (~ngmi~)
Virtual Try-On using IP-Adapter Inpainting
Create Mixtures of Experts with MergeKit
Elevate Responses: RAG with LlamaIndex & MongoDB
Samantha Mistral Instruct 7b - Comprehensive Bulleted Notes
Policy Questions Blog 1: AI Data Transparency Remarks for NAIAC Panel 📚🔍⚖️
Protein similarity and Matryoshka embeddings
A brief analysis of automerger data, feat. SLERP and DARE-TIES LLM merging
Data exploration and filtering with Nomic Atlas
Giskard Bot: Identifying robustness, performance and ethical vulnerabilities in the Top 10 Most Popular Hugging Face Models
Releasing Common Corpus: the largest public domain dataset for training LLMs
What's Automatic Differentiation?
Dive Deeper into Yi-9B
Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity
VideoMamba: State Space Model for Efficient Video Understanding
Better RAG 3: The text is your friend
Better RAG 2: Single-shot is not good enough
Better RAG 1: Advanced Basics
MTEB Leaderboard: User guide and best practices
Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers
Towards actively reasoning LLM systems
SemScore: Evaluating LLMs with Semantic Similarity
Saving Memory Using Padding-Free Transformer Layers during Finetuning
Open-Source SORA Has Arrived! Training Your Own SORA Model!
Large Language Models in Quest for Adventure
Streamline Computer Vision Workflows with Hugging Face Transformers and FiftyOne
Deploying 🤗 Hub models in Vertex AI
Genie: Generative Interactive Environments
Molecule retrieval and editing using multimodal text-structure representations
Breaking resolution curse of vision-language models
🌌 Analysis of Spaces in Hugging Face
Fast, High-Fidelity LLM Decoding with Regex Constraints
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Exploring a Public Domain dataset with Visual Topic Modeling
Navigating Complexity with Elegance: The P-FAF Approach to Fractal Word Embeddings
Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks
Transformers and Quadrant: Revolutionizing Data Integration for NLP Tasks
Reformatted Alignment
Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning
Guide: W-Okada, realtime voice cloning
Detecting LLM-Generated Text with Binoculars
Beyond Traditional Fine-tuning: Exploring Advanced Techniques to Mitigate LLM Hallucinations
Humor Understanding Multi-task Optimization & Ranking
Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization
🥐CroissantLLM: A Truly Bilingual French-English Language Model
Quantization of Transformer Models with Neural Compressor
Introduction to LLE
Serverless Image Similarity with Upstash Vector and Huggingface Models, Datasets and Spaces
Phinetuning 2.0
Building autograd engine tinytorch 03
Building autograd engine tinytorch 02
💻Create a Web Interface for your LLM in Python
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch
Robust image watermarking with Stable Signature + IMATAG's BZH
Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA
Building autograd engine tinytorch 01
AI Lineage Explorer: A Step Towards AI Integrity.
Unleashing the Power of Unsloth and QLoRA: Redefining Language Model Fine-Tuning
Breaking Barriers: The Critical Role of Art and Design in Advancing AI Capabilities
Implementing Fractional GPUs in Kubernetes with Aliyun Scheduler
Extending the Massive Text Embedding Benchmark to French: the datasets
Unleashing the Power of Logprobs in Language Models: A Practical Guide
Conditional Probability
Merge Large Language Models with mergekit
Temporal Scene Generation w/ Stable Diffusion
Unveiling TinyLlama: An Inspiring Dive into a Revolutionary Small-Scale Language Model
Multi-Label Classification Model From Scratch: Step-by-Step Tutorial
Multimodal IDEFICS: Unveiling the Transparency & Power of Open Visual Language Models
What is Probability?
4D masks support in Transformers
Understanding Mixtral-8x7b
Streamlining Data Management with Hugging Face and DVC: A Seamless Integration
TchAIkovsky – Piano MIDI Generation with Transformers
How Your Ordinary 8GB MacBook’s Untapped AI Power Can Run 70B LLM Models That Will Blow Your Mind!
Leveraging Transformers and PyTorch for Multiple Choice Question Tasks
Build an AI Chatbot to Run Code and Tweak plots
Combating Evaluation Data Contamination in LLMs: Strategies for High-Quality Finetuning and Model Merging
Counting 'n' objects
Kubernetes infrastructure for HF models and chat with Cluster.dev
How to build an interactive HF Space to visualize an Image Dataset
Uniting Forces: Integrating Hugging Face with Langchain for Enhanced Natural Language Processing
Drag GAN - Interactive Point-based Manipulation on the Generative Image Manifold
Running Any HuggingFace Model on SageMaker Endpoint: Walk-Through with Cross Encoder Model Example
Predicting the Effects of Mutations on Protein Function with ESM-2
Deploying Your FastAPI Applications on Huggingface Via Docker
What is a Transformer?
📚 Training Data Transparency in AI: Tools, Trends, and Policy Recommendations 🗳️
🏷️ Build AI Feedback (AIF) datasets for LLM alignment with ⚗️ distilabel
Fine-Tuning LLMs: Supervised Fine-Tuning and Reward Modelling
Easy JAX training loops with Flax and Optax
On Learning JAX – A Framework for High Performance Machine Learning
Sentence Mining with OpenAI's Whisper
Illustrated LLM OS: An Implementational Perspective
💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data
Faster Persistent Homology Alignment and Protein Complex Clustering with ESM-2 and Persistence Landscapes
Evaluating Large Language Models on Gender-Occupational Stereotypes Using the Wino Bias Test
Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique
Clustering Protein Complexes using Persistent Homology and Finetuning ESM-2 for PPI Network Prediction
Streamlining ML Workflows: Integrating MLFlow Tracking with LangTest for Enhanced Model Evaluations
Automatic Hallucination detection with SelfCheckGPT NLI
Extracting Insights from Model Cards Using Open Large Language Models
ESM-2 for Generating and Optimizing Peptide Binders for Target Proteins
Explaining the SDXL latent space
Does Sketching Work?
Understanding Zephyr
Are your NLP models deteriorating post-deployment? Let’s use unlabelled data to find out
Persistent Homology Alignment (PHA): Replacing Multiple Sequence Alignments using ESM-2 and Persistent Homology
In Silico Directed Evolution of Protein Sequences with ESM-2 and EvoProtGrad
QLoRA for ESM-2 and Post Translational Modification Site Prediction
Automating Responsible AI: Integrating Hugging Face and LangTest for More Robust Models
Hugging Face accelerates distribution of models and datasets based on Dragonfly
Introducing the Giskard Bot: Enhancing LLM Testing & Debugging on Hugging Face
Elevate Your NLP Models with Automated Data Augmentation for Enhanced Performance
Goodbye Python, Hello Rust: Building a RAG CLI Application with Orca
StarCoder Memorization Experiment Highlights Privacy Risks of Fine-Tuning On Code
Scaling Self Supervised Learning for Histology: introducing Phikon
Unmasking Language Model Sensitivity in Negation and Toxicity Evaluations
Detecting and Evaluating Sycophancy Bias: An Analysis of LLM and AI Solutions
After 500+ LoRAs made, here is the secret
Detecting the Deceptive: Unmasking Deep Fake Voices
AutoTrain Advanced now supports Experiment Tracking
Hearing is Believing: Revolutionizing AI with Audio Classification via Computer Vision
Next token prediction with GPT
What kind of data lake do we need in the Big Model era?
Fine-tune Flair Models on NER Dataset with 🤗 AutoTrain SpaceRunner
Estimating the Intrinsic Dimension of Protein Sequence Embeddings using ESM-2
Sparse LLM Inference on CPU
Introduction to Dataset Creation - What Makes a Good Dataset?
Building Your First Kubeflow Pipeline: A Comprehensive Guide
Predicting Protein-Protein Interactions Using a Protein Language Model and Linear Sum Assignment
InfiniText: Empowering Conversations & Content with Mistral-7B-Instruct-v0.1
An optimal lossy variant of speculative decoding
Changes of Embeddings during Fine-Tuning of Vision Transformers (ViT)
🕳️ Attention Sinks in LLMs for endless fluency
Understanding InstaFlow/Rectified Flow
Using 🤗 to Train a GPT-2 Model for Music Generation
Making AI-Generated Content Easier to Identify
Samantha and Mistral 7B: A Powerful and Versatile Language Model Duo
InternLM-20B is officially released on Hugging Face Hub
Trying IDEFICS on a *New Yorker* cartoon dataset
Introducing BlindChat, an open-source and privacy-by-design Conversational AI fully in-browser
ESMBind (ESMB) Ensemble Models
Optimizing Convolutional Neural Networks with Mojo - Part 1
AI Total Cost of Ownership Calculator: Evaluate the cost of in-house AI deployment vs AI APIs
🤗 LLM suggestions in Argilla with HuggingFace Inference Endpoints
Hugging Face and Scrimba partner to teach developers to utilize open-source AI models
ESMBind (ESMB): Low Rank Adaptation of ESM-2 for Protein Binding Site Prediction
Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳