LMM Fine Tuning - Supervised Fine Tuning Trainer (SFTTrainer) vs transformers Trainer Fine Tuning Large Language Models SFT Trainer vs transformers Trainer Large Language Models LLMs are incredibly powerful tools with wide applications However fo 3 min read 05-10-2024 9
Differences between Langchain & LlamaIndex Lang Chain vs Llama Index Choosing the Right Tool for Your LLM Application The world of large language models LLMs is rapidly evolving and with it a host of too 3 min read 05-10-2024 8
Display Streaming output on Chainlit from AutoGPTQForCausalLM and RetrievalQA.from_chain_type Streaming Output from Auto GPTQ For Causal LM and Retrieval QA from chain type to Chainlit Problem Developers often struggle to visually track the real time pro 2 min read 05-10-2024 7
BFloat16 is not supported on MPS (macOS) B Float16 A Performance Booster Unavailable on mac OS The Problem You want to leverage the speed and efficiency of B Float16 data type for your machine learning 2 min read 04-10-2024 8
Langchain csv_agent with ConversationChain Unleashing the Power of Conversational AI Lang Chains CSV Agent with Conversation Chain The Problem Imagine you have a large dataset stored in a CSV file and yo 2 min read 04-10-2024 9
How do I import and configure an LLM so that auto device_map='auto' is supported or circumvented? Unlocking Auto Device Mapping for LLMs A Guide to Importing and Configuration Large Language Models LLMs are revolutionizing the way we interact with technology 2 min read 04-10-2024 6
Gemini Pro API's response omits the "text" field when I ask to translate a document Gemini Pro API Translation Trouble Missing text Field in Document Translations Problem When using the Gemini Pro API to translate a document the response someti 2 min read 04-10-2024 9
Langchain(HuggingFaceModel) - argument needs to be of type (SquadExample, dict) Demystifying the Argument needs to be of type Squad Example dict Error in Lang Chain with Hugging Face Models Lang Chain is a powerful tool for building LLM pow 2 min read 04-10-2024 7
While using Seq2SeqTrainingArguments function, This error is displayed: Using the `Trainer` with `PyTorch` requires `accelerate>=0.21.0` Using the Trainer with Py Torch requires accelerate 0 21 0 Unlocking the Power of Seq2 Seq Training Problem You re attempting to use the Seq2 Seq Training Argum 2 min read 04-10-2024 8
How to pass custom prompt variables in a chainlit app? Unlocking Dynamic Conversations with Custom Prompt Variables in Chainlit Chainlit is a powerful tool for building interactive conversational applications powere 2 min read 04-10-2024 10
Using only one specific document as source in llm - chainlit Using a Specific Document as a Source in LLM with Chainlit In the realm of natural language processing NLP and language learning models LLMs there are instances 3 min read 04-10-2024 7
Deploying LLM on Sagemaker Endpoint - CUDA out of Memory Taming the CUDA Beast Deploying LLMs on Sage Maker Endpoints with Limited Memory The Problem You ve painstakingly trained your massive language model LLM and ar 3 min read 04-10-2024 10
AttributeError: 'Document' object has no attribute 'get_doc_id' Attribute Error Document object has no attribute get doc id Unlocking the Mystery in Pythons Document Processing You re working on a Python project involving do 2 min read 04-10-2024 6
What differentiates Direct Preference Optimization (DPO) from supervised fine-tuning (SFT) DPO vs SFT Understanding the Nuances of Large Language Model Fine Tuning The world of large language models LLMs is rapidly evolving and with it new techniques 3 min read 04-10-2024 7
How do I embed json documents using embedding models like sentence-transformer or open ai's embedding model? Embedding JSON Documents Unlocking the Power of Semantic Search JSON Java Script Object Notation is a ubiquitous format for storing structured data But how do y 2 min read 04-10-2024 7
How to install and run Ollama server in AWS Kubernetes cluster (EKS)? Unleashing the Power of Ollama on AWS EKS A Comprehensive Guide Ollama is a revolutionary open source platform that empowers you to run large language models LL 3 min read 04-10-2024 7
What does "I" in the section "_IQ" and "_M" mean in this name "Meta-Llama-3-8B-Instruct-IQ3_M.gguf"? Decoding the I and M in Meta Llama 3 8 B Instruct IQ 3 M gguf A Guide to Large Language Model Naming Conventions The name Meta Llama 3 8 B Instruct IQ 3 M gguf less than a minute read 04-10-2024 9
How to set location in Langchain for Google ChatVertexAI model Setting Location for Your Lang Chain Google Chat Vertex AI Model A Guide Ever wanted to customize your Lang Chain Google Chat Vertex AI model to have specific g 2 min read 04-10-2024 8
Implementing Router Chains in Langchain Agents Navigating the Information Highway Implementing Router Chains in Lang Chain Agents Lang Chain agents are powerful tools for building intelligent applications th 3 min read 04-10-2024 9
ORPOTrainer Error: Calculated loss must be on the original device: cuda:0 but device in use is cuda:3 Unraveling the ORPO Trainer Error Calculated loss must be on the original device cuda 0 but device in use is cuda 3 Problem You re attempting to train a machine 2 min read 04-10-2024 9
Error installing Meta-Llama-3-70B model from Hugging Face Hub Meta Llama 3 70 B Installation Headache We ve Got You Covered The Problem Installing Meta Llama 3 70 B from Hugging Face Hub You re excited to work with the pow 2 min read 04-10-2024 10
How to run a local Open Source LLM in llama-index in a restricted environment? Running a Local Open Source LLM in llama index on a Restricted Network In todays world powerful language models LLMs like GPT 3 and La MDA are making waves Howe 3 min read 04-10-2024 16
How to delete a datapoint from vector search index in GCP? Deleting Datapoints from Your Vector Search Index on Google Cloud Platform Vector search a powerful technique for finding similar items based on their features 2 min read 04-10-2024 9
Issues with LLM Retrieving Passwords from Provided Passages Why Your LLM Cant and Shouldn t Find Your Passwords Large Language Models LLMs are powerful tools capable of summarizing information translating languages and e 2 min read 04-10-2024 13
Performing Function Calling with Mistral AI through Hugging Face Endpoint Performing Function Calling with Mistral AI through Hugging Face Endpoint In recent years artificial intelligence has advanced rapidly providing developers with 3 min read 30-09-2024 13