InSwiggy Bytes — Tech BlogbyAmaresh MarripudiHermes: A Text-to-SQL solution at SwiggyCo-authored with Rutvik ReddyJul 26, 202490412Jul 26, 202490412
Mastering LLM (Large Language Model)How Much GPU Memory is Needed to Serve a Large Language Model (LLM)?In nearly all LLM interviews, there’s one question that consistently comes up: “How much GPU memory is needed to serve a Large Language…Aug 17, 20242K20Aug 17, 20242K20
InData Science in your pocketbyMehul GuptaMagentic-One, AutoGen, LangGraph, CrewAI, or OpenAI Swarm: Which Multi-AI Agent Framework is Best?Pros and Cons of popular Multi-Agent Orchestration frameworkNov 14, 202461615Nov 14, 202461615
InGoPenAIbyJason AndersonBuilding Perplexity-Style SearchInspired by Perplexity AI’s innovative approach to information discovery, developers can now create their own AI-enhanced search tools…Jul 3, 20243Jul 3, 20243
InGoPenAIbyM K Pavan KumarHarnessing AI at the Edge: Building a RAG System with Ollama, Qdrant and Raspberry PiLeveraging the compact and cost-effective Raspberry Pi to host embedding models , vector database and language models (LLM) for a…Jun 9, 20241233Jun 9, 20241233
InGoPenAIbyBin WangTop Practical Techniques to Enhance LLM RAG Applications for Optimal PerformanceIn the realm of advanced AI, optimizing Language Model (LLM) applications for Retrieval-Augmented Generation (RAG) is crucial for…Jul 4, 202412Jul 4, 202412
InTDS ArchivebyGiuliano GiacagliaTransformersTransformers are a type of neural network architecture that have been gaining popularity. Transformers were recently used by OpenAI in…Mar 11, 201910.7K38Mar 11, 201910.7K38
InKX SystemsbyMichael RyaboyMultistage RAG with LlamaIndex and Cohere Reranking: A Step-by-Step GuideRetrieval Augmented Generation (RAG) is a powerful technique that allows language models to draw upon relevant information from external…Apr 19, 2024108Apr 19, 2024108
InTowards AIbyMandar Karhade, MD. PhD.Why RAG Applications Fail in ProductionIt worked as a prototype; then all went down!Mar 19, 20242.5K30Mar 19, 20242.5K30
InTDS ArchivebyAlex HoncharIntro to LLM Agents with Langchain: When RAG is Not EnoughFirst-order principles of brain structure for AI assistantsMar 15, 20242.4K17Mar 15, 20242.4K17