In Swiggy Bytes — Tech Blog, by Amaresh Marripudi
Hermes: A Text-to-SQL solution at Swiggy
Co-authored with Rutvik Reddy
Jul 26, 2024

Mastering LLM (Large Language Model)
How Much GPU Memory is Needed to Serve a Large Language Model (LLM)?
In nearly all LLM interviews, there’s one question that consistently comes up: “How much GPU memory is needed to serve a Large Language…
Aug 17, 2024

In Data Science in your pocket, by Mehul Gupta
Magentic-One, AutoGen, LangGraph, CrewAI, or OpenAI Swarm: Which Multi-AI Agent Framework is Best?
Pros and Cons of popular Multi-Agent Orchestration framework
Nov 14, 2024

In GoPenAI, by Jason Anderson
Building Perplexity-Style Search
Inspired by Perplexity AI’s innovative approach to information discovery, developers can now create their own AI-enhanced search tools…
Jul 3, 2024

In GoPenAI, by M K Pavan Kumar
Harnessing AI at the Edge: Building a RAG System with Ollama, Qdrant and Raspberry Pi
Leveraging the compact and cost-effective Raspberry Pi to host embedding models, vector database and language models (LLM) for a…
Jun 9, 2024

In GoPenAI, by Bin Wang
Top Practical Techniques to Enhance LLM RAG Applications for Optimal Performance
In the realm of advanced AI, optimizing Language Model (LLM) applications for Retrieval-Augmented Generation (RAG) is crucial for…
Jul 4, 2024

In TDS Archive, by Giuliano Giacaglia
Transformers
Transformers are a type of neural network architecture that have been gaining popularity. Transformers were recently used by OpenAI in…
Mar 11, 2019

In KX Systems, by Michael Ryaboy
Multistage RAG with LlamaIndex and Cohere Reranking: A Step-by-Step Guide
Retrieval Augmented Generation (RAG) is a powerful technique that allows language models to draw upon relevant information from external…
Apr 19, 2024

In Towards AI, by Mandar Karhade, MD. PhD.
Why RAG Applications Fail in Production
It worked as a prototype; then all went down!
Mar 19, 2024

In TDS Archive, by Alex Honchar
Intro to LLM Agents with Langchain: When RAG is Not Enough
First-order principles of brain structure for AI assistants
Mar 15, 2024