List: LLM | Curated by Karan Acharya

Dec 15, 2024
11 stories
LLM
In
Swiggy Bytes — Tech Blog
by
Amaresh Marripudi
Hermes: A Text-to-SQL solution at SwiggyCo-authored with Rutvik Reddy
Jul 26, 2024
12
Jul 26, 2024
12
Mastering LLM (Large Language Model)
How Much GPU Memory is Needed to Serve a Large Language Model (LLM)?In nearly all LLM interviews, there’s one question that consistently comes up: “How much GPU memory is needed to serve a Large Language…
Aug 17, 2024
20
Aug 17, 2024
20
In
Data Science in your pocket
by
Mehul Gupta
Magentic-One, AutoGen, LangGraph, CrewAI, or OpenAI Swarm: Which Multi-AI Agent Framework is Best?Pros and Cons of popular Multi-Agent Orchestration framework
Nov 14, 2024
15
Nov 14, 2024
15
In
GoPenAI
by
Dipam Vasani
Should LLMs have unit tests?Spoiler Alert! yes
Jul 10, 2024
Jul 10, 2024
In
GoPenAI
by
Jason Anderson
Building Perplexity-Style SearchInspired by Perplexity AI’s innovative approach to information discovery, developers can now create their own AI-enhanced search tools…
Jul 3, 2024
Jul 3, 2024
In
GoPenAI
by
M K Pavan Kumar
Harnessing AI at the Edge: Building a RAG System with Ollama, Qdrant and Raspberry PiLeveraging the compact and cost-effective Raspberry Pi to host embedding models , vector database and language models (LLM) for a…
Jun 9, 2024
3
Jun 9, 2024
3
In
GoPenAI
by
Bin Wang
Top Practical Techniques to Enhance LLM RAG Applications for Optimal PerformanceIn the realm of advanced AI, optimizing Language Model (LLM) applications for Retrieval-Augmented Generation (RAG) is crucial for…
Jul 4, 2024
Jul 4, 2024
In
TDS Archive
by
Giuliano Giacaglia
TransformersTransformers are a type of neural network architecture that have been gaining popularity. Transformers were recently used by OpenAI in…
Mar 11, 2019
38
Mar 11, 2019
38
In
KX Systems
by
Michael Ryaboy
Multistage RAG with LlamaIndex and Cohere Reranking: A Step-by-Step GuideRetrieval Augmented Generation (RAG) is a powerful technique that allows language models to draw upon relevant information from external…
Apr 19, 2024
Apr 19, 2024
In
Towards AI
by
Mandar Karhade, MD. PhD.
Why RAG Applications Fail in ProductionIt worked as a prototype; then all went down!
Mar 19, 2024
30
Mar 19, 2024
30
In
TDS Archive
by
Alex Honchar
Intro to LLM Agents with Langchain: When RAG is Not EnoughFirst-order principles of brain structure for AI assistants
Mar 15, 2024
17
Mar 15, 2024
17