[Stanford] FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

  Рет қаралды 26

Trend in Research

Trend in Research

Ай бұрын

arxiv.org/abs/2305.05176
There is a rapidly growing number of large language models (LLMs) that users can query for a fee. We review the cost associated with querying popular LLM APIs, e.g. GPT-4, ChatGPT, J1-Jumbo, and find that these models have heterogeneous pricing structures, with fees that can differ by two orders of magnitude. In particular, using LLMs on large collections of queries and text can be expensive. Motivated by this, we outline and discuss three types of strategies that users can exploit to reduce the inference cost associated with using LLMs: 1) prompt adaptation, 2) LLM approximation, and 3) LLM cascade. As an example, we propose FrugalGPT, a simple yet flexible instantiation of LLM cascade which learns which combinations of LLMs to use for different queries in order to reduce cost and improve accuracy. Our experiments show that FrugalGPT can match the performance of the best individual LLM (e.g. GPT-4) with up to 98% cost reduction or improve the accuracy over GPT-4 by 4% with the same cost. The ideas and findings presented here lay a foundation for using LLMs sustainably and efficiently.

Пікірлер
[CVPR 24 Best Paper] Generative Image Dynamics
16:33
Trend in Research
Рет қаралды 215
IOT UNIT 4 5
1:48
HOD ECE
Рет қаралды 2
ОСКАР vs БАДАБУМЧИК БОЙ!  УВЕЗЛИ на СКОРОЙ!
13:45
Бадабумчик
Рет қаралды 6 МЛН
WHO LAUGHS LAST LAUGHS BEST 😎 #comedy
00:18
HaHaWhat
Рет қаралды 23 МЛН
ЧУТЬ НЕ УТОНУЛ #shorts
00:27
Паша Осадчий
Рет қаралды 7 МЛН
Large Language Models from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 340 М.
[IB Biology HL] 2004 May TZ1 Paper 1 Q15
0:22
Notephilia
Рет қаралды 6
18ECE207J M5 The application layer protocol for embedded system
3:01
[CVPR 24 Best Paper] Rich Human Feedback for Text-to-Image Generation
18:56
D-PSC-MN-01 Dell PowerScale Maintenance Exam Questions
5:47
The spelled-out intro to language modeling: building makemore
1:57:45
Andrej Karpathy
Рет қаралды 655 М.
ОСКАР vs БАДАБУМЧИК БОЙ!  УВЕЗЛИ на СКОРОЙ!
13:45
Бадабумчик
Рет қаралды 6 МЛН