NEW AGENTLESS AI Software Development

  Рет қаралды 2,628

code_your_own_AI

code_your_own_AI

Күн бұрын

A new leader for open-source software development: AGENTLESS. Best in class according to performance metrics on the SWE-bench Lite benchmark, including other open source agents w/ GPT-4 omni and Claude 3.5, like Aider or Devon or SWE-Agent.
AGENTLESS challenges the prevailing notion that complex autonomous agents are necessary for automating software development tasks. It leverages a simplified two-phase process-localization and repair-without the need for agents to decide on future actions or manage complex tools. This simplicity not only reduces the cognitive load involved in understanding and debugging the process but also significantly lowers operational costs. The empirical results demonstrate that AGENTLESS achieves competitive performance metrics on the SWE-bench Lite benchmark, outperforming all other open-source approaches in both effectiveness and efficiency.
The insights from AGENTLESS's performance suggest a paradigm shift in the development of software engineering tools, emphasizing the effectiveness of simpler, more interpretable methods over complex autonomous systems. This approach not only makes it easier to understand and maintain the system but also highlights the potential for significant cost savings and efficiency improvements. The research encourages further exploration into refining these simplistic approaches, suggesting that future advancements could focus on enhancing the accuracy of localization and repair mechanisms and exploring new forms of integration with existing development environments.
All rights w/ authors:
AGENTLESS :
Demystifying LLM-based Software Engineering Agents
arxiv.org/pdf/...
#airesearch
#newtech
#science

Пікірлер: 12
@joehopfield
@joehopfield Ай бұрын
In software engineering class at UCLA (1977) we submitted "PL/C" jobs on punch cards. Intended for teaching, PL/C implemented IBM's PL/1 but "fixed" syntax and a few logic errors for you. Any error became many pages of errors created by PL/C's attempted fixes. So we're nearly back to 50 years ago. 😀
@code4AI
@code4AI Ай бұрын
Thanks for this insight. It proves, that we are trying everything we know to make the machines a little bit more ..... clever ? For sure it is not intelligent.
@Solinfini
@Solinfini Ай бұрын
Amazing stuff! Keep it coming.
@spkgyk
@spkgyk Ай бұрын
The fact that companies are selling something that only works 20% of the time is crazy to me
@code4AI
@code4AI Ай бұрын
State of the art. Of course only in scientific terms, because if you look at some marketing statement: 104% and more .... smile.
@vladrm1
@vladrm1 Ай бұрын
If it saves 5% or more effort in a team of 20+ devs, that's already more than 1 FTE so it could be useful.
@preston_is_on_youtube
@preston_is_on_youtube Ай бұрын
Praise the Lord ☀️ for these sweet drops 🍭 of youtube manna 😭😭😭
@propeacemindfortress
@propeacemindfortress Ай бұрын
when less is more... the we need more recommendations with less... errr.... passt scho^^
@thesimplicitylifestyle
@thesimplicitylifestyle Ай бұрын
😎🤖
@atomobianco
@atomobianco Ай бұрын
Agentless? To me it seems to operate in agentic mode, with multiple steps and LLM requests
@code4AI
@code4AI Ай бұрын
Ah, I understand. If you want to learn about agentic, and why this here is AGENTLESS; I have a dedicated video on what it means to be AGENTIC: kzfaq.info/get/bejne/r7B4l9uKr829mJs.html
On-Device: Functional Tokens (Octopus v2)
26:07
code_your_own_AI
Рет қаралды 3,2 М.
PhD Thesis in 1 Day (300$): Open-Source AI
52:58
code_your_own_AI
Рет қаралды 5 М.
If Barbie came to life! 💝
00:37
Meow-some! Reacts
Рет қаралды 79 МЛН
The CUTEST flower girl on YouTube (2019-2024)
00:10
Hungry FAM
Рет қаралды 7 МЛН
❌Разве такое возможно? #story
01:00
Кэри Найс
Рет қаралды 4,1 МЛН
Мы сделали гигантские сухарики!  #большаяеда
00:44
The "Modern Day Slaves" Of The AI Tech World
52:42
Real Stories
Рет қаралды 506 М.
Turns out REST APIs weren't the answer (and that's OK!)
10:38
Dylan Beattie
Рет қаралды 152 М.
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 968 М.
Build Agentic AI Apps with the Autogen framework | OD539
45:28
Microsoft Developer
Рет қаралды 8 М.
NP-Hard: The End of AI?
47:39
code_your_own_AI
Рет қаралды 1,8 М.
Stack Overflow stopped caring about developers a long time ago
22:33
Coding with Dee
Рет қаралды 65 М.
What Is an AI Anyway? | Mustafa Suleyman | TED
22:02
TED
Рет қаралды 1,4 МЛН
The Future of Knowledge Assistants: Jerry Liu
16:55
AI Engineer
Рет қаралды 81 М.
Запрещенный Гаджет для Авто с aliexpress 2
0:50
Тимур Сидельников
Рет қаралды 2,4 МЛН
zamzam electronic Samsung S24 Ultra power🔥
0:14
Reversal gamer
Рет қаралды 16 МЛН
ПС 110/10. Кто то подключил "левак" 110000 вольт!?
0:34
Советы электрика
Рет қаралды 1,4 МЛН
Мой новый мега монитор!🤯
1:00
Корнеич
Рет қаралды 8 МЛН
Nokia…
0:19
Eggified
Рет қаралды 2,5 МЛН