No video

NEW AGENTLESS AI Software Development

  Рет қаралды 2,504

code_your_own_AI

code_your_own_AI

28 күн бұрын

A new leader for open-source software development: AGENTLESS. Best in class according to performance metrics on the SWE-bench Lite benchmark, including other open source agents w/ GPT-4 omni and Claude 3.5, like Aider or Devon or SWE-Agent.
AGENTLESS challenges the prevailing notion that complex autonomous agents are necessary for automating software development tasks. It leverages a simplified two-phase process-localization and repair-without the need for agents to decide on future actions or manage complex tools. This simplicity not only reduces the cognitive load involved in understanding and debugging the process but also significantly lowers operational costs. The empirical results demonstrate that AGENTLESS achieves competitive performance metrics on the SWE-bench Lite benchmark, outperforming all other open-source approaches in both effectiveness and efficiency.
The insights from AGENTLESS's performance suggest a paradigm shift in the development of software engineering tools, emphasizing the effectiveness of simpler, more interpretable methods over complex autonomous systems. This approach not only makes it easier to understand and maintain the system but also highlights the potential for significant cost savings and efficiency improvements. The research encourages further exploration into refining these simplistic approaches, suggesting that future advancements could focus on enhancing the accuracy of localization and repair mechanisms and exploring new forms of integration with existing development environments.
All rights w/ authors:
AGENTLESS :
Demystifying LLM-based Software Engineering Agents
arxiv.org/pdf/2407.01489
#airesearch
#newtech
#science

Пікірлер: 12
@joehopfield
@joehopfield 25 күн бұрын
In software engineering class at UCLA (1977) we submitted "PL/C" jobs on punch cards. Intended for teaching, PL/C implemented IBM's PL/1 but "fixed" syntax and a few logic errors for you. Any error became many pages of errors created by PL/C's attempted fixes. So we're nearly back to 50 years ago. 😀
@code4AI
@code4AI 24 күн бұрын
Thanks for this insight. It proves, that we are trying everything we know to make the machines a little bit more ..... clever ? For sure it is not intelligent.
@saulinfinite
@saulinfinite 26 күн бұрын
Amazing stuff! Keep it coming.
@preston_is_on_youtube
@preston_is_on_youtube 26 күн бұрын
Praise the Lord ☀️ for these sweet drops 🍭 of youtube manna 😭😭😭
@thesimplicitylifestyle
@thesimplicitylifestyle 26 күн бұрын
😎🤖
@propeacemindfortress
@propeacemindfortress 26 күн бұрын
when less is more... the we need more recommendations with less... errr.... passt scho^^
@spkgyk
@spkgyk 26 күн бұрын
The fact that companies are selling something that only works 20% of the time is crazy to me
@code4AI
@code4AI 24 күн бұрын
State of the art. Of course only in scientific terms, because if you look at some marketing statement: 104% and more .... smile.
@vladrm1
@vladrm1 24 күн бұрын
If it saves 5% or more effort in a team of 20+ devs, that's already more than 1 FTE so it could be useful.
@atomobianco
@atomobianco 24 күн бұрын
Agentless? To me it seems to operate in agentic mode, with multiple steps and LLM requests
@code4AI
@code4AI 24 күн бұрын
Ah, I understand. If you want to learn about agentic, and why this here is AGENTLESS; I have a dedicated video on what it means to be AGENTIC: kzfaq.info/get/bejne/r7B4l9uKr829mJs.html
On-Device: Functional Tokens (Octopus v2)
26:07
code_your_own_AI
Рет қаралды 3,1 М.
GraphRAG or SpeculativeRAG ?
25:51
code_your_own_AI
Рет қаралды 6 М.
Heartwarming Unity at School Event #shorts
00:19
Fabiosa Stories
Рет қаралды 25 МЛН
Mama vs Son vs Daddy 😭🤣
00:13
DADDYSON SHOW
Рет қаралды 50 МЛН
Why Is He Unhappy…?
00:26
Alan Chikin Chow
Рет қаралды 69 МЛН
NEW TextGrad by Stanford: Better than DSPy
41:25
code_your_own_AI
Рет қаралды 11 М.
5 Craziest AI Agents We've Ever Built
12:05
VRSEN
Рет қаралды 20 М.
Automated Prompt Engineering with DSPy + DSPy Visualization
36:27
5 Easy Ways to help LLMs to Reason
50:37
code_your_own_AI
Рет қаралды 4,4 М.
I Finally Tried The AI-Powered VS Code Killer | Cursor IDE Review
8:52
Your Average Tech Bro
Рет қаралды 19 М.
Inside the Black Box of AI Reasoning
31:48
code_your_own_AI
Рет қаралды 2,5 М.
Masterclass on AI by Microsoft
20:50
code_your_own_AI
Рет қаралды 1,6 М.
AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"
23:47
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 923 М.
Heartwarming Unity at School Event #shorts
00:19
Fabiosa Stories
Рет қаралды 25 МЛН