{"id":66,"date":"2024-03-23T13:27:42","date_gmt":"2024-03-23T13:27:42","guid":{"rendered":"https:\/\/artificial-intelligence.news\/?p=66"},"modified":"2024-04-01T16:24:15","modified_gmt":"2024-04-01T16:24:15","slug":"devin-the-ai-developer","status":"publish","type":"post","link":"https:\/\/artificial-intelligence.news\/?p=66","title":{"rendered":"Devin &#8211; The AI developer"},"content":{"rendered":"\n<p><a href=\"https:\/\/devin.ai\">Devin<\/a> was introduced to the world on 12th March 2024 by Cognition Labs. This stealth startup consists of 10 people and has received $21 million from Peter Thiel&#8217;s Founders Fund. As a result, it has produced an AI agent that outperforms the current offerings from tech giants.<\/p>\n\n\n\n<p>You can see it in action in this tweet from Cognition Labs&#8217; Twitter \/ X account:<\/p>\n\n\n\n<figure class=\"wp-block-embed aligncenter is-type-rich is-provider-twitter wp-block-embed-twitter\"><div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"twitter-tweet\" data-width=\"500\" data-dnt=\"true\"><p lang=\"en\" dir=\"ltr\">Today we&#39;re excited to introduce Devin, the first AI software engineer.<br><br>Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork.<br><br>Devin is\u2026 <a href=\"https:\/\/t.co\/ladBicxEat\">pic.twitter.com\/ladBicxEat<\/a><\/p>&mdash; Cognition (@cognition_labs) <a href=\"https:\/\/twitter.com\/cognition_labs\/status\/1767548763134964000?ref_src=twsrc%5Etfw\">March 12, 2024<\/a><\/blockquote><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script>\n<\/div><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Origin story<\/h2>\n\n\n\n<p>Perhaps more remarkable than the agent are its original founders:<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large is-resized\"><img loading=\"lazy\" 
decoding=\"async\" width=\"1024\" height=\"683\" src=\"http:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/cognition-ai-founders-1024x683.jpg\" alt=\"\" class=\"wp-image-67\" style=\"width:685px;height:auto\" srcset=\"https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/cognition-ai-founders-1024x683.jpg 1024w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/cognition-ai-founders-300x200.jpg 300w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/cognition-ai-founders-768x512.jpg 768w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/cognition-ai-founders.jpg 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Steven Hao, Scott Wu and Walden Yan (left to right)<\/figcaption><\/figure>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-fe9cc265 wp-block-group-is-layout-flex\">\n<ul class=\"wp-block-list\">\n<li><strong>Scott Wu <\/strong>&#8211; Chief Executive Officer<br><br>A child maths prodigy who won the <a href=\"https:\/\/stats.ioinformatics.org\/people\/2686\">International Olympiad in Informatics (IOI)<\/a> three years in a row, achieving a perfect 100% score in 2014. 
Watch this impressive <a href=\"https:\/\/www.youtube.com\/watch?v=G0qvYJVpgc0\">video<\/a> of him in action as a child.<br><br>He is also highly regarded in the competitive programming scene, and has achieved the legendary grandmaster rank on <a href=\"https:\/\/codeforces.com\/profile\/scott_wu\">Codeforces<\/a>.<br><br><br><\/li>\n\n\n\n<li><strong>Steven Hao <\/strong>&#8211; Chief Technology Officer<br><br>An international grandmaster on <a href=\"https:\/\/codeforces.com\/profile\/stevenkplus\">Codeforces<\/a>, and a gold and silver medallist at the <a href=\"https:\/\/stats.ioinformatics.org\/people\/3113\">IOI<\/a>.<br><br><br><\/li>\n\n\n\n<li><strong>Walden Yan<\/strong> &#8211; Chief Product Officer<br><br>A highly capable programmer, having reached grandmaster on <a href=\"https:\/\/codeforces.com\/profile\/walnutwaldo20\">Codeforces<\/a>. He also won gold at the <a href=\"https:\/\/cphof.org\/standings\/ioi\/2020\">2020 IOI<\/a>.<\/li>\n<\/ul>\n<\/div>\n\n\n\n<p>The current team at Cognition Labs consists of 10 members, who also have some remarkable credentials. Between them, they hold 10 IOI gold medals, including those of Scott&#8217;s brother Neal Wu, who also won <a href=\"https:\/\/stats.ioinformatics.org\/people\/1667\">3 IOI gold medals<\/a>.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Benchmark<\/h2>\n\n\n\n<p>How does Devin compare against other AI models?<\/p>\n\n\n\n<p>This graph of SWE-bench results shows the percentage of real-world software engineering issues that each model resolved:<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-large is-resized is-style-default\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"545\" src=\"https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/devin-ai-benchmark-1024x545.jpeg\" alt=\"Bar graph for Real World Software Engineering Performance showing that Devin resolved 13.86% of issues. 
The next highest is Claude 2, which resolved 4.80% of issues.\" class=\"wp-image-68\" style=\"width:814px;height:auto\" srcset=\"https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/devin-ai-benchmark-1024x545.jpeg 1024w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/devin-ai-benchmark-300x160.jpeg 300w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/devin-ai-benchmark-768x409.jpeg 768w, https:\/\/artificial-intelligence.news\/wp-content\/uploads\/2024\/03\/devin-ai-benchmark.jpeg 1280w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>At 13.86% of issues resolved, Devin resolves almost three times as many issues as its closest rival, Claude 2, at 4.80%. The gap is all the more notable because Devin was completely unassisted, whereas the other models were told exactly which files to edit.<\/p>\n\n\n\n<p>What&#8217;s more impressive is the difference in resources behind Cognition Labs&#8217; Devin and Anthropic&#8217;s Claude 2.<\/p>\n\n\n\n<p>Anthropic is a 240-person company which raised over $7 billion in funding in July 2023 alone. This far outweighs Cognition Labs, which has 10 people and $21 million in funding.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Capabilities<\/h2>\n\n\n\n<p>Devin can complete many tasks that a software engineer may come across, including:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Take <a href=\"https:\/\/twitter.com\/emollick\/status\/1770128785494700333\">website building requests<\/a> on Reddit, where it had to be stopped after it wanted to start charging people.<\/li>\n\n\n\n<li><a href=\"https:\/\/www.youtube.com\/watch?v=TiXAzn2_Xck\">Find and fix bugs<\/a>.<\/li>\n\n\n\n<li><a href=\"https:\/\/twitter.com\/emollick\/status\/1768742585122558063\">Talk with the user as it develops<\/a>.<\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p>Keep in mind that Devin is only in alpha, so it will only improve from here. 
It may eventually replace human software engineers, but whether it will have the common sense to work without close human oversight remains to be seen.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>Read more about Devin on the <a href=\"https:\/\/www.cognition-labs.com\/introducing-devin\">Cognition blog<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Devin was introduced to the world on 12th March 2024 by Cognition Labs. This stealth startup consists of 10 people and has received $21 million from Peter Thiel&#8217;s Founders Fund. As a result, it has produced an AI agent that outperforms the current offerings from tech giants. You can see it in action in this tweet [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":72,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-container-style":"default","site-container-layout":"default","site-sidebar-layout":"default","disable-article-header":"default","disable-site-header":"default","disable-site-footer":"default","disable-content-area-spacing":"default","footnotes":""},"categories":[1],"tags":[],"class_list":["post-66","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog"],"_links":{"self":[{"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/posts\/66","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=66"}],"version-history":[{"count":7,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/posts\/66\/revisions"}
],"predecessor-version":[{"id":77,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/posts\/66\/revisions\/77"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=\/wp\/v2\/media\/72"}],"wp:attachment":[{"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=66"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=66"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/artificial-intelligence.news\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=66"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}