Thursday, November 27, 2025
Woah this is incredible
Anthropic just released the new Claude Opus 4.5 -- and it's better than every other coding model at basically everything.

Just look at the insane difference between Opus 4.5 and Sonnet 4.5 in solving this complex puzzle game:

Many devs online have been calling it the greatest coding model ever -- not hard to believe when you see how it stacks up to the other models:

It even beats Gemini 3 Pro, which came out just a week or so ago:

This is a model built from the ground up to be an agentic software engineer: fixing bugs, refactoring large codebases, navigating unfamiliar repos, and wiring everything together with tools and terminals.

Opus 4.5 isn't just competitive -- it's designed to be the thing you reach for when failure is expensive.

Its 80% on the SWE-bench Verified benchmark is the highest score any model has ever gotten.

And SWE-bench Verified is a benchmark where models must actually produce patches that pass tests in real GitHub repos. It's the sort of test where you're not answering quiz questions -- you're modifying real-world Python projects and getting the project's tests to pass.

Anthropic also ran it on their two-hour engineering hiring exam and reported that Opus 4.5, under realistic constraints, scored higher than any human candidate they've evaluated -- though with the important caveat that it was allowed multiple runs and they picked the best.

You can see that Opus 4.5 is optimized for "here's a repo, make it work," not just "explain what a binary search tree is."

This is advanced software engineering for messy real-world tasks -- far more than just "build a todo list app."

The effort knob: turning up (or down) the brainpower

The most interesting feature for coders is the effort parameter -- exclusive to Opus 4.5 for now.

Instead of swapping between…