Skip to main content

Anthropic aims to fix one of the biggest problems in AI right now

the Anthropic logo
Anthropic

Hot on the heels of the announcement that its Claude 3.5 Sonnet large language model beat out other leading models, including GPT-4o and Llama-400B, AI startup Anthropic announced Monday that it plans to launch a new program to fund the development of independent, third-party benchmark tests against which to evaluate its upcoming models.

Per a blog post, the company is willing to pay third-party developers to create benchmarks that can “effectively measure advanced capabilities in AI models.”

“Our investment in these evaluations is intended to elevate the entire field of AI safety, providing valuable tools that benefit the whole ecosystem,” Anthropic wrote in a Monday blog post. “Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply.”

The company wants submitted benchmarks to help measure the relative “safety level” of an AI based on a number of factors, including how well it resists attempts to coerce responses that might include cybersecurity; chemical, biological, radiological, and nuclear (CBRN); and misalignment, social manipulation, and other national security risks. Anthropic is also looking for benchmarks to help evaluate models’ advanced capabilities and is willing to fund the “development of tens of thousands of new evaluation questions and end-to-end tasks that would challenge even graduate students,” essentially testing a model’s ability to synthesize knowledge from a variety of sources, its ability to refuse cleverly worded malicious user requests, and its ability to respond in multiple languages.

Anthropic is looking for “sufficiently difficult,” high-volume tasks that can involve as many as “thousands” of testers across a diverse set of test formats that help the company inform its “realistic and safety-relevant” threat modeling efforts. Any interested developers are welcome to submit their proposals to the company, which plans to evaluate them on a rolling basis.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…
AMD Zen 6 chips could be here sooner than you think
The AMD Ryzen 7 5700 propped up against an action figure.

Last month at Computex, AMD announced its Zen 5-based desktop and mobile processors, set for launch later this month. Shortly after this announcement, details about their successor, code-named "Medusa," have emerged. According to leaks, Medusa will be part of the Zen 6 lineup and is expected to be released in late 2025, contrary to earlier rumors of a 2026 launch.

Sources cited by YouTuber Moore’s Law Is Dead suggest AMD plans to finalize the Zen 6 architecture by Q2 2025, with production possibly beginning later that year. Another source confirmed Medusa as a Zen 6 product, potentially targeting both laptops and the desktop AM5 platform. Additionally, Strix Halo and Medusa Halo, based on Zen 5 and Zen 6 architectures, are expected to use TSMC's N3E (enhanced 3nm process).

Read more
This 15-inch Acer laptop is down to $400 from $630
The Acer Aspire Vero on a table.

With Independence Day coming up fast, 4th of July deals are popping up all over the place. And fortunately, if you’ve been looking for a great midrange laptop to bring to work or school, there are numerous 4th of July laptop deals to choose from. In fact, we found a particularly great one at Best Buy that we can’t help but discuss.

While the sale lasts, you’ll be able to score the Acer 15.6-inch Aspire Vero for only $400. Not only will you save $230 off the normal price, but you’ll be the proud owner of a new, fast, and reliable Windows laptop!

Read more
Best gaming PC deals: Lenovo Legion, ASUS ROG, Acer Predator
young woman playing video games on a PC

While you could always build a gaming PC from scratch, that can take a lot of time and effort, especially for those who don't really have a lot of tech-savvy and don't want to fuss around with costly parts. Luckily, there are a lot of excellent pre-built gaming PCs on the market that you can check out, especially since many of them have a lot of discounts and sales going on. That's why we've collected some of our favorite desktop computer deals and put them below, with many of these, bar the more entry-level ones without some compromises, being able to play the best PC games on the market.

Once you've grabbed a pre-built, check out gaming monitor deals for a chance to save on a nice display. If the machine you pick up needs some upgrades, you can save with GPU deals, SSD deals, and RAM deals.
Best gaming PC deal for entry-level gamers
CyberPowerPC Gamer Master Gaming Desktop -- $650, was $700

Read more