Inference Test - Search News

FPGA-based inference accelerator outscores GPUs and ASICs in MLPerf benchmark

Silicon Valley-based startup Mipsology announced today that its Zebra AI inference accelerator achieved the highest efficiency based on the MLPerf inference test. The benchmark, which measures ...

SiliconANGLE

MLCommons releases results of its latest MLPerf AI inference benchmark test

MLCommons today released the latest results of its MLPerf Inference benchmark test, which compares the speed of artificial intelligence systems from different hardware makers. MLCommons is an industry ...

VentureBeat

Train-to-Test scaling explained: How to optimize your end-to-end AI compute budget for inference

The standard guidelines for building large language models (LLMs) optimize only for training costs and ignore inference costs. This poses a challenge for real-world applications that use ...

EurekAlert!

Common way to test for leaks in large language models may be flawed

This slide shows how a membership inference attack might start. Assessing the product of an app asked to generate an image of a professor teaching students in “the style of” artist Monet could lead to ...

VentureBeat

Hugging Face shows how test-time scaling helps small language models punch above their weight

In a new case study, Hugging Face researchers have demonstrated how small language models (SLMs) can be configured to outperform much larger models. Their findings show that a Llama 3 model with 3B ...

Gizmochina

NVIDIA GB300 GPUs deliver huge AI efficiency gains in Deepseek R1 inference test

NVIDIA’s latest Blackwell-based GB300 GPUs are starting to show what they can do, and early results point to a massive jump in efficiency compared to the company’s previous generation. A recent ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results