Google’s TurboQuant Compression May Support Faster Inference, Same Accuracy on Less Capable Hardware
Google Research unveiled TurboQuant, a novel quantization algorithm that compresses large language models’ Key-Value caches ...
A research team has developed a Gaussian Splatting processing platform that supports end-to-end processing from data acquisition to multi-platform rendering. Their framework provides a solid ...
XDA Developers on MSN
Nvidia's new VRAM compression trick just gave it a reason to keep selling 8GB GPUs
It works like magic, but won't renew your old 8GB card's lease on life ...
XDA Developers on MSN
Your 8GB GPU isn't as doomed as everyone says, and Nvidia just proved it
Neural Texture Compression might make 8GB GPUs more viable in the long term ...
Researchers have developed a dynamic range compression dual-domain attention network for enhancing tunnel images under extreme exposure conditions, a problem that continues to challenge transportation ...
Large language models (LLMs) aren’t actually giant computer brains. Instead, they are massive vector spaces in which the ...
Abstract: The widespread deployment of phasor measurement units (PMUs) has introduced unprecedented challenges in handling the transmission and storage of extensive synchrophasor data. Addressing ...
If Google’s AI researchers had a sense of humor, they would have called TurboQuant, the new, ultra-efficient AI memory compression algorithm announced Tuesday, “Pied Piper” — or, at least that’s what ...
Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model ...
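Both the TurboQuant and Nvidia items above describe compressing the Key-Value (KV) cache that language models keep for conversation history. The articles don't detail either algorithm, but the general technique they build on is quantization: storing cached values at a lower bit width plus a scale factor. The sketch below is a generic, illustrative per-tensor int8 quantizer, not the actual TurboQuant or Nvidia method; all function names here are hypothetical.

```python
# Illustrative sketch of per-tensor symmetric int8 quantization,
# the basic idea behind KV-cache compression schemes. NOT the
# actual TurboQuant or Nvidia algorithm: real systems use far more
# sophisticated schemes to reach the reported compression ratios.

def quantize(values, bits=8):
    """Map floats to signed integers sharing a single scale factor."""
    qmax = 2 ** (bits - 1) - 1              # 127 for int8
    scale = max(abs(v) for v in values) / qmax or 1.0
    q = [round(v / scale) for v in values]  # each fits in one byte
    return q, scale

def dequantize(q, scale):
    """Recover approximate floats from the integers and the scale."""
    return [v * scale for v in q]

# One byte per cached value instead of four (plus one float scale):
# roughly a 4x memory reduction; lower bit widths compress further
# at the cost of larger rounding error.
kv = [0.13, -1.92, 0.55, 1.04]
q, scale = quantize(kv)
restored = dequantize(q, scale)
```

Errors introduced by this scheme are bounded by half the scale factor per value, which is why compressed caches can often preserve model accuracy.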