nvidia

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

12 Feb 2026

venturebeat.com

Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be retrofitted onto existing models in hours.

NVIDIA Rubin Is The Most Advanced AI Platform On The Planet: Up To 50 PFLOPs With HBM4, Vera CPU With 88 Olympus Cores, And Delivers 5x Uplift Vs Blackwell

5 Jan 2026

wccftech.com

NVIDIA is formally announcing its Rubin AI platform today, which will be the heart of next-gen Data Centers, with a 5x upgrade over Blackwell.

Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

15 Sep 2025

semianalysis.com

Nvidia announced the Rubin CPX, a solution that is specifically designed to be optimized for the prefill phase, with the single-die Rubin CPX heavily emphasizing compute FLOPS over memory bandwidth…

NVIDIA Unveils Its Newest ‘Rubin CPX’ AI GPUs, Featuring 128 GB GDDR7 Memory & Targeted …

10 Sep 2025

wccftech.com

NVIDIA has surprisingly unveiled a rather 'new class' of AI GPUs, featuring the Rubin CPX AI chip that offers immense inferencing power.

NVIDIA Blackwell Ultra “GB300” GPU, The Fastest AI Chip, Detailed: Dual Reticle GPU With Ove…

25 Aug 2025

wccftech.com

NVIDIA has provided an in-depth look at its fastest chip for AI, the Blackwell GB300, which is 50% faster than GB200 & packs 288 GB memory.

Nvidia just dropped a new AI model that crushes OpenAI’s GPT-4—no big launch, just big results

17 Oct 2024

venturebeat.com

Nvidia quietly launched a groundbreaking AI model that surpasses OpenAI’s GPT-4 and Anthropic’s Claude 3.5, signaling a major shift in the competitive landscape of artificial intelligence.

18.04 Screen remains blank after wake up from suspend - Ask Ubuntu

14 Mar 2022

askubuntu.com

So, when I suspend my laptop, then wake it up later, my laptop does turn on, I'm able to, for example, turn up and down the volume with audio confirmation using the keyboard, but my screen remains ...

How to Accelerate Signal Processing in Python

9 Apr 2021

developer.nvidia.com

This post is the seventh installment of the series of articles on the RAPIDS ecosystem. The series explores and discusses various aspects of RAPIDS that allow its users to solve ETL (Extract, Transform…

State of the art NLP at scale with RAPIDS, HuggingFace and Dask

4 Apr 2021

medium.com

See how to build end-to-end NLP pipelines in a fast and scalable way on GPUs — from feature engineering to inference.

Using RAPIDS with PyTorch

15 Mar 2021

developer.nvidia.com

In this post we take a look at how to use cuDF, the RAPIDS dataframe library, to do some of the preprocessing steps required to get the mortgage data in a format that PyTorch can process so that we…

Beginner’s Guide to Querying Data Using SQL on GPUs in Python

15 Mar 2021

developer.nvidia.com

Historically speaking, processing large amounts of structured data has been the domain of relational databases. Databases, consisting of tables that can be joined together or aggregated…

Python Pandas Tutorial – Beginner’s Guide to GPU Accelerated DataFrames for

12 Mar 2021

developer.nvidia.com

This series on the RAPIDS ecosystem explores the various aspects that enable you to solve extract, transform, load (ETL) problems, build machine learning (ML) and deep learning (DL) models…

Nvidia just dropped a bombshell: Its new AI model is open, massive, and ready to rival GPT-4

24 Oct 2002

venturebeat.com

Nvidia has released NVLM 1.0, a powerful open-source AI model that rivals GPT-4 and Google’s systems, marking a major breakthrough in multimodal language models for vision and text tasks.

nvidia — my Raindrop.io articles