tensorrt
PulseAugur coverage of tensorrt — every cluster mentioning tensorrt across labs, papers, and developer communities, ranked by signal.
1 day with sentiment data
-
Microsoft engineer compares TensorRT, vLLM, Triton, ONNX for GPU inference
This article compares four key GPU inference frameworks: NVIDIA's TensorRT, vLLM, Triton, and ONNX Runtime. It delves into their architectures, performance characteristics, and suitability for different large language m…
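The comparison centers on how each framework serves a model on the GPU. As a minimal, framework-agnostic illustration (not taken from the article), the sketch below runs an ONNX model on the GPU with ONNX Runtime; the model file name and input shape are placeholders.

```python
import numpy as np
import onnxruntime as ort

# Minimal sketch: run an ONNX model on the GPU via ONNX Runtime.
# "model.onnx" and the (1, 3, 224, 224) input shape are placeholders.
session = ort.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

input_name = session.get_inputs()[0].name
dummy_input = np.random.rand(1, 3, 224, 224).astype(np.float32)

outputs = session.run(None, {input_name: dummy_input})
print(outputs[0].shape)
```

TensorRT, vLLM, and Triton expose different entry points (engine building, serving loops, model repositories), but the basic load-then-run shape of the workflow is similar.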
-
New satellite system uses AI for real-time wildfire detection under strict constraints
Researchers have developed a real-time wildfire detection system for use on satellites, designed to operate under strict on-board constraints. The system utilizes a lightweight dense representation learning approach, sp…
-
New DEEP-GAP study compares NVIDIA T4 and L4 GPU inference performance
A new research paper introduces DEEP-GAP, a methodology for evaluating GPU inference performance. The study systematically compares the NVIDIA T4 and L4 GPUs using various deep learning models and precision modes. Resul…
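The paper's exact protocol is not reproduced here; as a hedged sketch of the general idea (timing one model at different precision modes on whichever GPU is present), something like the following PyTorch snippet could be used, with the model and batch size as placeholders.

```python
import time
import torch
import torchvision.models as models

# Generic latency-measurement sketch (not the DEEP-GAP methodology itself):
# time repeated forward passes at a given precision on the current GPU.
def measure_latency(model, dtype, runs=50, warmup=10):
    model = model.to("cuda", dtype=dtype).eval()
    x = torch.randn(1, 3, 224, 224, device="cuda", dtype=dtype)
    with torch.no_grad():
        for _ in range(warmup):
            model(x)
        torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(runs):
            model(x)
        torch.cuda.synchronize()
    return (time.perf_counter() - start) / runs * 1000  # ms per inference

for dtype in (torch.float32, torch.float16):
    latency = measure_latency(models.resnet50(weights=None), dtype)
    print(f"{dtype}: {latency:.2f} ms")
```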
-
AI models advance plant disease detection with new datasets and efficient distillation
Researchers have developed new methods for plant leaf disease classification to aid in early detection and treatment. One approach involves training a new base model using the DenseNet201 architecture on a custom datase…
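For readers unfamiliar with the setup, the snippet below is an illustrative sketch only: it starts from an ImageNet-pretrained DenseNet201 and swaps in a new classifier head for a leaf-disease dataset. The class count and the freeze-the-backbone choice are assumptions, not details from the paper.

```python
import torch.nn as nn
import torchvision.models as models

# Illustrative fine-tuning setup; num_classes is a placeholder value.
num_classes = 38
model = models.densenet201(weights=models.DenseNet201_Weights.IMAGENET1K_V1)
model.classifier = nn.Linear(model.classifier.in_features, num_classes)

# Freeze the convolutional backbone and train only the new head as a
# simple starting point for a small custom dataset.
for param in model.features.parameters():
    param.requires_grad = False
```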
-
Object detection models show mixed robustness to quantization and input degradations
A new study investigates how post-training quantization (PTQ) affects the robustness of YOLO object detection models when faced with real-world input degradations like noise and blur. Researchers evaluated various preci…
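The evaluation idea can be sketched generically: perturb inputs with increasing noise and blur, then re-run the (possibly quantized) detector at each severity level. The snippet below is an assumption-laden illustration; `run_detector` is a hypothetical callable standing in for whatever YOLO variant is under test, not an API from the study.

```python
import torch
import torchvision.transforms as T

# Sketch of a robustness sweep over input degradations.
def add_gaussian_noise(img, sigma):
    # img is a float tensor in [0, 1] of shape (C, H, W)
    return (img + torch.randn_like(img) * sigma).clamp(0.0, 1.0)

def robustness_sweep(run_detector, image, sigmas=(0.0, 0.05, 0.1, 0.2)):
    results = {}
    blur = T.GaussianBlur(kernel_size=5)
    for sigma in sigmas:
        degraded = blur(add_gaussian_noise(image, sigma))
        results[sigma] = run_detector(degraded)
    return results
```

Comparing the per-severity results of a full-precision model against its quantized counterpart is one way to surface the "mixed robustness" the headline refers to.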
-
NVIDIA boosts Unreal Engine AI speed 5x; Nadella redefines AI success metrics
NVIDIA has introduced TensorRT for RTX, a technology designed to accelerate Neural Network Engine (NNE) inference within Unreal Engine by up to five times. This advancement aims to significantly reduce latency for real-…
-
Optimizing Transformer Inference: Techniques for Faster, Cheaper Large Models
Large transformer models present significant inference challenges due to their substantial memory footprint and attention computation that scales quadratically with input length. Researchers and practitioners are exploring…
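The quadratic cost comes from self-attention itself: the score matrix has one entry per pair of positions. The minimal sketch below (standard scaled dot-product attention, not any specific optimization from the article) makes the (n × n) intermediate explicit.

```python
import torch

# Vanilla self-attention: the (n x n) score matrix is where both compute
# and attention-map memory grow with the square of the sequence length n.
def attention(q, k, v):
    scores = q @ k.transpose(-2, -1) / k.shape[-1] ** 0.5  # shape: (n, n)
    return torch.softmax(scores, dim=-1) @ v

n, d = 4096, 64
q = k = v = torch.randn(n, d)
out = attention(q, k, v)  # output: (n, d); intermediate scores were (4096, 4096)
print(out.shape)
```

Techniques such as KV caching, quantization, and attention variants with sub-quadratic cost all target this bottleneck from different angles.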