Click here to Skip to main content
14,088,376 members

CUDA

Great Reads

by Nick Kopp
This article builds upon the earlier High Performance Queries: GPU vs. PLINQ vs. LINQ and ports this to also support OpenCL devices and adds benchmarking so you can easily compare performance.
by Ryan S White
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of your Cuda code.
by Nick Kopp
Ultra high quality frequency domain image rotation on a GPU.
by Intel Corporation
In this blog post, we highlight one particular class of low precision networks named binarized neural networks (BNNs), the fundamental concepts underlying this class, and introduce a Neon CPU and GPU implementation.

Latest Articles

by Apriorit Inc, Vadym Zhernovyi
The experience of improving Mask R-CNN performance six to ten times by applying TensorRT
by Vangos
This post will show you how to build OpenCV for Windows with CUDA.
by Intel Corporation
In this blog post, we highlight one particular class of low precision networks named binarized neural networks (BNNs), the fundamental concepts underlying this class, and introduce a Neon CPU and GPU implementation.

All Articles

Sort by Updated

CUDA 

10 Dec 2018
Apriorit Inc, Vadym Zhernovyi
The experience of improving Mask R-CNN performance six to ten times by applying TensorRT
2 Nov 2018
Vangos
This post will show you how to build OpenCV for Windows with CUDA.
3 May 2017
Intel Corporation
In this blog post, we highlight one particular class of low precision networks named binarized neural networks (BNNs), the fundamental concepts underlying this class, and introduce a Neon CPU and GPU implementation.
3 Jan 2017
Ryan S White
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of your Cuda code.
10 Dec 2016
Arthur V. Ratz
In this article, we'll demonstrate an approach the allows to increase the performance (up to 600%) of the code that implements the conventional distribution counting algorithm (DCA) using NVIDIA CUDA 8.0 Runtime API
30 Nov 2016
Dino Konstantopoulos
Running Theano with an Nvidia 1070 GPU on Windows 10, with CUDA 8 and Visual Studio 2015
25 Jul 2016
Igor Gribanov
Performing linear static analysis on a tetrahedral mesh with a little bit of help from a third-party solver.
12 Apr 2016
Shao Voon Wong
Finding lexicographical permutations on GPU
18 Dec 2013
Joren Heit
A Hybrid Framework Code-Generator for CUDA
21 Sep 2013
Mark H Bishop
Tutorial: GPU computing with JCuda and Nsight (Eclipse)
17 Sep 2013
Nick Kopp
This article builds upon the earlier High Performance Queries: GPU vs. PLINQ vs. LINQ and ports this to also support OpenCL devices and adds benchmarking so you can easily compare performance.
16 Sep 2013
Nick Kopp
Ultra high quality frequency domain image rotation on a GPU.


Advertise | Privacy | Cookies | Terms of Service
Web06 | 2.8.190518.1 | Last Updated 19 May 2019
Copyright © CodeProject, 1999-2019
All Rights Reserved.
Layout: fixed | fluid