The NVIDIA command-line profiler, nvprof, now supports collection of any number of events and metrics during a single run of a CUDA application. The NVIDIA Visual Profiler and the command-line profiler, nvprof, now support metrics that report the floating-point operations performed by a kernel. The NVIDIA Visual Profiler and the command-line profiler, nvprof, now support power, thermal, and clock profiling. The application analysis performed by the NVIDIA Visual Profiler has been enhanced. The NVIDIA Visual Profiler now supports applications that use CUDA Dynamic Parallelism. CUDA-MEMCHECK now supports displaying error information as errors occur during program execution instead of waiting for program termination to display output. The racecheck-report mode option of the racecheck tool can be used to enable the generation of analysis records. The racecheck tool in CUDA-MEMCHECK now has support for SM 3.5 devices. #NVIDIA CUDA TOOLKIT LINUX INSTALL CODE#Return code cudaErrorNotReady can be returned by cudaStreamQuery() and cudaEventQuery() in the case where the stream/event being waited on is still busy. The info cuda contexts command was added. The list of threads returned by the info cuda threads can now be narrowed to the threads currently at a breakpoint. Multiple CUDA-GDB instances can be now used for debugging ranks of an MPI application that uses a separate GPU for each rank. CUDA-GDB now supports remote debugging. To represent the parent/child kernel information, two commands were added. Multiple CUDA-GDB sessions can simultaneously debug CUDA applications on the same GPU. Debugging of long-running or indefinite CUDA kernels that would otherwise encounter a launch timeout is now possible CUDA-GDB can now be used to debug a CUDA application on the same GPU that is rendering the desktop GUI. Clang is now supported as a host compiler on Mac OS. #NVIDIA CUDA TOOLKIT LINUX INSTALL MAC OS#The default nvcc.profile no longer includes -lcudart (on Linux and Mac OS X) and cudart.lib (on Windows), and the use of the CUDA runtime is now controlled by the option -cudart (-cudart). Documents for the CUDA Compiler SDK (including the specification for LLVM IR, an API document for libnvvm, and an API document for libdevice) are provided. A set of samples that illustrate the use of the compiler SDK are provided. A set of libraries, libdevice.bc, that implement the common math functions for devices in the LLVM bitcode format are provided. An optimizing compiler library (libnvvm.so, nvvm.dll/nvvm.lib, libnvvm.dylib) and its header file nvvm.h are provided for compiler developers who want to generate PTX from a program written in NVVM IR, which is a compiler internal representation based on LLVM. The version of Thrust included with the current CUDA toolkit was upgraded from version 1.5.3 to version 1.7.0. The cublasgtsv() routines have been replaced with a version that supports pivoting. #NVIDIA CUDA TOOLKIT LINUX INSTALL GENERATOR#CURAND 5.5 introduces support for the random number generator Philox4x32-10. CUFFT 5.5 provides FFTW3 interfaces that enables applications using FFTW to gain performance with NVIDIA CUFFT with minimal changes to program source code. The new calls allow creation of a CUFFT plan handle separate from the actual creation of the plan, allow insertion of new calls to set plan attributes before the work of plan creation is done, and allow advanced users more control over memory space allocation. The limitation on the dimension n of the routine cublasgetrfbatched() has been removed. The routines cublasmatinvBatched() have been added to the CUBLAS Library. Installations can be updated when a new version of the CUDA Toolkit is available. deb installation packages for all the supported Linux distributions, except Ubuntu 10.04 and RHEL 5.5. #NVIDIA CUDA TOOLKIT LINUX INSTALL DRIVER#The CUDA Toolkit and the CUDA Driver are now available for installation as. The CUDA Sample projects have makefiles that are now more self-contained and robust. The Toolkit is using a new installer on Windows. Adds support for Linux on the ARMv7 Architecture.
0 Comments
NTSC is used for the American, Japanese, and Korean versions of the game, and PAL is used for the European version.
|
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |