|
Intel® Trace Analyzer and Collector |
Intel Trace Analyzer and Collector 8.0 Update 1 is enhanced with new features that accelerate the analysis and tuning cycle of MPI-based cluster applications and enables programmers to analyze the effect of advanced interconnects on application performance. The Intel Trace Analyzer and Collector 8.0 Update 1 load imbalance diagram and the ideal interconnect simulator help MPI programmers identify further optimization opportunities. |
|
Intel® MPI Library |
Intel® MPI Library 4.0 update 1 provides new levels of performance and flexibility for applications that execute on clusters of Intel® platforms. The library achieves these advantages by improved interconnect support, faster on-node messaging, and an application- tuning capability that adjusts to the cluster architecture and application structure. This library features multirail InfiniBand* (IB) support, and enhancements to the native IB layer for lower communication latencies. In addition, the library maintains compatibility with previous Intel MPI 3.x versions while providing an enhanced library architecture that will enable interconnect vendors and Intel processors to better expose their future maximum performance and functionality. |
|
Intel® Composer XE compilers and libraries |
Multicore optimizations - Intel® compilers have built-in optimization technologies and multithreading support that help create code that runs best on the latest Intel® multicore processors. Intel® Fortran Compiler XE 12.0 provides support for CAF on clusters as part of its support of key features of Fortran 2008 standard.
Optimize applications - Intel compilers offer the breadth of advanced optimization, multithreading, and processor support that includes automatic processor dispatch, vectorization, auto- parallelization, data prefetching, and loop unrolling. It is possible to take advantage of hybrid models of parallelism, mixing process, task, data, and vector parallelism with MPI and threading models like OpenMP, Intel® Cilk™ Plus, and Intel® Threading Building Blocks methods to boost application performance on clusters made up of two-, four-, and eight-socket multicore nodes, each node being an SMP. |
|
Intel® Math Kernel Library (Intel® MKL) |
Intel Math Kernel Library 10.3 (Intel® MKL 10.3) is a library of highly optimized, extensively threaded math routines for science, engineering, and financial applications that require maximum performance. Core math functions include BLAS, LAPACK, ScaLAPACK, Sparse Solvers, Fast Fourier Transforms, Cluster FFTs, Vector Math, and more. Intel MKL version 10.3 is a revision and offers support for Intel® Advanced Vector Extensions (Intel® AVX), C extensions for LAPACK and PARDISO, Summary Statistics Library, and enhancements to several routines like DGEMM, FFTS, and RNGs. |
|
Intel® Integrated Performance Primitives (Intel® IPP) |
This is an extensive library of multicore-ready, highly optimized software functions for multimedia data processing and communications applications. |
|
Intel® Debugger |
The debugger improves the efficiency of the debugging process on code that has been optimized for Intel® architecture and includes new threaded code debugging features and a new GUI. |
|