Channel Avatar

Vadim Karpusenko @UCxD2TEeLQopByu6jzKs65UA@youtube.com

1K subscribers - no pronouns :c

Mostly Intel Xeon Phi related stuff, but principals of paral


06:49
Attention of Convolutional Neural Network (VGG16)
01:40
Episode 5.19 - Closing words
05:51
Episode 5.18 - Additional Topic- Load Balancing in Heterogeneous Systems
03:03
Episode 5.17 - Optimization of Communication- MPI
06:58
Episode 5.16 - Optimization of Communication- Offload
05:04
Episode 5.15 - NUMA and Allocation on First Touch
05:37
Episode 5.14 - Example of Cache-Oblivious Recursion
06:48
Episode 5.13 - Example of Loop Tiling
07:16
Episode 5.12 - Optimization of Memory Access
07:15
Episode 5.11 - Thread affinity control
07:18
Episode 5.10 - Do you have enough parallelism in your code
05:18
Episode 5.9 - Elimination of False Cache Line Sharing
07:30
Episode 5.8 - Optimization of Synchronization in Multithreaded applications
02:55
Episode 5.7 - Vectorization Tuning Knobs
06:16
Episode 5.6 - Strip-Mining for Vectorization
09:36
Episode 5.5 - Optimization of Vectorization: Regularizing Pattern
04:51
Episode 5.4 - Optimization of Vectorization: Alignment and Hints
06:10
Episode 5.3 - Optimization of Vectorization: Data Structures
10:05
Episode 5.2 - Scalar Tuning and General Optimization
04:26
Episode 5.1 - Optimization roadmap
06:50
Episode 4.9 - Distributed-memory Parallelism and MPI
04:37
Episode 4.8 - Parallel Reduction
06:42
Episode 4.7 - Race Conditions and Mutexes
02:02
Timelapse of video course recording from March 8th, 2015
01:36
teaser-v3ct0r1z@t10n
02:10
A teaser video of webinar series
03:21
Episode 4.6 - Fork-Join Model. OpenMP Tasks
04:34
Episode 4.5 - Parallel Loops, Private and Shared Variables, Scheduling
08:46
Episode 4.3 - Assumed Vector Dependence and Pointer Disambiguation
03:45
Episode 4.4 - Thread Parallelism and OpenMP
06:53
Episode 4.2 - Automatic Vectorization and Array Notation
06:03
Episode 4.1 - SIMD Parallelism and Intrinsics
06:00
Episode 3.9 - File IO in MPI Applications on Coprocessors
08:43
Episode 3.8 - Heterogeneous Programming using MPI
05:28
Episode 3.7 - Asynchronous Offload
07:07
Episode 3.6 - Shared Virtual Memory
07:35
Episode 3.5 - Additional Offload Controls
08:07
Episode 3.4 - Explicit Offload
09:11
Episode 3.3 - Native MPI Applications
07:26
Episode 3.2 - Native Coprocessor Applications
09:03
Episode 3.1 - Overview of Programming Options
02:12
How To Record Video Lectures
08:26
Episode 2.1 - Purpose of the MIC architecture
06:42
Episode 2.6 - Knights Landing, the Next Manycore Architecture
11:03
Episode 2.5 - Will My Application Benefit from the MIC Architecture?
06:52
Episode 2.4 - Software Tools for Intel Xeon Phi Coprocessors
05:40
Episode 2.3 - Vector Instruction Support in Intel Architectures
08:27
Episode 2.1 - Purpose of the MIC architecture
05:04
Episode 2.2 - Details of Intel MIC Architecture
01:12
February 2015 in Sunnyvale, California.
06:57
Episode 08 - Native Coprocessor Applications
05:32
Episode 03 - IMCI set and VPUs
05:00
Episode 02 - Detail of Intel MIC architecture
00:45
slide51
12:44
Strip-mining optimization for vectorization (Russian)
00:24
Sign up for SC14
07:59
Sign-up ad for SC14
07:33
Lecture - strip-mining for vectorization
05:16
Practical Lab - strip-mining for vectorization
01:06
High Peaks, Pinnacles National Park, San Benito County, CA