🍪
cookielau
  • Introduction
  • Machine Learning
    • Distributed
      • Bookmarks
    • NLP
      • Transformers
    • MLC
      • Tensor Program Abstraction
      • End-to-End Module Execution
  • Framework
    • PyTorch
      • Bookmarks
      • Model
      • Shared
      • Miscellaneous
    • Tensorflow
      • Bookmarks
      • Model
      • Shared
      • Miscellaneous
    • CUDA
      • Bookmarks
    • DeepSpeed
    • Bagua
      • Model
      • Optimizer
    • Others
      • Bookmarks
  • About Me
    • 2022-04-28
  • Random Thoughts
  • Archives
    • CPP
      • Bookmarks
      • Container
      • Algorithm
      • FILE CONTROL
      • Virtual Table
      • Assembly
      • Key Words
      • Problems
      • Others
    • JAVA
      • String Container
      • Maps
    • PYTHON
      • Bookmarks
      • Python Tools
        • Batch Rename
        • Combine Excel
        • Excel Oprations
        • Read Write Excel
        • Rotate PDF
      • Library
        • Pandas Notes
        • Numpy Notes
        • Json Notes
      • Spider
        • Selenium Install
        • Selenium Locating
        • Selenium Errors
        • Selenium Basics
      • Django
        • Start Up
      • Others
    • LINUX
      • Installation
      • Cli Tools
      • WSL
      • Bugs
    • JUNIOR2
      • Economics
        • Chapter 0x01 经济管理概述
        • Chapter 0x02 微观市场机制分析
        • Chapter 0x03 生产决策与市场结构
        • Chapter 0x04 宏观经济市场分析
        • Chapter 0x05 管理的职能
        • Chapter 0x06 生产系统结构与战略
        • Chapter 0x0b 投资项目经济评价
        • Chapter 0x0f 投资项目经济评价
      • Computer Network
        • 概述
        • 分层模型
        • 物理层
        • 数据链路层
        • 网络层
        • 传输层
        • 应用层
        • HTTP(s)实验
        • [Practice]
      • Software Engineering
        • Introduction
        • Demand Analysis
        • Task Estimation
        • Presentation
      • Network Security
        • Chapter 0x01 概述
        • Chapter 0x02 密码学
        • Chapter 0x03 公钥体制
        • Chapter 0x04 消息认证
        • Chapter 0x05 密钥管理
        • Chapter 0x06 访问控制
        • Assignments
      • x86 Programming
        • Basic Knowledge
        • Program Design
        • System Interruption
        • Frequently used functions
    • MD&LaTex
      • Markdown
      • LaTex
    • NPM
      • NPM LINK
    • MyBlogs
      • 2020BUAA软工——“停下来,回头看”
      • 2020BUAA软工——“初窥构建之法”
      • 2020BUAA软工——“上手软件工程,PSP初体验!”
      • 2020BUAA软工——“深度评测官”
      • 2020BUAA软工——“并肩作战,平面交点Pro”
    • SC
      • PAC 2022
        • Lectures
      • OpenMP & MPI
        • MPI Overview
        • Message Passing Programming
        • OpenMP Overview
        • Work Sharing Directives
        • Annual Challenge
        • Future Topics in OpenMP
        • Tasks
        • OpenMP & MPI
    • Hardware
      • Nvidia GPU
        • Frequent Error
        • Memory Classification
        • CUDA_7_Streams_Simplify_Concurrency
        • Optimize_Data_Transfers_in_CUDA
        • Overlap_Data_Transfers_in_CUDA
        • Write_Flexible_Kernels_with_Grid-Stride_Loops
        • How_to_Access_Global_Memory_Efficiently
        • Using_Shared_Memory
      • Intel CPU
        • Construction
        • Optimization
        • Compilation
        • OpenMP
    • English
      • Vocab
      • Composition
    • Interview
      • Computer Network
Powered by GitBook
On this page
  • Manual
  • HPC
  • Topic
  • Reduction
  • Loop
  • Taskyield
  • Critical Section
  • Task Dependency
  • SIMD
  • Others

Was this helpful?

  1. Archives
  2. Hardware
  3. Intel CPU

OpenMP

OpenMP is an Application Program Interface (API), jointly defined by a group of major computer hardware and software vendors. OpenMP provides a portable, scalable model for developers of shared memory

Manual

  1. OpenMP Author: Blaise Barney, Lawrence Livermore National Laboratory

  2. OpenMP Exercise

  3. OpenMP Application Programming Interface Examples

  4. Advanced OpenMP® Tutorial

  5. Advanced OpenMP

  6. Introduction to High-Performance Scientific Computing

  7. Lecture 4: Principles of Parallel Algorithm Design (part 1)

  8. Tutorial on OpenMP programming

HPC

  1. Introduction to High Performance Scientific Computing

Topic

Reduction

  1. A Comparison of Parallelization Techniques for Irregular Reductions

  2. Adaptive Reduction Parallelization Techniques

  3. Speculative Separation for Privatization and Reductions

  4. The Art of Data Replication

  5. Improving the Performance of OpenMP by Array Privatization

  6. On Algorithmic Reductions in Task-parallel Programming Models

  7. OpenMP Reduction Operations

  8. OpenMP with parallel reduction in for loop

  9. OpenMP 4 SIMD Programming - Easy

P.S. reduction貌似只能应用于简单数据类型,复杂如complex就无能为力,编译会报错

Loop

  1. How does OpenMP handle nested loops?

  2. OpenMP topic: Loop parallelism

Taskyield

  1. The Impact of Taskyield on the Design of Tasks Communicating Through MPI

Critical Section

  1. OpenMP Critical Sections

Task Dependency

  1. rERA: An Optimization Algorithm of Task Dependency Graph for Scheduling

SIMD

  1. OpenMP: Vectorization and #pragma omp simd

Others

  1. FFTW Manual

  2. NUFFT, Discontinuous Fast Fourier Transform, and Some Applications

  3. Rapid Gridding Reconstruction With a Minimal Oversampling Ratio

PreviousCompilationNextEnglish

Last updated 4 years ago

Was this helpful?