범용 응용프로그램 실행 시 하드웨어 구성과 분기 처리 기법에 따른 GPU 성능 분석

논문상세정보

' 범용 응용프로그램 실행 시 하드웨어 구성과 분기 처리 기법에 따른 GPU 성능 분석' 의 참고문헌

  • 응용프로그램 실행에 따른 CPU/GPU의 온도 및 컴퓨터 시스템의 에너지 효율성 분석
    한국컴퓨터정보학회논문지 17 (5) : 9 ~ 19 [2012]
  • 병렬 응용프로그램 실행 시 GPU 구조에 따른 성능 분석
    한국콘텐츠학회 논문지 12 (5) : 10 ~ 21 [2012]
  • 고성능 GPU의 성능 저하 요인에 대한 정량적 분석
    정보과학회 컴퓨팅의 실제 논문지 18 (4) : 282 ~ 287 [2012]
  • https://developer.nvidia.com/cg-toolkit
  • http://www.simplescalar.com
  • http://www.opengl.org/registry/doc/GLSLangS pec.Full.1.20.8.pdf
  • http://www.nvidia.com/object/product_quadro_ fx_5800_us.html
  • http://www.nvidia.com/content/cudazone/
  • http://www.khronos.org/opencl/
  • http://www.amd.com/stream
  • http://nocs.stanford.edu/booksim.html
  • http://msdn2.microsoft.com/en-us/library/bb50 9638.aspx
  • http://developer.nvidia.com/object/cuda_3_1_do wnloads.html
  • http://developer.download.nvidia.com/compute/ cuda/sdk/website/samples.html
  • Very high-speed computing syste ms
    IEEE 54 (12) : 1901 ~ 1909 [1966]
  • System and method for managing divergent threads in a SIMD architecture
    US Patent 7353369 1 [2008]
  • Simultaneous multithreading : maximizing on-chip parallelism
    22th International Symposium on Computer Architecture : 392 ~ 403 [1995]
  • Performance analysis of the CM-2, a massively parallel SIMD computer
    6th International Conference on Supercomputing : 45 ~ 52 [1992]
  • Parallel Processing for Integral Imaging Pickup Using Multiple Threads
    International Journal of Contents 5 (4) : 30 ~ 34 [2009]
  • Method for conditional branch execution in SIMD vector processors
    US Patent 4435758 6 [1984]
  • Method and system for programmable pipelined graphics processing with branching instructions
    US Patent 6947047 20 [2005]
  • Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
    40th Microarchitecture : 407 ~ 420 [2007]
  • Clock rate versus IPC: the end of the road for conventional microArchitectures
    27th International Symposium on Computer Architecture : 248 ~ 259 [2000]
  • Chap - a SIMD graphics processor
    11th Annual Conference on Computer Graphics (SIGGRAPH) : 77 ~ 82 [1984]
  • Brook for GPUs: stream computing on graphics hardware
    31th Annual Conference on Computer Graphics (SIGGRAPH) : 777 ~ 786 [2004]
  • Available instruction-level parallelism for superscalar and superpipelined machines
    3th International Conference on Architectural Support for Programming Languages and Operating Systems : 272 ~ 282 [1989]
  • Analyzing CUDA Workloads Using a Detailed GPU Simulator
    9th International Symposium on Performance Analysis of Systems and Software : 163 ~ 174 [2009]
  • A user-programmable vertex engine
    28th Annual Conference on Computer Graphics (SIGGRAPH) : 149 ~ 158 [2001]
  • A study of control independence in superscalar processors
    5th International Symposium on High-Performance Computer Architecture : 115 ~ 124 [1999]
  • A study of control independence in superscalar processors
    5th International Symposium on High-Performance Computer Architecture : 115 ~ 124 [1999]
  • A performance study of general purpose applications on graphics processors using CUDA
    Journal of Parallel and Distributed Computing 68 (10) : 1370 ~ 1380 [2008]
  • A Survey of General-Purpose Computation on Graphics Hardware
    Eurographics 2005, State of the Art Reports : 21 ~ 51 [2005]