History log of /gem5/src/gpu-compute/wavefront.cc
Revision Date Author Comments
# 12697:cd71b966be1e 27-Apr-2018 Tony Gutierrez <anthony.gutierrez@amd.com>

style: fix amd license and style issues

Change-Id: I26136fb49f743c4a597f8021cfd27f78897267b5
Reviewed-on: https://gem5-review.googlesource.com/10463
Reviewed-by: Jason Lowe-Power <jason@lowepower.com>
Maintainer: Anthony Gutierrez <anthony.gutierrez@amd.com>


# 11883:3bfed693ff22 27-Feb-2017 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute: remove unnecessary member from class

The clang compiler complains that the wavefront member in
the GpuISA class is unused. This changeset removes the member,
because it does not appear to serve a purpose.


# 11696:80c30bd0c7d6 26-Oct-2016 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute: add gpu_isa.hh to switch hdrs, add GPUISA to WF

the GPUISA class is meant to encapsulate any ISA-specific behavior - special
register accesses, ISA-specific WF/kernel state, etc. - in a generic enough
way that it may be used in ISA-agnostic code.

gpu-compute: use the GPUISA object to advance the PC

the GPU model treats the PC as a pointer to individual instruction objects -
which are stored in a contiguous array - and not a byte address to be fetched
from the real memory system. this is ok for HSAIL because all instructions
are considered by the model to be the same size.

in machine ISA, however, instructions may be 32b or 64b, and branches are
calculated by advancing the PC by the number of words (4 byte chunks) it
needs to advance in the real instruction stream. because of this there is
a mismatch between the PC we use to index into the instruction array and
the actual byte-address PC the ISA expects. here we move the PC advance
calculation to the ISA so that differences in instruction sizes may be
accounted for in a generic way.
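
A minimal sketch of the split between the two PC views; the types and member
names below are invented for illustration and are not the model's real
interface:

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    // Illustrative only: the model indexes a contiguous array of decoded
    // instructions, while the machine ISA reasons about a byte PC. A
    // hypothetical ISA hook keeps the two views consistent.
    struct GpuStaticInstSketch {
        uint32_t instSizeBytes;   // 4 for a 32b encoding, 8 for a 64b one
    };

    class GpuIsaSketch {
      public:
        // Advance both views of the PC after executing one instruction.
        void advancePC(const GpuStaticInstSketch &inst) {
            bytePc += inst.instSizeBytes;   // what the ISA expects
            slotPc += 1;                    // index into the inst array
        }

        // A taken branch supplies a byte target; derive the array index by
        // walking the instruction sizes (kept simple for the sketch).
        void setBranchTarget(uint64_t targetBytePc,
                             const std::vector<GpuStaticInstSketch> &instArray) {
            uint64_t addr = 0;
            for (std::size_t i = 0; i < instArray.size(); ++i) {
                if (addr == targetBytePc) { slotPc = i; break; }
                addr += instArray[i].instSizeBytes;
            }
            bytePc = targetBytePc;
        }

        uint64_t bytePc = 0;  // byte-address PC the ISA expects
        uint64_t slotPc = 0;  // index into the contiguous instruction array
    };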


# 11695:0a65922d564d 26-Oct-2016 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute: add instruction mix stats for the gpu


# 11694:c3b4d57a15c5 26-Oct-2016 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute, hsail: call discardFetch() from the WF

because every taken branch causes fetch to be discarded, we move the call
into the WF to avoid having to call it from each and every branch
instruction type.


# 11692:e772fdcd3809 26-Oct-2016 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute: remove inst enums and use bit flag for attributes

this patch removes the GPUStaticInst enums that were defined in GPU.py.
instead, a simple set of attribute flags that can be set in the base
instruction class are used. this will help unify the attributes of HSAIL
and machine ISA instructions within the model itself.

because the static instruction now carries the attributes, a GPUDynInst
must carry a pointer to a valid GPUStaticInst, so a new static kernel launch
instruction is added, which carries the attributes needed to perform the
kernel launch.
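
A rough sketch of the attribute-flag idea; the flag names and classes below
are illustrative and do not mirror the real GPUStaticInst/GPUDynInst
interface:

    #include <cstdint>

    // Sketch of replacing per-ISA enums with attribute bits on the static
    // instruction; flag names here are made up for the example.
    class StaticInstSketch {
      public:
        enum Flag : uint64_t {
            IsBranch       = 1ULL << 0,
            IsMemFence     = 1ULL << 1,
            IsGlobalMem    = 1ULL << 2,
            IsSharedMem    = 1ULL << 3,
            IsKernelLaunch = 1ULL << 4
        };

        void setFlag(Flag f)       { flags |= f; }
        bool hasFlag(Flag f) const { return (flags & f) != 0; }

      private:
        uint64_t flags = 0;
    };

    // A dynamic instruction queries attributes through its static pointer,
    // so HSAIL and machine-ISA instructions share one attribute scheme.
    class DynInstSketch {
      public:
        explicit DynInstSketch(const StaticInstSketch *si) : staticInst(si) {}
        bool isBranch() const {
            return staticInst->hasFlag(StaticInstSketch::IsBranch);
        }
      private:
        const StaticInstSketch *staticInst;
    };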


# 11657:5fad5a37d6fc 04-Oct-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Added method to compute the actual workgroup size
This patch adds a method to the Wavefront class to compute the actual workgroup
size, which can differ from the maximum workgroup size specified when launching
the kernel through the NDRange object. The current solution is still not
optimal: the computation is repeated for each wavefront, and the dispatcher
also needs this information but cannot call Wavefront::computeActualWgSz before
the wavefronts are created. A long-term solution would be a Workgroup class
that deals with all these details.
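
A sketch of the per-dimension computation described above, with illustrative
field names rather than the model's real NDRange layout:

    #include <algorithm>
    #include <cstdint>

    // The actual workgroup size in each dimension is the requested maximum
    // clipped against whatever is left of the grid in that dimension.
    struct NdRangeSketch {
        uint32_t gridSize[3];   // total work-items per dimension
        uint32_t wgSize[3];     // requested (maximum) workgroup size
        uint32_t wgId[3];       // this workgroup's id per dimension
    };

    inline void
    computeActualWgSzSketch(const NdRangeSketch &ndr, uint32_t actualWgSz[3])
    {
        for (int d = 0; d < 3; ++d) {
            uint32_t consumed = ndr.wgId[d] * ndr.wgSize[d];
            uint32_t remaining = ndr.gridSize[d] > consumed ?
                ndr.gridSize[d] - consumed : 0;
            actualWgSz[d] = std::min(ndr.wgSize[d], remaining);
        }
    }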


# 11644:d426728892fe 16-Sep-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Adding context serialization methods to Wavefront
This patch adds methods to serialize the context of a particular wavefront
to the simulated system memory. Context serialization is used when a wavefront
is preempted (i.e., context switched).


# 11643:42a1873be45c 16-Sep-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Refactoring Wavefront::dynWaveId


# 11641:a9f0711e7230 16-Sep-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Changing reconvergenceStack type
std::stack has no iterators, so the reconvergence stack can't be iterated
without popping elements off. We use std::list instead so the stack can be
iterated for saving and restoring purposes.
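
A sketch of why std::list fits here: stack discipline is kept at the back of
the list, but the container can still be walked when saving or restoring
context. All names below are illustrative:

    #include <cstdint>
    #include <list>
    #include <vector>

    struct ReconvergenceEntrySketch {
        uint64_t pc;
        uint64_t reconvergencePc;
        uint64_t execMask;
    };

    class WavefrontStackSketch {
      public:
        void pushEntry(const ReconvergenceEntrySketch &e) { stack.push_back(e); }
        void popEntry()                                   { stack.pop_back(); }

        // Possible only because std::list is iterable; std::stack would have
        // to be drained (and rebuilt) to visit its elements.
        std::vector<ReconvergenceEntrySketch> snapshot() const {
            return std::vector<ReconvergenceEntrySketch>(stack.begin(),
                                                         stack.end());
        }

      private:
        std::list<ReconvergenceEntrySketch> stack;
    };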


# 11640:aa846ec8cd8d 16-Sep-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Adding ioctl for HW context size
Adding runtime support for determining the memory required by a SIMD engine
when executing a particular wavefront.


# 11639:2e8d4bd8108d 16-Sep-2016 Alexandru Dutu <alexandru.dutu@amd.com>

gpu-compute: Wavefront refactoring
Renaming members of the Wavefront class in accordance with the style guide.


# 11534:7106f550afad 09-Jun-2016 jkalamat <john.kalamatianos@amd.com>

gpu-compute: parametrize Wavefront size

Eliminated the VSZ constant that defined the Wavefront size (in number of work
items) and replaced it with a parameter in the GPU.py configuration script.
Changed all data structures dependent on the Wavefront size to be dynamically
sized. Legal values of the Wavefront size are 16, 32, and 64 for now, and are
checked at initialization time.
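
A sketch of the initialization-time check and of sizing per-lane state from
the configured value instead of a compile-time VSZ; class and member names
are illustrative:

    #include <cstdint>
    #include <stdexcept>
    #include <vector>

    class WavefrontSizeSketch {
      public:
        explicit WavefrontSizeSketch(uint32_t wfSize)
            : wfSize(wfSize)
        {
            // Only 16, 32, or 64 lanes are accepted, checked at construction.
            if (wfSize != 16 && wfSize != 32 && wfSize != 64)
                throw std::invalid_argument("wavefront size must be 16, 32 or 64");
            // Per-work-item structures are sized dynamically from the parameter.
            laneValid.resize(wfSize, false);
        }

      private:
        uint32_t wfSize;
        std::vector<bool> laneValid;
    };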


# 11523:81332eb10367 06-Jun-2016 David Guillen Fandos <david.guillen@arm.com>

stats: Fixing regStats function for some SimObjects

Fixing an issue with regStats not calling the parent class method
for most SimObjects in gem5. This causes problems if one adds new
stats in the base class, since they are never initialized properly.

Change-Id: Iebc5aa66f58816ef4295dc8e48a357558d76a77c
Reviewed-by: Andreas Sandberg <andreas.sandberg@arm.com>
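
A sketch of the pattern being fixed, with illustrative class names: a derived
object's regStats() must call the base class's regStats() so that base-class
stats are registered too.

    // Illustrative only; not the real SimObject hierarchy.
    class BaseObjectSketch {
      public:
        virtual ~BaseObjectSketch() = default;
        virtual void regStats() { /* registers base-class stats */ }
    };

    class DerivedObjectSketch : public BaseObjectSketch {
      public:
        void regStats() override {
            BaseObjectSketch::regStats();  // the missing call the patch adds
            /* register derived-class stats here */
        }
    };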


# 11345:b6a66a90e0a1 18-Feb-2016 John Kalamatianos <john.kalamatianos@amd.com>

gpu: fix bugs with MemFence, Flat Instrs and Resource utilization

Memory Fence instructions are now flagged as Global Memory only to avoid
oversubscribing resources.
Flat instructions now check whether the Shared Memory resource is busy to
avoid oversubscribing resources.
All WaitClass resources now use cycles (not ticks) to register the number
of pipe stages between Scoreboard and Execute, to be consistent with the
instruction scheduling logic, which has always used clock cycles.


# 11308:7d8836fd043d 19-Jan-2016 Tony Gutierrez <anthony.gutierrez@amd.com>

gpu-compute: AMD's baseline GPU model