• Tor Lillqvist's avatar
    Split formula group for OpenCL up into smaller bits when necessary · d6a5aac0
    Tor Lillqvist yazdı
    Will make it less demanding on low-end hardware, where the device
    driver is unresponsive for too long when an OpenCL kernel handling lots
    of data is executing. This makes Windows restart the driver which is
    problematic.
    
    I tried several approaches of splitting, both at higher levels in sc
    and at the lowest level just before creating and executing the OpenCL
    kernel(s). This seems to be the most minimal and local approach. Doing
    it at the lower level would have required too much poking into our
    obscure OpenCL code, like passing an offset parameter to every kernel.
    
    Use a simple heuristic to find out whether to split. On the
    problematic low-end devices, CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT is
    4, while for more performant devices it is 1 or 8.
    d6a5aac0
Adı
Son kayıt (commit)
Son güncelleme
..
clew.h Loading commit data...