Since NVIDIA is apparently investing plenty on these new "tensor cores" in their new architectures (Volta and Turing)... is ACEMD planning on leveraging these? IIRC, these Tensor Cores can do half-precision calculations pretty fast.
|ID: 50636 | Rating: 0 | rate: / Reply Quote|
Turing is in many ways very interesting. It would be a huge performance leap if we can use the Tensor cores for something useful, but there is more:
First, the Turing SM adds a new independent integer datapath that can execute instructions concurrently with the floating-point math datapath. In previous generations, executing these instructions would have blocked floating-point instructions from issuing.
Simultanious execution of integer and floating-point operations could improve performance a lot. Also, FP16 is now supported at double the performance on all cores, so not only Tensor but also regular CUDA.
GeForce RTX 2080 Ti for example:
14.2 TFLOPS of peak single precision (FP32) performance
|ID: 50637 | Rating: 0 | rate: / Reply Quote|