intrin_mfma_f32_32x32x8bf16_1k< 32, 32 > Struct Reference

intrin_mfma_f32_32x32x8bf16_1k&lt; 32, 32 &gt; Struct Reference#

Composable Kernel: ck::intrin_mfma_f32_32x32x8bf16_1k< 32, 32 > Struct Reference
ck::intrin_mfma_f32_32x32x8bf16_1k< 32, 32 > Struct Reference

#include <amd_xdlops.hpp>

Static Public Member Functions

template<class FloatC>
static __device__ void Run (const bhalf4_t &reg_a, const bhalf4_t &reg_b, FloatC &reg_c)

Member Function Documentation

◆ Run()

template<class FloatC>
__device__ void ck::intrin_mfma_f32_32x32x8bf16_1k< 32, 32 >::Run ( const bhalf4_t & reg_a,
const bhalf4_t & reg_b,
FloatC & reg_c )
inlinestatic

The documentation for this struct was generated from the following file: