fp16c

package

v0.0.0-...-3878f85 Published: Jul 23, 2017 License: MIT Imports: 1 Imported by: 0

Repository

github.com/klauspost/intrinsics

Documentation ¶

Overview ¶

THESE PACKAGES ARE FOR DEMONSTRATION PURPOSES ONLY!

THEY DO NOT CONTAIN WORKING INTRINSICS!

See https://github.com/klauspost/intrinsics

Index ¶

func CvtphPs(a x86.M128i) (dst x86.M128)
func CvtpsPh(a x86.M128, rounding int) (dst x86.M128i)
func M256CvtphPs(a x86.M128i) (dst x86.M256)
func M256CvtpsPh(a x86.M256, rounding int) (dst x86.M128i)

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func CvtphPs ¶

func CvtphPs(a x86.M128i) (dst x86.M128)

CvtphPs: Convert packed half-precision (16-bit) floating-point elements in 'a' to packed single-precision (32-bit) floating-point elements, and store the results in 'dst'.

FOR j := 0 to 3
	i := j*32
	m := j*16
	dst[i+31:i] := Convert_FP16_To_FP32(a[m+15:m])
ENDFOR
dst[MAX:128] := 0

Instruction: 'VCVTPH2PS'. Intrinsic: '_mm_cvtph_ps'. Requires FP16C.
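
Because this package contains no working intrinsic, the loop above can only be tried out in software. The following is a minimal sketch under stated assumptions: the names halfToFloat32 and cvtphPs are hypothetical, the four input lanes are modelled as a plain [4]uint16 instead of x86.M128i, and NaN payloads are simply widened, which may differ in detail from what the instruction does.

```go
package main

import (
	"fmt"
	"math"
)

// halfToFloat32 decodes one IEEE 754 binary16 bit pattern into a float32.
// Hypothetical helper: a software stand-in for Convert_FP16_To_FP32.
func halfToFloat32(h uint16) float32 {
	sign := uint32(h&0x8000) << 16
	exp := uint32(h>>10) & 0x1F
	frac := uint32(h & 0x03FF)

	switch {
	case exp == 0x1F: // infinity or NaN: widen the exponent, keep the payload
		return math.Float32frombits(sign | 0x7F800000 | frac<<13)
	case exp == 0 && frac == 0: // signed zero
		return math.Float32frombits(sign)
	case exp == 0: // subnormal half: shift until the leading 1 reaches bit 10
		e := uint32(127 - 15 + 1)
		for frac&0x0400 == 0 {
			frac <<= 1
			e--
		}
		return math.Float32frombits(sign | e<<23 | (frac&0x03FF)<<13)
	default: // normal number: rebias the exponent from 15 to 127
		return math.Float32frombits(sign | (exp+127-15)<<23 | frac<<13)
	}
}

// cvtphPs mirrors the FOR j := 0 to 3 loop: one conversion per lane.
// Assumption: lanes are passed as [4]uint16, not as x86.M128i.
func cvtphPs(a [4]uint16) (dst [4]float32) {
	for j := range a {
		dst[j] = halfToFloat32(a[j])
	}
	return dst
}

func main() {
	// 1.0, -2.0, the largest finite half, and the smallest subnormal half.
	fmt.Println(cvtphPs([4]uint16{0x3C00, 0xC000, 0x7BFF, 0x0001}))
}
```

Every binary16 value is exactly representable as a float32, so this direction needs no rounding parameter.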

func CvtpsPh ¶

func CvtpsPh(a x86.M128, rounding int) (dst x86.M128i)

CvtpsPh: Convert packed single-precision (32-bit) floating-point elements in 'a' to packed half-precision (16-bit) floating-point elements, and store the results in 'dst'.

Rounding is done according to the 'rounding' parameter, which can be one of:

    (_MM_FROUND_TO_NEAREST_INT |_MM_FROUND_NO_EXC) // round to nearest, and suppress exceptions
    (_MM_FROUND_TO_NEG_INF |_MM_FROUND_NO_EXC)     // round down, and suppress exceptions
    (_MM_FROUND_TO_POS_INF |_MM_FROUND_NO_EXC)     // round up, and suppress exceptions
    (_MM_FROUND_TO_ZERO |_MM_FROUND_NO_EXC)        // truncate, and suppress exceptions
    _MM_FROUND_CUR_DIRECTION // use MXCSR.RC; see _MM_SET_ROUNDING_MODE

FOR j := 0 to 3
	i := 16*j
	l := 32*j
	dst[i+15:i] := Convert_FP32_To_FP16(a[l+31:l])
ENDFOR
dst[MAX:128] := 0

Instruction: 'VCVTPS2PH'. Intrinsic: '_mm_cvtps_ph'. Requires FP16C.
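
The narrowing direction loses precision, so the conversion helper in the pseudocode above has to round. The following is a sketch of one software emulation, assuming round-to-nearest-even only (the behaviour selected by _MM_FROUND_TO_NEAREST_INT). The names float32ToHalf and cvtpsPh are hypothetical, the lanes are modelled as plain Go arrays instead of x86.M128 and x86.M128i, and no floating-point exception flags are tracked.

```go
package main

import (
	"fmt"
	"math"
)

// float32ToHalf encodes f as an IEEE 754 binary16 bit pattern using
// round-to-nearest-even. Hypothetical helper, not part of this package.
func float32ToHalf(f float32) uint16 {
	bits := math.Float32bits(f)
	sign := uint16(bits>>16) & 0x8000
	exp := int32(bits>>23) & 0xFF
	frac := bits & 0x007FFFFF

	if exp == 0xFF { // infinity or NaN
		if frac == 0 {
			return sign | 0x7C00
		}
		return sign | 0x7E00 | uint16(frac>>13) // quiet NaN, top payload bits kept
	}

	e := exp - 127 + 15 // rebias the exponent from 127 to 15
	if e >= 0x1F {
		return sign | 0x7C00 // magnitude too large: rounds to infinity
	}
	if e <= 0 { // result is a subnormal half or zero
		if e < -10 {
			return sign // below half of the smallest subnormal: rounds to zero
		}
		mant := frac | 0x00800000 // restore the implicit leading 1
		shift := uint32(14 - e)
		half := mant >> shift
		rem := mant & (uint32(1)<<shift - 1)
		halfway := uint32(1) << (shift - 1)
		if rem > halfway || (rem == halfway && half&1 == 1) {
			half++ // a carry here yields the smallest normal half
		}
		return sign | uint16(half)
	}

	half := uint32(e)<<10 | frac>>13
	rem := frac & 0x1FFF
	if rem > 0x1000 || (rem == 0x1000 && half&1 == 1) {
		half++ // a carry into the exponent yields infinity on overflow
	}
	return sign | uint16(half)
}

// cvtpsPh mirrors the FOR j := 0 to 3 loop: one conversion per lane.
func cvtpsPh(a [4]float32) (dst [4]uint16) {
	for j := range a {
		dst[j] = float32ToHalf(a[j])
	}
	return dst
}

func main() {
	for _, h := range cvtpsPh([4]float32{1, -2, 0.1, 65520}) {
		fmt.Printf("%#04x\n", h)
	}
}
```

The value 65520 lies exactly halfway between the largest finite half (65504) and the next step, so ties-to-even carries it to infinity (0x7C00).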

func M256CvtphPs ¶

func M256CvtphPs(a x86.M128i) (dst x86.M256)

M256CvtphPs: Convert packed half-precision (16-bit) floating-point elements in 'a' to packed single-precision (32-bit) floating-point elements, and store the results in 'dst'.

FOR j := 0 to 7
	i := j*32
	m := j*16
	dst[i+31:i] := Convert_FP16_To_FP32(a[m+15:m])
ENDFOR
dst[MAX:256] := 0

Instruction: 'VCVTPH2PS'. Intrinsic: '_mm256_cvtph_ps'. Requires FP16C.
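
The indices i and m in the pseudocode are bit offsets into the registers. The sketch below makes that explicit by treating the 128-bit source as 16 raw little-endian bytes, which is an assumption made for illustration and not the layout of x86.M128i. The names m256CvtphPs and halfToFloat32 are hypothetical, and NaN payloads are not preserved.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"math"
)

// halfToFloat32 decodes a binary16 bit pattern arithmetically.
// Hypothetical helper; every finite half is exact in float32.
func halfToFloat32(h uint16) float32 {
	sign := float32(1)
	if h&0x8000 != 0 {
		sign = -1
	}
	exp := int(h>>10) & 0x1F
	frac := float64(h & 0x03FF)
	switch exp {
	case 0: // zero or subnormal: frac * 2^-24
		return sign * float32(math.Ldexp(frac, -24))
	case 0x1F: // infinity or NaN (payload dropped in this sketch)
		if frac != 0 {
			return float32(math.NaN())
		}
		return sign * float32(math.Inf(1))
	default: // normal: (1 + frac/1024) * 2^(exp-15)
		return sign * float32(math.Ldexp(1024+frac, exp-25))
	}
}

// m256CvtphPs reads eight 16-bit lanes from a 128-bit source.
// Assumption: the source is modelled as 16 little-endian bytes.
func m256CvtphPs(a [16]byte) (dst [8]float32) {
	for j := 0; j < 8; j++ {
		m := j * 16 // bit offset of source lane j, as in the pseudocode
		h := binary.LittleEndian.Uint16(a[m/8:])
		dst[j] = halfToFloat32(h) // lane j occupies bits j*32+31:j*32 of dst
	}
	return dst
}

func main() {
	var a [16]byte
	binary.LittleEndian.PutUint16(a[0:], 0x3C00)  // lane 0 = 1.0
	binary.LittleEndian.PutUint16(a[14:], 0xC000) // lane 7 = -2.0
	fmt.Println(m256CvtphPs(a))
}
```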

func M256CvtpsPh ¶

func M256CvtpsPh(a x86.M256, rounding int) (dst x86.M128i)

M256CvtpsPh: Convert packed single-precision (32-bit) floating-point elements in 'a' to packed half-precision (16-bit) floating-point elements, and store the results in 'dst'.

Rounding is done according to the 'rounding' parameter, which can be one of:

    (_MM_FROUND_TO_NEAREST_INT |_MM_FROUND_NO_EXC) // round to nearest, and suppress exceptions
    (_MM_FROUND_TO_NEG_INF |_MM_FROUND_NO_EXC)     // round down, and suppress exceptions
    (_MM_FROUND_TO_POS_INF |_MM_FROUND_NO_EXC)     // round up, and suppress exceptions
    (_MM_FROUND_TO_ZERO |_MM_FROUND_NO_EXC)        // truncate, and suppress exceptions
    _MM_FROUND_CUR_DIRECTION // use MXCSR.RC; see _MM_SET_ROUNDING_MODE

FOR j := 0 to 7
	i := 16*j
	l := 32*j
	dst[i+15:i] := Convert_FP32_To_FP16(a[l+31:l])
ENDFOR
dst[MAX:128] := 0

Instruction: 'VCVTPS2PH'. Intrinsic: '_mm256_cvtps_ph'. Requires FP16C.
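
To see what the 'rounding' parameter changes, the sketch below emulates the four explicit rounding directions in software. The Go constant names are hypothetical; their numeric values mirror the C macros in Intel's headers. Two simplifications are assumptions of this sketch: _MM_FROUND_CUR_DIRECTION is treated as round-to-nearest because MXCSR cannot be read here, and _MM_FROUND_NO_EXC is ignored because no exception flags are modelled.

```go
package main

import (
	"fmt"
	"math"
)

// Hypothetical Go names for the C rounding macros (same numeric values).
const (
	froundToNearestInt = 0x00
	froundToNegInf     = 0x01
	froundToPosInf     = 0x02
	froundToZero       = 0x03
	froundCurDirection = 0x04
	froundNoExc        = 0x08
)

// float32ToHalf encodes f as binary16 using the requested rounding mode.
func float32ToHalf(f float32, rounding int) uint16 {
	bits := math.Float32bits(f)
	sign := uint16(bits>>16) & 0x8000
	exp := int(bits>>23) & 0xFF
	frac := bits & 0x007FFFFF
	if exp == 0xFF { // infinity or NaN: nothing to round
		if frac == 0 {
			return sign | 0x7C00
		}
		return sign | 0x7E00 | uint16(frac>>13)
	}

	// mag is the magnitude truncated toward zero; the flags describe
	// the discarded low bits.
	var mag uint16
	var inexact, aboveHalf, tie bool
	e := exp - 127 + 15
	switch {
	case e >= 0x1F: // beyond the half range: start from the largest finite half
		mag, inexact, aboveHalf = 0x7BFF, true, true
	case e < -10: // zero, or below half of the smallest subnormal
		inexact = exp != 0 || frac != 0
	default:
		mant := frac | 0x00800000 // restore the implicit leading 1
		shift, base := uint(13), uint32(0)
		if e >= 1 {
			base = uint32(e-1) << 10 // bit 10 of mant>>13 supplies the final +1
		} else {
			shift = uint(14 - e) // subnormal result
		}
		rem := mant & (uint32(1)<<shift - 1)
		halfway := uint32(1) << (shift - 1)
		mag = uint16(base + mant>>shift)
		inexact, aboveHalf, tie = rem != 0, rem > halfway, rem == halfway
	}

	up := false
	switch rounding &^ froundNoExc {
	case froundToNegInf:
		up = inexact && sign != 0
	case froundToPosInf:
		up = inexact && sign == 0
	case froundToZero:
		// truncation: keep mag as is
	default: // nearest-even; also used for froundCurDirection (assumption)
		up = aboveHalf || (tie && mag&1 == 1)
	}
	if up {
		mag++ // 0x7BFF + 1 = 0x7C00, so overflow becomes infinity
	}
	return sign | mag
}

// m256CvtpsPh mirrors the FOR j := 0 to 7 loop.
func m256CvtpsPh(a [8]float32, rounding int) (dst [8]uint16) {
	for j := range a {
		dst[j] = float32ToHalf(a[j], rounding)
	}
	return dst
}

func main() {
	a := [8]float32{0.1, -0.1, 1, 65520}
	fmt.Printf("%#04x\n", m256CvtpsPh(a, froundToNearestInt|froundNoExc))
	fmt.Printf("%#04x\n", m256CvtpsPh(a, froundToPosInf|froundNoExc))
}
```

In this sketch, 0.1 encodes as 0x2E66 under round-to-nearest and as 0x2E67 when rounding up, because 0.1 is not exactly representable in half precision.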

Types ¶

This section is empty.

Source Files ¶

fp16c.go