
 <!DOCTYPE html>

 <html>
 <head>
 <meta charset="UTF-8">
 <link href="style.css" type="text/css" rel="stylesheet">
 <title>CVTPD2PS—Convert Packed Double-Precision FP Values to Packed Single-Precision FP Values </title></head>
 <body>
 <h1>CVTPD2PS—Convert Packed Double-Precision FP Values to Packed Single-Precision FP Values</h1>
 <table>
 <tr>
 <th>Opcode/Instruction</th>
 <th>Op/En</th>
 <th>64/32-bit Mode</th>
 <th>CPUID Feature Flag</th>
 <th>Description</th></tr>
 <tr>
 <td>
 <p>66 0F 5A /<em>r</em></p>
 <p>CVTPD2PS <em>xmm1</em>, <em>xmm2/m128</em></p></td>
 <td>RM</td>
 <td>V/V</td>
 <td>SSE2</td>
 <td>Convert two packed double-precision floating-point values in <em>xmm2/m128</em> to two packed single-precision floating-point values in <em>xmm1</em>.</td></tr>
 <tr>
 <td>
 <p>VEX.128.66.0F.WIG 5A /r</p>
 <p>VCVTPD2PS xmm1, xmm2/m128</p></td>
 <td>RM</td>
 <td>V/V</td>
 <td>AVX</td>
 <td>Convert two packed double-precision floating-point values in xmm2/mem to two single-precision floating-point values in xmm1.</td></tr>
 <tr>
 <td>
 <p>VEX.256.66.0F.WIG 5A /r</p>
 <p>VCVTPD2PS xmm1, ymm2/m256</p></td>
 <td>RM</td>
 <td>V/V</td>
 <td>AVX</td>
 <td>Convert four packed double-precision floating-point values in ymm2/mem to four single-precision floating-point values in xmm1.</td></tr></table>
 <h3>Instruction Operand Encoding</h3>
 <table>
 <tr>
 <td>Op/En</td>
 <td>Operand 1</td>
 <td>Operand 2</td>
 <td>Operand 3</td>
 <td>Operand 4</td></tr>
 <tr>
 <td>RM</td>
 <td>ModRM:reg (w)</td>
 <td>ModRM:r/m (r)</td>
 <td>NA</td>
 <td>NA</td></tr></table>
 <h2>Description</h2>
 <p>Converts two packed double-precision floating-point values (four for the VEX.256 encoded version) in the source operand (second operand) to the same number of packed single-precision floating-point values in the destination operand (first operand).</p>
 <p>When a conversion is inexact, the value returned is rounded according to the rounding control bits in the MXCSR register.</p>
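 <p>As a rough illustration of the rounding behavior described above, the following C sketch converts the same value under a caller-chosen MXCSR rounding mode. It assumes an x86 target with SSE2 and a compiler that provides the <code>_MM_SET_ROUNDING_MODE</code> helpers in <code>&lt;xmmintrin.h&gt;</code>. The function name <code>narrow_with_mode</code> is hypothetical and not part of any Intel API.</p>

```c
#include <xmmintrin.h>  /* _MM_GET_ROUNDING_MODE, _MM_SET_ROUNDING_MODE, _mm_cvtss_f32 */
#include <emmintrin.h>  /* SSE2: _mm_set_pd, _mm_cvtpd_ps */

/* Hypothetical helper (assumption, not an Intel API): convert x to float
 * with CVTPD2PS while the MXCSR rounding control is set to `mode`
 * (_MM_ROUND_NEAREST, _MM_ROUND_DOWN, _MM_ROUND_UP or _MM_ROUND_TOWARD_ZERO).
 * The previous rounding mode is restored before returning. */
float narrow_with_mode(double x, unsigned int mode)
{
    /* volatile keeps the conversion between the two MXCSR updates */
    volatile double in[2] = { x, x };
    volatile float out;

    unsigned int saved = _MM_GET_ROUNDING_MODE();
    _MM_SET_ROUNDING_MODE(mode);

    /* Both double lanes hold x; only the low float lane is read back. */
    __m128 r = _mm_cvtpd_ps(_mm_set_pd(in[1], in[0]));
    out = _mm_cvtss_f32(r);

    _MM_SET_ROUNDING_MODE(saved);
    return out;
}
```

 <p>For example, <code>narrow_with_mode(0.1, _MM_ROUND_UP)</code> should yield a float slightly above 0.1, while <code>_MM_ROUND_DOWN</code> should yield one slightly below it. A value that is exactly representable in single precision, such as 0.5, is unaffected by the rounding mode.</p>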
 <p>In 64-bit mode, use of the REX.R prefix permits this instruction to access additional registers (XMM8-XMM15).</p>
 <p>128-bit Legacy SSE version: The source operand is an XMM register or 128-bit memory location. The destination operand is an XMM register. Bits (127:64) of the destination XMM register are zeroed. However, the upper bits (VLMAX-1:128) of the corresponding YMM register destination are unmodified.</p>
 <p>VEX.128 encoded version: The source operand is an XMM register or 128-bit memory location. The destination operand is an XMM register. The upper bits (VLMAX-1:64) of the corresponding YMM register destination are zeroed.</p>
 <p>VEX.256 encoded version: The source operand is a YMM register or 256-bit memory location. The destination operand is an XMM register. The upper bits (255:128) of the corresponding YMM register destination are zeroed.</p>
 <p>Note: In VEX-encoded versions, VEX.vvvv is reserved and must be 1111b; otherwise the instruction will #UD.</p>
 <p>[Figure: SRC holds four double-precision values X3, X2, X1, X0. DEST holds 0 in its upper half and the four converted single-precision values X3, X2, X1, X0 in its lower half.]</p>
 <h3>Figure 3-12. VCVTPD2PS (VEX.256 encoded version)</h3>
 <h2>Operation</h2>
 <p><strong>CVTPD2PS (128-bit Legacy SSE version)</strong></p>
 <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0])
 DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64])
 DEST[127:64] ← 0
 DEST[VLMAX-1:128] (unmodified)</pre>
 <p><strong>VCVTPD2PS (VEX.128 encoded version)</strong></p>
 <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0])
 DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64])
 DEST[VLMAX-1:64] ← 0</pre>
 <p><strong>VCVTPD2PS (VEX.256 encoded version)</strong></p>
 <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0])
 DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64])
 DEST[95:64] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[191:128])
 DEST[127:96] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[255:192])
 DEST[255:128] ← 0</pre>
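 <p>The pseudocode above can be mirrored by a small scalar model in portable C. This is only a sketch: the name <code>cvtpd2ps_model</code> is hypothetical, the model covers only the low 128 bits of the destination, and it does not capture the difference between the legacy SSE form (upper YMM bits unmodified) and the VEX forms (upper YMM bits zeroed). Each conversion uses the C cast, which rounds according to the current floating-point environment.</p>

```c
#include <stddef.h>  /* size_t */

/* Hypothetical reference model (assumption, not an Intel API).
 * src  : n double-precision inputs, n = 2 for the 128-bit forms,
 *        n = 4 for the VEX.256 form.
 * dest : the low 128 bits of the destination, viewed as four floats.
 * Lanes below n receive the converted values; remaining lanes are zeroed,
 * matching DEST[127:64] <- 0 in the two-element forms. */
void cvtpd2ps_model(const double *src, size_t n, float dest[4])
{
    for (size_t i = 0; i < 4; i++) {
        dest[i] = (i < n) ? (float)src[i] : 0.0f;
    }
}
```

 <p>With <code>n = 2</code> and inputs 1.5 and -2.25, the model writes 1.5, -2.25, 0, 0. With <code>n = 4</code>, all four lanes carry converted values.</p>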
 <h2>Intel C/C++ Compiler Intrinsic Equivalent</h2>
 <p>CVTPD2PS:</p>
 <p>__m128 _mm_cvtpd_ps(__m128d a)</p>
 <p>VCVTPD2PS:</p>
 <p>__m128 _mm256_cvtpd_ps(__m256d a)</p>
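 <p>A minimal usage sketch for the 128-bit intrinsic follows. It assumes an x86 target with SSE2 and <code>&lt;emmintrin.h&gt;</code>; the wrapper name <code>narrow_two_doubles</code> is hypothetical. Unaligned loads and stores are used so the caller's arrays need no special alignment.</p>

```c
#include <emmintrin.h>  /* SSE2: _mm_loadu_pd, _mm_cvtpd_ps (also pulls in _mm_storeu_ps) */

/* Hypothetical wrapper (assumption, not an Intel API): convert two doubles
 * to floats with _mm_cvtpd_ps and store the full 128-bit result. */
void narrow_two_doubles(const double src[2], float dst[4])
{
    __m128d in  = _mm_loadu_pd(src);  /* load two doubles, no alignment required */
    __m128  out = _mm_cvtpd_ps(in);   /* lanes 0 and 1 converted, lanes 2 and 3 zero */
    _mm_storeu_ps(dst, out);          /* write all four float lanes */
}
```

 <p>The 256-bit intrinsic <code>_mm256_cvtpd_ps</code> is used the same way with a <code>__m256d</code> input, but it needs AVX support and <code>&lt;immintrin.h&gt;</code>, and the compiler typically has to be told to enable AVX code generation.</p>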
 <h2>SIMD Floating-Point Exceptions</h2>
 <p>Overflow, Underflow, Invalid, Precision, Denormal.</p>
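 <p>To see which of these conditions a given input triggers, one can inspect the sticky exception flags in MXCSR after the conversion. The sketch below assumes an x86 target with SSE2, the <code>_MM_SET_EXCEPTION_STATE</code> and <code>_MM_GET_EXCEPTION_STATE</code> helpers from <code>&lt;xmmintrin.h&gt;</code>, and the default state in which all SIMD floating-point exceptions are masked. The function name <code>cvtpd2ps_flags</code> is hypothetical.</p>

```c
#include <xmmintrin.h>  /* _MM_SET_EXCEPTION_STATE, _MM_GET_EXCEPTION_STATE, _MM_EXCEPT_* */
#include <emmintrin.h>  /* SSE2: _mm_set1_pd, _mm_cvtpd_ps */

/* Hypothetical helper (assumption, not an Intel API): clear the MXCSR
 * exception flags, convert x with CVTPD2PS, and return the flags that the
 * conversion set, as a combination of the _MM_EXCEPT_* bits. */
unsigned int cvtpd2ps_flags(double x)
{
    /* volatile keeps the conversion between the flag clear and the flag read */
    volatile double in = x;
    volatile float out;

    _MM_SET_EXCEPTION_STATE(0);  /* clear all sticky exception flags */
    out = _mm_cvtss_f32(_mm_cvtpd_ps(_mm_set1_pd(in)));
    return _MM_GET_EXCEPTION_STATE();
}
```

 <p>Under these assumptions, an exactly representable input such as 0.5 should leave no flags set, 0.1 should set only the precision (inexact) flag, 1e300 should set the overflow flag, and 1e-50 should set the underflow flag.</p>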
 <h2>Other Exceptions</h2>
 <p>See Exceptions Type 2; additionally:</p>
 <table class="exception-table">
 <tr>
 <td>#UD</td>
 <td>If VEX.vvvv ≠ 1111B.</td></tr></table></body></html>