| <!DOCTYPE html> |
| |
| <html> |
| <head> |
| <meta charset="UTF-8"> |
| <link href="style.css" type="text/css" rel="stylesheet"> |
| <title>CVTPD2PS—Convert Packed Double-Precision FP Values to Packed Single-Precision FP Values </title></head> |
| <body> |
| <h1>CVTPD2PS—Convert Packed Double-Precision FP Values to Packed Single-Precision FP Values</h1> |
| <table> |
| <tr> |
| <th>Opcode/Instruction</th> |
| <th>Op/En</th> |
| <th>64/32-bit Mode</th> |
| <th>CPUID Feature Flag</th> |
| <th>Description</th></tr> |
| <tr> |
| <td> |
| <p>66 0F 5A /<em>r</em></p> |
| <p>CVTPD2PS <em>xmm1</em>, <em>xmm2/m128</em></p></td> |
| <td>RM</td> |
| <td>V/V</td> |
| <td>SSE2</td> |
| <td>Convert two packed double-precision floating-point values in <em>xmm2/m128 </em>to two packed single-precision floating-point values in <em>xmm1</em>.</td></tr> |
| <tr> |
| <td> |
| <p>VEX.128.66.0F.WIG 5A /r</p> |
| <p>VCVTPD2PS xmm1, xmm2/m128</p></td> |
| <td>RM</td> |
| <td>V/V</td> |
| <td>AVX</td> |
| <td>Convert two packed double-precision floating-point values in xmm2/mem to two single-precision floating-point values in xmm1.</td></tr> |
| <tr> |
| <td> |
| <p>VEX.256.66.0F.WIG 5A /r</p> |
| <p>VCVTPD2PS xmm1, ymm2/m256</p></td> |
| <td>RM</td> |
| <td>V/V</td> |
| <td>AVX</td> |
| <td>Convert four packed double-precision floating-point values in ymm2/mem to four single-precision floating-point values in xmm1.</td></tr></table> |
| <h3>Instruction Operand Encoding</h3> |
| <table> |
| <tr> |
| <td>Op/En</td> |
| <td>Operand 1</td> |
| <td>Operand 2</td> |
| <td>Operand 3</td> |
| <td>Operand 4</td></tr> |
| <tr> |
| <td>RM</td> |
| <td>ModRM:reg (w)</td> |
| <td>ModRM:r/m (r)</td> |
| <td>NA</td> |
| <td>NA</td></tr></table> |
| <h2>Description</h2> |
| <p>Converts two packed double-precision floating-point values in the source operand (second operand) to two packed single-precision floating-point values in the destination operand (first operand).</p> |
| <p>When a conversion is inexact, the value returned is rounded according to the rounding control bits in the MXCSR register.</p> |
| <p>In 64-bit mode, use of the REX.R prefix permits this instruction to access additional registers (XMM8-XMM15).</p> |
| <p>128-bit Legacy SSE version: The source operand is an XMM register or 128- bit memory location. The destination operation is an XMM register. Bits[127:64] of the destination XMM register are zeroed. However, the upper bits (VLMAX-1:128) of the corresponding YMM register destination are unmodified.</p> |
| <p>VEX.128 encoded version: The source operand is an XMM register or 128- bit memory location. The destination operation is a YMM register. The upper bits (VLMAX-1:64) of the corresponding YMM register destination are zeroed.</p> |
| <p>VEX.256 encoded version: The source operand is a YMM register or 256- bit memory location. The destination operation is an XMM register. The upper bits (255:128) of the corresponding YMM register destination are zeroed.</p> |
| <p>Note: In VEX-encoded versions, VEX.vvvv is reserved and must be 1111b otherwise instructions will #UD.</p> |
| <p>SRC</p> |
| <p>X3</p> |
| <p>X2</p> |
| <p>X1</p> |
| <p>X0</p> |
| <p>DEST</p> |
| <p>0</p> |
| <p>X3</p> |
| <p>X2</p> |
| <p>X1</p> |
| <p>X0</p> |
| <h3>Figure 3-12. VCVTPD2PS (VEX.256 encoded version)</h3> |
| <h2>Operation</h2> |
| <p><strong>CVTPD2PS (128-bit Legacy SSE version)</strong></p> |
| <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0]) |
| DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64]) |
| DEST[127:64] ← 0 |
| DEST[VLMAX-1:128] (unmodified)</pre> |
| <p><strong>VCVTPD2PS (VEX.128 encoded version)</strong></p> |
| <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0]) |
| DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64]) |
| DEST[VLMAX-1:64] ← 0</pre> |
| <p><strong>VCVTPD2PS (VEX.256 encoded version)</strong></p> |
| <pre>DEST[31:0] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[63:0]) |
| DEST[63:32] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[127:64]) |
| DEST[95:64] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[191:128]) |
| DEST[127:96] ← Convert_Double_Precision_To_Single_Precision_Floating_Point(SRC[255:192) |
| DEST[255:128]← 0</pre> |
| <h2>Intel C/C++ Compiler Intrinsic Equivalent</h2> |
| <p>CVTPD2PS:</p> |
| <p>__m128 _mm_cvtpd_ps(__m128d a)</p> |
| <p>CVTPD2PS:</p> |
| <p>__m256 _mm256_cvtpd_ps (__m256d a)</p> |
| <h2>SIMD Floating-Point Exceptions</h2> |
| <p>Overflow, Underflow, Invalid, Precision, Denormal.</p> |
| <h2>Other Exceptions</h2> |
| <p>See Exceptions Type 2; additionally</p> |
| <table class="exception-table"> |
| <tr> |
| <td>#UD</td> |
| <td>If VEX.vvvv ≠ 1111B.</td></tr></table></body></html> |