about summary refs log tree commit diff
path: root/manual/platform.texi
blob: 6caf68d796956d7986190f41afb1e8114f2535d4 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
@node Platform, Contributors, Maintenance, Top
@c %MENU% Describe all platform-specific facilities provided
@appendix Platform-specific facilities

@Theglibc{} can provide machine-specific functionality.

@menu
* PowerPC::           Facilities Specific to the PowerPC Architecture
* RISC-V::            Facilities Specific to the RISC-V Architecture
* X86::               Facilities Specific to the X86 Architecture
@end menu

@node PowerPC
@appendixsec PowerPC-specific Facilities

Facilities specific to PowerPC that are not specific to a particular
operating system are declared in @file{sys/platform/ppc.h}.

@deftypefun {uint64_t} __ppc_get_timebase (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Read the current value of the Time Base Register.

The @dfn{Time Base Register} is a 64-bit register that stores a monotonically
incremented value updated at a system-dependent frequency that may be
different from the processor frequency.  More information is available in
@cite{Power ISA 2.06b - Book II - Section 5.2}.

@code{__ppc_get_timebase} uses the processor's time base facility directly
without requiring assistance from the operating system, so it is very
efficient.
@end deftypefun

@deftypefun {uint64_t} __ppc_get_timebase_freq (void)
@safety{@prelim{}@mtunsafe{@mtuinit{}}@asunsafe{@asucorrupt{:init}}@acunsafe{@acucorrupt{:init}}}
@c __ppc_get_timebase_freq=__get_timebase_freq @mtuinit @acsfd
@c  __get_clockfreq @mtuinit @asucorrupt:init @acucorrupt:init @acsfd
@c    the initialization of the static timebase_freq is not exactly
@c    safe, because hp_timing_t cannot be atomically set up.
@c   syscall:get_tbfreq ok
@c   open dup @acsfd
@c   read dup ok
@c   memcpy dup ok
@c   memmem dup ok
@c   close dup @acsfd
Read the current frequency at which the Time Base Register is updated.

This frequency is not related to the processor clock or the bus clock.
It is also possible that this frequency is not constant.  More information is
available in @cite{Power ISA 2.06b - Book II - Section 5.2}.
@end deftypefun

The following functions provide hints about the usage of resources that are
shared with other processors.  They can be used, for example, if a program
waiting on a lock intends to divert the shared resources to be used by other
processors.  More information is available in @cite{Power ISA 2.06b - Book II -
Section 3.2}.

@deftypefun {void} __ppc_yield (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Provide a hint that performance will probably be improved if shared resources
dedicated to the executing processor are released for use by other processors.
@end deftypefun

@deftypefun {void} __ppc_mdoio (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Provide a hint that performance will probably be improved if shared resources
dedicated to the executing processor are released until all outstanding storage
accesses to caching-inhibited storage have been completed.
@end deftypefun

@deftypefun {void} __ppc_mdoom (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Provide a hint that performance will probably be improved if shared resources
dedicated to the executing processor are released until all outstanding storage
accesses to cacheable storage for which the data is not in the cache have been
completed.
@end deftypefun

@deftypefun {void} __ppc_set_ppr_med (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Set the Program Priority Register to medium value (default).

The @dfn{Program Priority Register} (PPR) is a 64-bit register that controls
the program's priority.  By adjusting the PPR value the programmer may
improve system throughput by causing the system resources to be used
more efficiently, especially in contention situations.
The three unprivileged states available are covered by the functions
@code{__ppc_set_ppr_med} (medium -- default), @code{__ppc_set_ppc_low} (low)
and @code{__ppc_set_ppc_med_low} (medium low).  More information
available in @cite{Power ISA 2.06b - Book II - Section 3.1}.
@end deftypefun

@deftypefun {void} __ppc_set_ppr_low (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Set the Program Priority Register to low value.
@end deftypefun

@deftypefun {void} __ppc_set_ppr_med_low (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Set the Program Priority Register to medium low value.
@end deftypefun

Power ISA 2.07 extends the priorities that can be set to the Program Priority
Register (PPR).  The following functions implement the new priority levels:
very low and medium high.

@deftypefun {void} __ppc_set_ppr_very_low (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Set the Program Priority Register to very low value.
@end deftypefun

@deftypefun {void} __ppc_set_ppr_med_high (void)
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Set the Program Priority Register to medium high value.  The medium high
priority is privileged and may only be set during certain time intervals by
problem-state programs.  If the program priority is medium high when the time
interval expires or if an attempt is made to set the priority to medium high
when it is not allowed, the priority is set to medium.
@end deftypefun

@node RISC-V
@appendixsec RISC-V-specific Facilities

Cache management facilities specific to RISC-V systems that implement the Linux
ABI are declared in @file{sys/cachectl.h}.

@deftypefun {void} __riscv_flush_icache (void *@var{start}, void *@var{end}, unsigned long int @var{flags})
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Enforce ordering between stores and instruction cache fetches.  The range of
addresses over which ordering is enforced is specified by @var{start} and
@var{end}.  The @var{flags} argument controls the extent of this ordering, with
the default behavior (a @var{flags} value of 0) being to enforce the fence on
all threads in the current process.  Setting the
@code{SYS_RISCV_FLUSH_ICACHE_LOCAL} bit allows users to indicate that enforcing
ordering on only the current thread is necessary.  All other flag bits are
reserved.
@end deftypefun

@node X86
@appendixsec X86-specific Facilities

Facilities specific to X86 that are not specific to a particular
operating system are declared in @file{sys/platform/x86.h}.

@deftypefun {const struct cpuid_feature *} __x86_get_cpuid_feature_leaf (unsigned int @var{leaf})
@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
Return a pointer to x86 CPU feature structure used by query macros for x86
CPU feature @var{leaf}.
@end deftypefun

@deftypefn Macro int HAS_CPU_FEATURE (@var{name})
This macro returns a nonzero value (true) if the processor has the feature
@var{name}.
@end deftypefn

@deftypefn Macro int CPU_FEATURE_USABLE (@var{name})
This macro returns a nonzero value (true) if the processor has the feature
@var{name} and the feature is supported by the operating system.
@end deftypefn

The supported processor features are:

@itemize @bullet

@item
@code{ACPI} -- Thermal Monitor and Software Controlled Clock Facilities.

@item
@code{ADX} -- ADX instruction extensions.

@item
@code{APIC} -- APIC On-Chip.

@item
@code{AES} -- The AES instruction extensions.

@item
@code{AESKLE} -- AES Key Locker instructions are enabled by OS.

@item
@code{AMX_BF16} -- Tile computational operations on bfloat16 numbers.

@item
@code{AMX_INT8} -- Tile computational operations on 8-bit numbers.

@item
@code{AMX_TILE} -- Tile architecture.

@item
@code{ARCH_CAPABILITIES} -- IA32_ARCH_CAPABILITIES MSR.

@item
@code{AVX} -- The AVX instruction extensions.

@item
@code{AVX2} -- The AVX2 instruction extensions.

@item
@code{AVX_VNNI} -- The AVX-VNNI instruction extensions.

@item
@code{AVX512_4FMAPS} -- The AVX512_4FMAPS instruction extensions.

@item
@code{AVX512_4VNNIW} -- The AVX512_4VNNIW instruction extensions.

@item
@code{AVX512_BF16} -- The AVX512_BF16 instruction extensions.

@item
@code{AVX512_BITALG} -- The AVX512_BITALG instruction extensions.

@item
@code{AVX512_FP16} -- The AVX512_FP16 instruction extensions.

@item
@code{AVX512_IFMA} -- The AVX512_IFMA instruction extensions.

@item
@code{AVX512_VBMI} -- The AVX512_VBMI instruction extensions.

@item
@code{AVX512_VBMI2} -- The AVX512_VBMI2 instruction extensions.

@item
@code{AVX512_VNNI} -- The AVX512_VNNI instruction extensions.

@item
@code{AVX512_VP2INTERSECT} -- The AVX512_VP2INTERSECT instruction
extensions.

@item
@code{AVX512_VPOPCNTDQ} -- The AVX512_VPOPCNTDQ instruction extensions.

@item
@code{AVX512BW} -- The AVX512BW instruction extensions.

@item
@code{AVX512CD} -- The AVX512CD instruction extensions.

@item
@code{AVX512ER} -- The AVX512ER instruction extensions.

@item
@code{AVX512DQ} -- The AVX512DQ instruction extensions.

@item
@code{AVX512F} -- The AVX512F instruction extensions.

@item
@code{AVX512PF} -- The AVX512PF instruction extensions.

@item
@code{AVX512VL} -- The AVX512VL instruction extensions.

@item
@code{BMI1} -- BMI1 instructions.

@item
@code{BMI2} -- BMI2 instructions.

@item
@code{CLDEMOTE} -- CLDEMOTE instruction.

@item
@code{CLFLUSHOPT} -- CLFLUSHOPT instruction.

@item
@code{CLFSH} -- CLFLUSH instruction.

@item
@code{CLWB} -- CLWB instruction.

@item
@code{CMOV} -- Conditional Move instructions.

@item
@code{CMPXCHG16B} -- CMPXCHG16B instruction.

@item
@code{CNXT_ID} -- L1 Context ID.

@item
@code{CORE_CAPABILITIES} -- IA32_CORE_CAPABILITIES MSR.

@item
@code{CX8} -- CMPXCHG8B instruction.

@item
@code{DCA} -- Data prefetch from a memory mapped device.

@item
@code{DE} -- Debugging Extensions.

@item
@code{DEPR_FPU_CS_DS} -- Deprecates FPU CS and FPU DS values.

@item
@code{DS} -- Debug Store.

@item
@code{DS_CPL} -- CPL Qualified Debug Store.

@item
@code{DTES64} -- 64-bit DS Area.

@item
@code{EIST} -- Enhanced Intel SpeedStep technology.

@item
@code{ENQCMD} -- Enqueue Stores instructions.

@item
@code{ERMS} -- Enhanced REP MOVSB/STOSB.

@item
@code{F16C} -- 16-bit floating-point conversion instructions.

@item
@code{FMA} -- FMA extensions using YMM state.

@item
@code{FMA4} -- FMA4 instruction extensions.

@item
@code{FPU} -- X87 Floating Point Unit On-Chip.

@item
@code{FSGSBASE} -- RDFSBASE/RDGSBASE/WRFSBASE/WRGSBASE instructions.

@item
@code{FSRCS} -- Fast Short REP CMP and SCA.

@item
@code{FSRM} -- Fast Short REP MOV.

@item
@code{FSRS} -- Fast Short REP STO.

@item
@code{FXSR} -- FXSAVE and FXRSTOR instructions.

@item
@code{FZLRM} -- Fast Zero-Length REP MOV.

@item
@code{GFNI} -- GFNI instruction extensions.

@item
@code{HLE} -- HLE instruction extensions.

@item
@code{HTT} -- Max APIC IDs reserved field is Valid.

@item
@code{HRESET} -- History reset.

@item
@code{HYBRID} -- Hybrid processor.

@item
@code{IBRS_IBPB} -- Indirect branch restricted speculation (IBRS) and
the indirect branch predictor barrier (IBPB).

@item
@code{IBT} -- Intel Indirect Branch Tracking instruction extensions.

@item
@code{INVARIANT_TSC} -- Invariant TSC.

@item
@code{INVPCID} -- INVPCID instruction.

@item
@code{KL} -- AES Key Locker instructions.

@item
@code{LAM} -- Linear Address Masking.

@item
@code{L1D_FLUSH} -- IA32_FLUSH_CMD MSR.

@item
@code{LAHF64_SAHF64} -- LAHF/SAHF available in 64-bit mode.

@item
@code{LM} -- Long mode.

@item
@code{LWP} -- Lightweight profiling.

@item
@code{LZCNT} -- LZCNT instruction.

@item
@code{MCA} -- Machine Check Architecture.

@item
@code{MCE} -- Machine Check Exception.

@item
@code{MD_CLEAR} -- MD_CLEAR.

@item
@code{MMX} -- Intel MMX Technology.

@item
@code{MONITOR} --  MONITOR/MWAIT instructions.

@item
@code{MOVBE} -- MOVBE instruction.

@item
@code{MOVDIRI} -- MOVDIRI instruction.

@item
@code{MOVDIR64B} -- MOVDIR64B instruction.

@item
@code{MPX} -- Intel Memory Protection Extensions.

@item
@code{MSR} -- Model Specific Registers RDMSR and WRMSR instructions.

@item
@code{MTRR} -- Memory Type Range Registers.

@item
@code{NX} -- No-execute page protection.

@item
@code{OSPKE} -- OS has set CR4.PKE to enable protection keys.

@item
@code{OSXSAVE} -- The OS has set CR4.OSXSAVE[bit 18] to enable
XSETBV/XGETBV instructions to access XCR0 and to support processor
extended state management using XSAVE/XRSTOR.

@item
@code{PAE} -- Physical Address Extension.

@item
@code{PAGE1GB} -- 1-GByte page.

@item
@code{PAT} -- Page Attribute Table.

@item
@code{PBE} -- Pending Break Enable.

@item
@code{PCID} -- Process-context identifiers.

@item
@code{PCLMULQDQ} -- PCLMULQDQ instruction.

@item
@code{PCONFIG} -- PCONFIG instruction.

@item
@code{PDCM} -- Perfmon and Debug Capability.

@item
@code{PGE} -- Page Global Bit.

@item
@code{PKS} -- Protection keys for supervisor-mode pages.

@item
@code{PKU} -- Protection keys for user-mode pages.

@item
@code{POPCNT} -- POPCNT instruction.

@item
@code{PREFETCHW} -- PREFETCHW instruction.

@item
@code{PREFETCHWT1} -- PREFETCHWT1 instruction.

@item
@code{PSE} -- Page Size Extension.

@item
@code{PSE_36} -- 36-Bit Page Size Extension.

@item
@code{PSN} -- Processor Serial Number.

@item
@code{RDPID} -- RDPID instruction.

@item
@code{RDRAND} -- RDRAND instruction.

@item
@code{RDSEED} -- RDSEED instruction.

@item
@code{RDT_A} -- Intel Resource Director Technology (Intel RDT) Allocation
capability.

@item
@code{RDT_M} -- Intel Resource Director Technology (Intel RDT) Monitoring
capability.

@item
@code{RDTSCP} -- RDTSCP instruction.

@item
@code{RTM} -- RTM instruction extensions.

@item
@code{SDBG} -- IA32_DEBUG_INTERFACE MSR for silicon debug.

@item
@code{SEP} -- SYSENTER and SYSEXIT instructions.

@item
@code{SERIALIZE} -- SERIALIZE instruction.

@item
@code{SGX} -- Intel Software Guard Extensions.

@item
@code{SGX_LC} -- SGX Launch Configuration.

@item
@code{SHA} -- SHA instruction extensions.

@item
@code{SHSTK} -- Intel Shadow Stack instruction extensions.

@item
@code{SMAP} -- Supervisor-Mode Access Prevention.

@item
@code{SMEP} -- Supervisor-Mode Execution Prevention.

@item
@code{SMX} -- Safer Mode Extensions.

@item
@code{SS} -- Self Snoop.

@item
@code{SSBD} -- Speculative Store Bypass Disable (SSBD).

@item
@code{SSE} -- Streaming SIMD Extensions.

@item
@code{SSE2} -- Streaming SIMD Extensions 2.

@item
@code{SSE3} -- Streaming SIMD Extensions 3.

@item
@code{SSE4_1} -- Streaming SIMD Extensions 4.1.

@item
@code{SSE4_2} -- Streaming SIMD Extensions 4.2.

@item
@code{SSE4A} -- SSE4A instruction extensions.

@item
@code{SSSE3} -- Supplemental Streaming SIMD Extensions 3.

@item
@code{STIBP} -- Single thread indirect branch predictors (STIBP).

@item
@code{SVM} -- Secure Virtual Machine.

@item
@code{SYSCALL_SYSRET} -- SYSCALL/SYSRET instructions.

@item
@code{TBM} -- Trailing bit manipulation instructions.

@item
@code{TM} -- Thermal Monitor.

@item
@code{TM2} -- Thermal Monitor 2.

@item
@code{TRACE} -- Intel Processor Trace.

@item
@code{TSC} -- Time Stamp Counter.  RDTSC instruction.

@item
@code{TSC_ADJUST} -- IA32_TSC_ADJUST MSR.

@item
@code{TSC_DEADLINE} -- Local APIC timer supports one-shot operation
using a TSC deadline value.

@item
@code{TSXLDTRK} -- TSXLDTRK instructions.

@item
@code{UINTR} -- User interrupts.

@item
@code{UMIP} -- User-mode instruction prevention.

@item
@code{VAES} -- VAES instruction extensions.

@item
@code{VME} -- Virtual 8086 Mode Enhancements.

@item
@code{VMX} -- Virtual Machine Extensions.

@item
@code{VPCLMULQDQ} -- VPCLMULQDQ instruction.

@item
@code{WAITPKG} -- WAITPKG instruction extensions.

@item
@code{WBNOINVD} -- WBINVD/WBNOINVD instructions.

@item
@code{WIDE_KL} -- AES wide Key Locker instructions.

@item
@code{X2APIC} -- x2APIC.

@item
@code{XFD} -- Extended Feature Disable (XFD).

@item
@code{XGETBV_ECX_1} -- XGETBV with ECX = 1.

@item
@code{XOP} -- XOP instruction extensions.

@item
@code{XSAVE} -- The XSAVE/XRSTOR processor extended states feature, the
XSETBV/XGETBV instructions, and XCR0.

@item
@code{XSAVEC} -- XSAVEC instruction.

@item
@code{XSAVEOPT} -- XSAVEOPT instruction.

@item
@code{XSAVES} -- XSAVES/XRSTORS instructions.

@item
@code{XTPRUPDCTRL} -- xTPR Update Control.

@end itemize

You could query if a processor supports @code{AVX} with:

@smallexample
#include <sys/platform/x86.h>

int
support_avx (void)
@{
  return HAS_CPU_FEATURE (AVX);
@}
@end smallexample

and if @code{AVX} is usable with:

@smallexample
#include <sys/platform/x86.h>

int
usable_avx (void)
@{
  return CPU_FEATURE_USABLE (AVX);
@}
@end smallexample