summaryrefslogtreecommitdiff
path: root/docs/users_guide
diff options
context:
space:
mode:
Diffstat (limited to 'docs/users_guide')
-rw-r--r--docs/users_guide/using.xml54
1 files changed, 40 insertions, 14 deletions
diff --git a/docs/users_guide/using.xml b/docs/users_guide/using.xml
index 499e486572..0504fb5d41 100644
--- a/docs/users_guide/using.xml
+++ b/docs/users_guide/using.xml
@@ -3308,13 +3308,29 @@ data D = D !C
<listitem>
<para><indexterm><primary><option>-N<replaceable>x</replaceable></option></primary><secondary>RTS option</secondary></indexterm>
Use <replaceable>x</replaceable> simultaneous threads when
- running the program. Normally <replaceable>x</replaceable>
- should be chosen to match the number of CPU cores on the
- machine<footnote><para>Whether hyperthreading cores should be counted or not is an
- open question; please feel free to experiment and let us know what
- results you find.</para></footnote>. For example,
- on a dual-core machine we would probably use
- <literal>+RTS -N2 -RTS</literal>.</para>
+ running the program.</para>
+
+ <para>The runtime manages a set of virtual processors,
+ which we call <emphasis>capabilities</emphasis>, the
+ number of which is determined by the <option>-N</option>
+ option. Each capability can run one Haskell thread at a
+ time, so the number of capabilities is equal to the
+ number of Haskell threads that can run physically in
+ parallel. A capability is animated by one or more OS
+ threads; the runtime manages a pool of OS threads for
+ each capability, so that if a Haskell thread makes a
+ foreign call (see <xref linkend="ffi-threads" />)
+ another OS thread can take over that capability.
+ </para>
+
+ <para>Normally <replaceable>x</replaceable> should be
+ chosen to match the number of CPU cores on the
+ machine<footnote><para>Whether hyperthreading cores
+ should be counted or not is an open question; please
+ feel free to experiment and let us know what results you
+ find.</para></footnote>. For example, on a dual-core
+ machine we would probably use <literal>+RTS -N2
+ -RTS</literal>.</para>
<para>Omitting <replaceable>x</replaceable>,
i.e. <literal>+RTS -N -RTS</literal>, lets the runtime
@@ -3331,10 +3347,11 @@ data D = D !C
<xref linkend="rts-options-gc" />).</para>
<para>The current value of the <option>-N</option> option
- is available to the Haskell program
- via <literal>Control.Concurrent.getNumCapabilities</literal>, and
- it may be changed while the program is running by
- calling <literal>Control.Concurrent.setNumCapabilities</literal>.</para>
+ is available to the Haskell program via
+ <literal>Control.Concurrent.getNumCapabilities</literal>,
+ and it may be changed while the program is running by
+ calling
+ <literal>Control.Concurrent.setNumCapabilities</literal>.</para>
</listitem>
</varlistentry>
</variablelist>
@@ -3349,9 +3366,18 @@ data D = D !C
option</secondary></indexterm>
<listitem>
<para>Use the OS's affinity facilities to try to pin OS
- threads to CPU cores. This is an experimental feature,
- and may or may not be useful. Please let us know
- whether it helps for you!</para>
+ threads to CPU cores.</para>
+
+ <para>When this option is enabled, the OS threads for a
+ capability <emphasis>i</emphasis> are bound to the CPU
+ core <emphasis>i</emphasis> using the API provided by the
+ OS for setting thread affinity. e.g. on Linux
+ GHC uses <literal>sched_setaffinity()</literal>.</para>
+
+ <para>Depending on your workload and the other activity on
+ the machine, this may or may not result in a performance
+ improvement. We recommend trying it out and measuring the
+ difference.</para>
</listitem>
</varlistentry>
<varlistentry>