diff --git a/README.md b/README.md
index 7ce7462..9e1e52a 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-    Document Number: N4506
-    Date:            2015-05-05
+    Document Number: N4699
+    Date:            2017-10-16
     Revises:
     Project:         Programming Language C++
     Project Number:  TS 19570
@@ -7,24 +7,22 @@
                      NVIDIA Corporation
                      jhoberock@nvidia.com
 
-# Parallelism TS Editor's Report, post-Lenexa mailing 
+# Parallelism TS Editor's Report, pre-Albuquerque mailing 
 
-N4505 is the latest Parallelism TS Working Draft. It contains editorial and technical changes to the Parallelism TS to apply the following revisions:
+N4698 is the proposed working draft of Parallelism TS Version 2. It contains changes to the Parallelism TS as directed by the committee at the Toronto meeting, and editorial changes.
 
-  * N4274 - Relaxing Packing Rules for Exceptions Thrown by Parallel Algorithms - Proposed Wording (Revision 1)
-  * Feature test macro for the Parallelism TS
+N4698 updates the previous draft, N4669, published in the pre-Toronto mailing.
 
-N4505 updates the previous draft, N4407, published in the pre-Lenexa mailing.
+# Technical Changes
 
-N4507 is document N4505 reformatted as a TS document. It updates N4409, which was published in the pre-Lenexa mailing.
+* Apply P0076R4 - Vector and Wavefront Policies.
 
-## Technical Changes
+# Editorial Changes
 
-* Applied N4274, which relaxes the exception packaging rules for exceptions thrown by parallel algorithms. Additionally, changed instances of "terminates with (exception)" phrasing to "exits via (exception)", as directed by the Library Working Group.
+* Reformat Table 1 - Feature Test Macro(s), to match the style of the Library Fundamentals TS.
 
-* Introduced the feature test macro `__cpp_lib_experimental_parallel_algorithm` for the functionality of the Parallelism TS as directed by SG1.
+# Notes
 
-## Editorial Changes
-
-* Promoted subsection 1.3.1, which was incorrectly grouped under section 1.3, to section 1.4.
+* The pre-existing content of N4698 has not yet been harmonized with C++17. As a result, this content is named and namespaced inconsistently with the newly applied content of P0076R4. We anticipate that these inconsistencies will be harmonized by a future revision.
+* N4698 contains forward references to `for_loop` and `for_loop_strided`. We anticipate their introduction in a future revision.
 
diff --git a/algorithms.html b/algorithms.html
index 0a62818..17ec63d 100644
--- a/algorithms.html
+++ b/algorithms.html
@@ -88,6 +88,37 @@ <h1>Effect of execution policies on algorithm execution</h1>
         incremented correctly.
       </cxx-example>
 
+      <ins>
+      <p>
+        The invocations of element access functions in parallel algorithms invoked with an
+        execution policy of type <code>unsequenced_policy</code> are permitted to execute
+        in an unordered fashion in the calling thread, unsequenced with respect to one another
+        within the calling thread.
+
+        <cxx-note>
+          This means that multiple function object invocations may be interleaved on a single thread.
+        </cxx-note>
+        <pre>
+</pre>
+
+        <cxx-note>
+          This overrides the usual guarantee from the C++ standard, Section 1.9 [intro.execution] that
+          function executions do not interleave with one another.
+        </cxx-note>
+      </p>
+      </ins>
+
+      <ins>
+      <p>
+        The invocations of element access functions in parallel algorithms invoked with an
+        executino policy of type <code>vector_policy</code> are permitted to execute
+        in an unordered fashion in the calling thread, unsequenced with respect to one another
+        within the calling thread, subject to the sequencing constraints of wavefront application
+        (<cxx-ref to="parallel.alg.general.wavefront"></cxx-ref>) for the last argument to
+        <code>for_loop</code> or <code>for_loop_strided</code>.
+      </p>
+      </ins>
+
       <p>
         The invocations of element access functions in parallel algorithms invoked with an execution
         policy of type <code>parallel_vector_execution_policy</code>
@@ -163,6 +194,107 @@ <h1>Effect of execution policies on algorithm execution</h1>
       </p>
     </cxx-section>
 
+    <cxx-section id="parallel.alg.general.wavefront">
+      <h1>Wavefront Application</h1>
+      <ins>
+      <p>
+        For the purposes of this section, an <i>evaluation</i> is a value computation or side effect of
+        an expression, or an execution of a statement. Initialization of a temporary object is considered a
+        subexpression of the expression that necessitates the temporary object.
+      </p>
+
+      <p>
+        An evaluation A <i>contains</i> an evaluation B if:
+
+        <ul>
+        <li>A and B are not potentially concurrent ([intro.races]); and</li>
+        <li>the start of A is the start of B or the start of A is sequenced before the start of B; and</li>
+        <li>the completion of B is the completion of A or the completion of B is sequenced before the completion of A.</li>
+        </ul>
+
+        <cxx-note>This includes evaluations occurring in function invocations.</cxx-note>
+      </p>
+
+      <p>
+        An evaluation A is <i>ordered before</i> an evaluation B if A is deterministically
+        sequenced before B. <cxx-note>If A is indeterminately sequenced with respect to B
+        or A and B are unsequenced, then A is not ordered before B and B is not ordered
+        before A. The ordered before relationship is transitive.</cxx-note>
+      </p>
+
+      <p>
+        For an evaluation A ordered before an evaluation B, both contained in the same
+        invocation of an element access function, A is a <i>vertical antecedent</i> of B if:
+
+        <ul>
+        <li>there exists an evaluation S such that:
+          <ul>
+            <li>S contains A, and</li>
+            <li>S contains all evaluations C (if any) such that A is ordered before C and C is ordered before B,</li>
+            <li>but S does not contain B, and</li>
+          </ul>
+        </li>
+        <li>
+          control reached B from A without executing any of the following:
+          <ul>
+            <li>a <code>goto</code> statement or <code>asm</code> declaration that jumps to a statement outside of S, or</li>
+            <li>a <code>switch</code> statement executed within S that transfers control into a substatement of a nested selection or iteration statement, or</li>
+            <li>a <code>throw</code> <cxx-note>even if caught</cxx-note>, or</li>
+            <li>a <code>longjmp</code>.
+          </ul>
+        </li>
+        </ul>
+
+        <cxx-note>
+          Vertical antecedent is an irreflexive, antisymmetric, nontransitive relationship between two evaluations.
+          Informally, A is a vertical antecedent of B if A is sequenced immediately before B or A is nested zero or
+          more levels within a statement S that immediately precedes B.
+        </cxx-note>
+      </p>
+
+      <p>
+        In the following, <i>X<sub>i</sub></i> and <i>X<sub>j</sub></i> refer to evaluations of the <i>same</i> expression
+        or statement contained in the application of an element access function corresponding to the i<sup>th</sup> and
+        j<sup>th</sup> elements of the input sequence. <cxx-note>There might be several evaluations <i>X<sub>k</sub></i>,
+        <i>Y<sub>k</sub></i>, etc. of a single expression or statement in application <i>k</i>, for example, if the
+        expression or statement appears in a loop within the element access function.</cxx-note>
+      </p>
+
+      <p>
+        <i>Horizontally matched</i> is an equivalence relationship between two evaluations of the same expression. An
+        evaluation B<sub>i</sub> is <i>horizontally matched</i> with an evaluation B<sub>j</sub> if:
+
+        <ul>
+          <li>both are the first evaluations in their respective applications of the element access function, or</li>
+          <li>there exist horizontally matched evaluations A<sub>i</sub> and A<sub>j</sub> that are vertical antecedents of evaluations B<sub>i</sub> and B<sub>j</sub>, respectively.
+        </ul>
+
+        <cxx-note>
+          <i>Horizontally matched</i> establishes a theoretical <i>lock-step</i> relationship between evaluations in different applications of an element access function.
+        </cxx-note>
+      </p>
+
+      <p>
+        Let <i>f</i> be a function called for each argument list in a sequence of argument lists.
+        <i>Wavefront application</i> of <i>f</i> requires that evaluation A<sub>i</sub> be sequenced
+        before evaluation B<sub>i</sub> if i &lt; j and and:
+
+        <ul>
+          <li>A<sub>i</sub> is sequenced before some evaluation B<sub>i</sub> and B<sub>i</sub> is horizontally matched with B<sub>j</sub>, or</li>
+          <li>A<sub>i</sub> is horizontally matched with some evaluation A<sub>j</sub> and A<sub>j</sub> is sequenced before B<sub>j<sub>.</li>
+        </ul>
+
+        <cxx-note>
+          <i>Wavefront application</i> guarantees that parallel applications i and j execute such that progress on application j never gets <i>ahead</i> of application i.
+        </cxx-note>
+
+        <cxx-note>
+          The relationships between A<sub>i</sub> and B<sub>i</sub> and between A<sub>j</sub> and B<sub>j</sub> are <i>sequenced before</i>, not <i>vertical antecedent</i>.
+        </cxx-note>
+      </p>
+      </ins>
+    </cxx-section>
+
     <cxx-section id="parallel.alg.overloads">
       <h1><code>ExecutionPolicy</code> algorithm overloads</h1>
 
@@ -365,7 +497,7 @@ <h1>Header <code>&lt;experimental/algorithm&gt;</code> synopsis</h1>
 namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace v2 {
   template&lt;class ExecutionPolicy,
            class InputIterator, class Function&gt;
     void for_each(ExecutionPolicy&amp;&amp; exec,
@@ -379,6 +511,20 @@ <h1>Header <code>&lt;experimental/algorithm&gt;</code> synopsis</h1>
     InputIterator for_each_n(ExecutionPolicy&amp;&amp; exec,
                              InputIterator first, Size n,
                              Function f);
+
+<ins>namespace execution {
+  <cxx-ref insynopsis="" to="parallel.alg.novec"></cxx-ref>
+  template&lt;class F&gt;
+    auto no_vec(F&amp;&amp; f) noexcept -&gt; decltype(std::forward&lt;F&gt;(f)());
+
+  <cxx-ref insynopsis="" to="parallel.alg.ordupdate.class"></cxx-ref>
+  template&lt;class T&gt;
+    class ordered_update_t;
+
+  <cxx-ref insynopsis="" to="parallel.alg.ordupdate.func"></cxx-ref>
+  template&lt;class T&gt;
+    ordered_update_t&lt;T&gt; ordered_update(T&amp; ref) noexcept;
+}</ins>
 }
 }
 }
@@ -487,6 +633,143 @@ <h1>For each</h1>
         </cxx-notes>
       </cxx-function>
     </cxx-section>
+
+    <cxx-section id="parallel.alg.novec">
+      <h1>No vec</h1>
+
+      <ins>
+      <cxx-function>
+        <cxx-signature>template&lt;class F&gt;
+auto no_vec(F&amp;&amp; f) noexcept -&gt; decltype(std::forward&lt;F&gt;(f)());</cxx-signature>
+
+        <cxx-effects>
+          Evaluates <code>std::forward&gt;F&lt;(f)()</code>. When invoked within an element access function
+          in a parallel algorithm using <code>vector_policy</code>, if two calls to <code>no_vec</code> are
+          horizontally matched within a wavefront application of an element access function over input
+          sequence S, then the execution of <code>f</code> in the application for one element in S is
+          sequenced before the execution of <code>f</code> in the application for a subsequent element in
+          S; otherwise, there is no effect on sequencing.
+        </cxx-effects>
+
+        <cxx-returns>
+          the result of <code>f</code>.
+        </cxx-returns>
+        
+        <cxx-remarks>
+          If <code>f</code> returns a result, the result is ignored.
+        </cxx-remarks>
+
+        <cxx-notes>
+          If <code>f</code> exits via an exception, then <code>terminate</code> will be called, consistent
+          with all other potentially-throwing operations invoked with <code>vector_policy</code> execution.
+
+          <cxx-example>
+            <pre>extern int* p;
+for_loop(vec, 0, n[&amp;](int i) {
+  y[i] +=y[i+1];
+  if(y[i] &lt; 0) {
+    no_vec([]{
+      *p++ = i;
+    });
+  }
+});</pre>
+
+            The updates <code>*p++ = i</code> will occur in the same order as if the policy were <code>seq</code>.
+          </cxx-example>
+        </cxx-notes>
+      </cxx-function>
+      </ins>
+    </cxx-section>
+
+    <cxx-section id="parallel.alg.ordupdate.class">
+      <h1>Ordered update class</h1>
+
+      <ins>
+<pre>
+class ordered_update_t {
+  T&amp; ref_; // exposition only
+public:
+  ordered_update_t(T&amp; loc) noexcept
+    : ref_(loc) {}
+  ordered_update_t(const ordered_update_t&amp;) = delete;
+  ordered_update_t&amp; operator=(const ordered_update_t&amp;) = delete;
+
+  template &lt;class U&gt;
+    auto operator=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ = std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator+=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ += std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator-=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ -= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator*=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ *= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator/=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ /= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator%=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ %= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator&gt;&gt;=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ &gt;&gt;= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator&lt;&lt;=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ &lt;&lt;= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator&amp;=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ &amp;= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator^=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ ^= std::move(rhs); }); }
+  template &lt;class U&gt;
+    auto operator|=(U rhs) const noexcept
+      { return no_vec([&amp;]{ return ref_ |= std::move(rhs); }); }
+
+  auto operator++() const noexcept
+    { return no_vec([&amp;]{ return ++ref_; }); }
+  auto operator++(int) const noexcept
+    { return no_vec([&amp;]{ return ref_++; }); }
+  auto operator--() const noexcept
+    { return no_vec([&amp;]{ return --ref_; }); }
+  auto operator--(int) const noexcept
+    { return no_vec([&amp;]{ return ref_--; }); }
+};
+</pre>
+
+      <p>
+        An object of type <code>ordered_update_t&gt;T&lt;</code> is a proxy for an object of type T
+        intended to be used within a parallel application of an element access function using a
+        policy object of type <code>vector_policy</code>. Simple increments, assignments, and compound
+        assignments to the object are forwarded to the proxied object, but are sequenced as though
+        executed within a <code>no_vec</code> invocation.
+
+        <cxx-note>
+          The return-value deduction of the forwarded operations results in these operations returning by
+          value, not reference. This formulation prevents accidental collisions on accesses to the return
+          value.
+        </cxx-note>
+      </p>
+      </ins>
+    </cxx-section>
+
+    <cxx-section id="parallel.alg.ordupdate.func">
+      <h1>Ordered update function template</h1>
+      <ins>
+
+      <cxx-function>
+        <cxx-signature>template&lt;T&gt;
+ordered_update_t&lt;T&gt; ordered_update(T&amp; loc) noexcept;</cxx-signature>
+      </cxx-function>
+
+      <cxx-returns>
+        <code>{ loc }</code>.
+      </cxx-returns>
+
+      </ins>
+    </cxx-section>
   </cxx-section>
 
     <cxx-section id="parallel.alg.numeric">
@@ -499,7 +782,7 @@ <h1>Header <code>&lt;experimental/numeric&gt;</code> synopsis</h1>
 namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace v2 {
   template&lt;class InputIterator&gt;
     typename iterator_traits&lt;InputIterator&gt;::value_type
       reduce(InputIterator first, InputIterator last);
@@ -772,7 +1055,7 @@ <h1>Inclusive scan</h1>
 OutputIterator inclusive_scan(InputIterator first, InputIterator last,
                               OutputIterator result,
                               BinaryOperation binary_op);</cxx-signature>
-        <cxx-signature>template&lt;class InputIterator, class OutputIterator, class BinaryOperation&gt;
+        <cxx-signature>template&lt;class InputIterator, class OutputIterator, class BinaryOperation, class T&gt;
 OutputIterator inclusive_scan(InputIterator first, InputIterator last,
                               OutputIterator result,
                               BinaryOperation binary_op, T init);</cxx-signature>
diff --git a/exceptions.html b/exceptions.html
index 4c07716..d32baef 100644
--- a/exceptions.html
+++ b/exceptions.html
@@ -9,40 +9,37 @@ <h1>Exception reporting behavior</h1>
         </p>
         <p>
           During the execution of a standard parallel algorithm, if the invocation of an element access function
-          <ins>exits via</ins><del>terminates with</del> an uncaught exception, the behavior of the program is determined by the type of
+          exits via an uncaught exception, the behavior of the program is determined by the type of
           execution policy used to invoke the algorithm:
 
           <ul>
             <li>
-              If the execution policy object is of type <code>class parallel_vector_execution_policy</code>,
+              If the execution policy object is of type <code><del>class </del>parallel_vector_execution_policy</code><ins>, <code>unsequenced_policy</code>, or <code>vector_policy</code></ins>,
               <code>std::terminate</code> shall be called.
             </li>
             <li>
               If the execution policy object is of type <code>sequential_execution_policy</code> or
-              <code>parallel_execution_policy</code>, the execution of the algorithm <ins>exits via</ins><del>terminates with</del> an
-              <del><code>exception_list</code></del> exception. <ins>The exception shall be an <code>exception_list</code> containing all</ins><del>All</del> uncaught exceptions thrown during
-              the invocations of element access functions<ins>, or optionally the uncaught exception if there was only one</ins><del>shall be contained in the
-              <code>exception_list</code></del>.<pre>
+              <code>parallel_execution_policy</code>, the execution of the algorithm exits via an
+               exception. The exception shall be an <code>exception_list</code> containing all uncaught exceptions thrown during
+              the invocations of element access functions, or optionally the uncaught exception if there was only one.<pre>
 </pre>
 
               <cxx-note>
-                For example, <del>the number of invocations of the user-provided function object in
-                <code>for_each</code> is unspecified. W</del><ins>w</ins>hen <code>for_each</code> is executed sequentially,
-                <ins>if an invocation of the user-provided function object throws an exception, <code>for_each</code> can exit via the uncaught exception, or throw an <code>exception_list</code> containing the original exception.
-                <del>only one exception will be contained in the <code>exception_list</code> object.</del>
+                For example, when <code>for_each</code> is executed sequentially,
+                if an invocation of the user-provided function object throws an exception, <code>for_each</code> can exit via the uncaught exception, or throw an <code>exception_list</code> containing the original exception.
               </cxx-note><pre>
 </pre>
 
               <cxx-note>
                 These guarantees imply that, unless the algorithm has failed to allocate memory and
-                <ins>exits via</ins><del>terminated with</del> <code>std::bad_alloc</code>, all exceptions thrown during the execution of
+                exits via <code>std::bad_alloc</code>, all exceptions thrown during the execution of
                 the algorithm are communicated to the caller. It is unspecified whether an algorithm implementation will "forge ahead" after 
                 encountering and capturing a user exception.
               </cxx-note><pre>
 </pre>
               <cxx-note>
-                The algorithm may <ins>exit via</ins><del>terminate with</del> the <code>std::bad_alloc</code> exception even if one or more
-                user-provided function objects have <ins>exited via</ins><del>terminated with</del> an exception. For example, this can happen when an algorithm fails to allocate memory while
+                The algorithm may exit via the <code>std::bad_alloc</code> exception even if one or more
+                user-provided function objects have exited via an exception. For example, this can happen when an algorithm fails to allocate memory while
                 creating or adding elements to the <code>exception_list</code> object.
               </cxx-note>
             </li>
@@ -60,7 +57,7 @@ <h1>Header <code>&lt;experimental/exception_list&gt;</code> synopsis</h1>
 namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace v2 {
 
   class exception_list : public exception
   {
diff --git a/execution_policies.html b/execution_policies.html
index 5e6be39..a6c285b 100644
--- a/execution_policies.html
+++ b/execution_policies.html
@@ -52,7 +52,7 @@ <h1>Header <code>&lt;experimental/execution_policy&gt;</code> synopsis</h1>
 namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace v2 {
   <cxx-ref insynopsis="" to="parallel.execpol.type"></cxx-ref>
   template&lt;class T&gt; struct is_execution_policy;
   template&lt;class T&gt; constexpr bool is_execution_policy_v = is_execution_policy&lt;T&gt;::value;
@@ -63,12 +63,19 @@ <h1>Header <code>&lt;experimental/execution_policy&gt;</code> synopsis</h1>
   <cxx-ref insynopsis="" to="parallel.execpol.par"></cxx-ref>
   class parallel_execution_policy;
 
-  <cxx-ref insynopsis="" to="parallel.execpol.vec"></cxx-ref>
+  <cxx-ref insynopsis="" to="parallel.execpol.parvec"></cxx-ref>
   class parallel_vector_execution_policy;
 
   <cxx-ref insynopsis="" to="parallel.execpol.dynamic"></cxx-ref>
   class execution_policy;
-}
+
+<ins>namespace execution {
+  <cxx-ref insynopsis="" to="parallel.execpol.unseq"></cxx-ref>
+  <ins>class unsequenced_policy;</ins>
+
+  <cxx-ref insynopsis="" to="parallel.execpol.vec"></cxx-ref>
+  <ins>class vector_policy;</ins>
+}</ins>
 }
 }
 }
@@ -115,7 +122,7 @@ <h1>Parallel execution policy</h1>
     <p>The class <code>parallel_execution_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and indicate that a parallel algorithm's execution may be parallelized.</p>
 
   </cxx-section>
-  <cxx-section id="parallel.execpol.vec">
+  <cxx-section id="parallel.execpol.parvec">
     <h1>Parallel+Vector execution policy</h1>
 
 <pre>
@@ -126,6 +133,32 @@ <h1>Parallel+Vector execution policy</h1>
 
   </cxx-section>
 
+  <cxx-section id="parallel.execpol.unseq">
+    <h1>Unsequenced execution policy</h1>
+<ins>
+
+<pre>
+class unsequenced_policy{ <i>unspecified</i> };
+</pre>
+
+    <p>The class <code>unsequenced_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and indicate that a parallel algorithm's execution may be vectorized, e.g., executed on a single thread using instructions that operate on multiple data items.</p>
+
+</ins>
+  </cxx-section>
+
+  <cxx-section id="parallel.execpol.vec">
+    <h1>Vector execution policy</h1>
+<ins>
+
+<pre>
+class vector_policy{ <i>unspecified</i> };
+</pre>
+
+    <p>The class <code>vector_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and indicate that a parallel algorithm's execution may be vectorized. Additionally, such vectorization will result in an execution that respects the sequencing constraints of wavefront application ([parallel.alg.general.wavefront]). <cxx-note>The implementation thus makes stronger guarantees than for <code>unsequenced_policy</code>, for example.</cxx-note></p>
+
+</ins>
+  </cxx-section>
+
   <cxx-section id="parallel.execpol.dynamic">
     <h1>Dynamic execution policy</h1>
 
diff --git a/front_matter.html b/front_matter.html
index 9e1e977..9a1ad15 100644
--- a/front_matter.html
+++ b/front_matter.html
@@ -1,14 +1,14 @@
 <cxx-titlepage stage="draft">
-<cxx-docnum>N4505</cxx-docnum>
+<cxx-docnum>N4698</cxx-docnum>
 <cxx-project-number>19570</cxx-project-number>
-  <time pubdate="">2015-05-05</time>
-  <cxx-revises><a href="http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2015/n4407.html">N4407</a></cxx-revises>
+  <time pubdate="">2017-10-16</time>
+  <cxx-revises><a href="http://wg21.link/N4669">N4669</a></cxx-revises>
   <cxx-editor>
     Jared Hoberock<br/>
     NVIDIA Corporation<br/>
     <cxx-email>jhoberock@nvidia.com</cxx-email>
   </cxx-editor>
-  <h1>Technical Specification for C++ Extensions for Parallelism</h1>
+  <h1>Technical Specification for C++ Extensions for Parallelism Version 2</h1>
 </cxx-titlepage>
 
 <cxx-toc></cxx-toc>
diff --git a/general.html b/general.html
index 5abe923..2fad874 100644
--- a/general.html
+++ b/general.html
@@ -1,50 +1,5 @@
 <cxx-clause id="parallel.general">
   <h1>General</h1>
-  <cxx-section id="parallel.general.scope">
-    <h1>Scope</h1>
-      <p>This Technical Specification describes requirements for implementations of an
-      interface that computer programs written in the C++ programming language may
-      use to invoke algorithms with parallel execution. The algorithms described by
-      this Technical Specification are realizable across a broad class of
-      computer architectures.</p>
-      
-      <p>This Technical Specification is non-normative. Some of the functionality
-      described by this Technical Specification may be considered for standardization
-      in a future version of C++, but it is not currently part of any C++ standard.
-      Some of the functionality in this Technical Specification may never be
-      standardized, and other functionality may be standardized in a substantially
-      changed form.</p>
-      
-      <p>The goal of this Technical Specification is to build widespread existing
-      practice for parallelism in the C++ standard algorithms library. It gives
-      advice on extensions to those vendors who wish to provide them.</p>
-  </cxx-section>
-
-  <cxx-section id="parallel.general.references">
-    <h1>Normative references</h1>
-
-    <p>The following referenced document is indispensable for the
-    application of this document. For dated references, only the
-    edition cited applies. For undated references, the latest edition
-    of the referenced document (including any amendments) applies.</p>
-
-    <ul>
-      <li>ISO/IEC 14882:—<cxx-footnote>To be published. Section references are relative to <a href="http://www.open-std.org/jtc1/sc22/wg21/prot/14882fdis/n3937.pdf">N3937</a>.</cxx-footnote>,
-      <cite>Programming Languages — C++</cite>
-      <cxx-foreign-index id="cxx" src="cxx_N3797_index.json" name="C++14"></cxx-foreign-index></li>
-    </ul>
-
-    <p>ISO/IEC 14882:— is herein called the <dfn>C++ Standard</dfn>.
-    The library described in ISO/IEC 14882:— clauses 17-30 is herein called
-    the <dfn>C++ Standard Library</dfn>. The C++ Standard Library components described in
-    ISO/IEC 14882:— clauses 25, 26.7 and 20.7.2 are herein called the <dfn>C++ Standard
-    Algorithms Library</dfn>.</p>
-
-    <p>Unless otherwise specified, the whole of the C++ Standard's Library
-    introduction (<cxx-ref in="cxx" to="library"></cxx-ref>) is included into this
-    Technical Specification by reference.</p>
-  </cxx-section>
-
   <cxx-section id="parallel.general.namespaces">
     <h1>Namespaces and headers</h1>
 
@@ -52,7 +7,7 @@ <h1>Namespaces and headers</h1>
     experimental and not part of the C++ Standard Library, they should not be
     declared directly within namespace <code>std</code>. Unless otherwise specified, all
     components described in this Technical Specification are declared in namespace 
-    <code>std::experimental::parallel::v1</code>.</p>
+    <code>std::experimental::parallel::v2</code>.</p>
 
     <cxx-note>
     Once standardized, the components described by this Technical Specification are expected to be promoted to namespace <code>std</code>. 
@@ -60,7 +15,7 @@ <h1>Namespaces and headers</h1>
 
     <p>Unless otherwise specified, references to such entities described in this
     Technical Specification are assumed to be qualified with
-    <code>std::experimental::parallel::v1</code>, and references to entities described in the C++
+    <code>std::experimental::parallel::v2</code>, and references to entities described in the C++
     Standard Library are assumed to be qualified with <code>std::</code>.</p>
 
     <p>Extensions that are expected to eventually be added to an existing header
@@ -72,65 +27,11 @@ <h1>Namespaces and headers</h1>
 </pre>
   </cxx-section>
 
-  <cxx-section id="parallel.general.defns">
-    <h1>Terms and definitions</h1>
-  
-    <p>For the purposes of this document, the terms and definitions given in the C++ Standard and the following apply.</p>
-  
-    <p>A <dfn>parallel algorithm</dfn> is a function template described by this Technical Specification declared in namespace <code>std::experimental::parallel::v1</code> with a formal template parameter named <code>ExecutionPolicy</code>.</p>
-  
-    <p>
-      Parallel algorithms access objects indirectly accessible via their arguments by invoking the following functions:
-  
-      <ul>
-        <li>
-          All operations of the categories of the iterators that the algorithm is instantiated with.
-        </li>
-  
-        <li>
-          Functions on those sequence elements that are required by its specification.
-        </li>
-  
-        <li>
-          User-provided function objects to be applied during the execution of the algorithm, if required by the specification.
-        </li>
-
-        <ins><li>
-          Operations on those function objects required by the specification.
-
-          <cxx-note>
-            See clause 25.1 of <em>C++ Standard Algorithms Library</em>.
-          </cxx-note>
-        </li></ins>
-      </ul>
-  
-      These functions are herein called <em>element access functions</em>.
-  
-      <cxx-example>
-        The <code>sort</code> function may invoke the following element access functions:
-  
-        <ul>
-          <li>
-            Methods of the random-access iterator of the actual template argument, as per 24.2.7, as implied by the name of the
-            template parameters <code>RandomAccessIterator</code>.
-          </li>
-  
-          <li>
-            The <code>swap</code> function on the elements of the sequence (as per 25.4.1.1 [sort]/2).
-          </li>
-  
-          <li>
-            The user-provided <code>Compare</code> function object.
-          </li>
-        </ul>
-      </cxx-example>
-  </cxx-section>
-
   <cxx-section id="parallel.general.features">
-  <ins>
     <h1>Feature-testing recommendations</h1>     
     <p>An implementation that provides support for this Technical Specification shall define the feature test macro(s) in Table 1.</p>
 
+    <del>
     <table is="cxx-table" class="column-rules">
       <caption>Feature Test Macro(s)</caption>
 
@@ -150,9 +51,67 @@ <h1>Feature-testing recommendations</h1>
             <code>&lt;experimental/numeric&gt;</code>
           </td>
         </tr>
+        <tr>
+          <td><code>__cpp_lib_experimental_parallel_task_block</code></td>
+          <td>201510</td>
+          <td>
+            <code>&lt;experimental/task_block&gt;</code><br>
+          </td>
+        </tr>
+      </thead>
+    </table>
+    </del>
+
+    <ins>
+    <table is="cxx-table" class="column-rules">
+      <caption>Feature Test Macro(s)</caption>
+
+      <thead>
+        <tr>
+          <th>Doc. No.</th>
+          <th>Title</th>
+          <th>Primary Section</th>
+          <th>Macro Name</th>
+          <th>Value</th>
+          <th>Header</th>
+        </tr>
+        <tr>
+          <td>N4505</td>
+          <td>Working Draft, Technical Specification for C++ Extensions for Parallelism</td>
+          <td><cxx-ref to="parallel.alg"</cxx-ref></td>
+          <td><code>__cpp_lib_experimental_parallel_algorithm</code></td>
+          <td>201505</td>
+          <td>
+            <code>&lt;experimental/algorithm&gt;</code><br>
+            <code>&lt;experimental/exception_list&gt;</code><br>
+            <code>&lt;experimental/execution_policy&gt;</code><br>
+            <code>&lt;experimental/numeric&gt;</code>
+          </td>
+        </tr>
+        <tr>
+          <td>P0155R0</td>
+          <td>Task Block R5</td>
+          <td><cxx-ref to="parallel.task_block"></cxx-ref></td>
+          <td><code>__cpp_lib_experimental_parallel_task_block</code></td>
+          <td>201510</td>
+          <td>
+            <code>&lt;experimental/task_block&gt;</code><br>
+          </td>
+        </tr>
+        <tr>
+          <td>P0076R4</td>
+          <td>Vector and Wavefront Policies</td>
+          <td><cxx-ref to="parallel.execpol.unseq"</cxx-ref>, <cxx-ref to="parallel.execpol.vec"</cxx-ref></td>
+          <td><code>__cpp_lib_experimental_execution_vector_policy</code></td>
+          <td>201707</td>
+          <td>
+            <code>&lt;experimental/algorithm&gt;</code><br>
+            <code>&lt;experimental/execution&gt;</code><br>
+          </td>
+        </tr>
       </thead>
     </table>
-  </ins>
+    <ins>
   </cxx-section>
 </cxx-clause>
 
diff --git a/main.html b/main.html
index c447c4e..8631972 100644
--- a/main.html
+++ b/main.html
@@ -8,10 +8,14 @@
 <body unresolved="">
 
 <cxx-include href="front_matter.html"></cxx-include>
+<cxx-include href="scope.html"></cxx-include>
+<cxx-include href="normative_references.html"></cxx-include>
+<cxx-include href="terms_and_definitions.html"></cxx-include>
 <cxx-include href="general.html"></cxx-include>
 <cxx-include href="execution_policies.html"></cxx-include>
 <cxx-include href="exceptions.html"></cxx-include>
 <cxx-include href="algorithms.html"></cxx-include>
+<cxx-include href="task_block.html"></cxx-include>
 <cxx-publish-button source="https://github.com/cplusplus/parallelism-ts"></cxx-publish-button>
 
 </body>
diff --git a/normative_references.html b/normative_references.html
new file mode 100644
index 0000000..4aad4a1
--- /dev/null
+++ b/normative_references.html
@@ -0,0 +1,25 @@
+<cxx-clause id="parallel.references">
+  <h1>Normative references</h1>
+
+  <p>The following referenced document is indispensable for the
+  application of this document. For dated references, only the
+  edition cited applies. For undated references, the latest edition
+  of the referenced document (including any amendments) applies.</p>
+
+  <ul>
+    <li>ISO/IEC 14882:—<cxx-footnote>To be published. Section references are relative to <a href="http://www.open-std.org/jtc1/sc22/wg21/prot/14882fdis/n3937.pdf">N3937</a>.</cxx-footnote>,
+    <cite>Programming Languages — C++</cite>
+    <cxx-foreign-index id="cxx" src="cxx_N3797_index.json" name="C++14"></cxx-foreign-index></li>
+  </ul>
+
+  <p>ISO/IEC 14882:— is herein called the <dfn>C++ Standard</dfn>.
+  The library described in ISO/IEC 14882:— clauses 17-30 is herein called
+  the <dfn>C++ Standard Library</dfn>. The C++ Standard Library components described in
+  ISO/IEC 14882:— clauses 25, 26.7 and 20.7.2 are herein called the <dfn>C++ Standard
+  Algorithms Library</dfn>.</p>
+
+  <p>Unless otherwise specified, the whole of the C++ Standard's Library
+  introduction (<cxx-ref in="cxx" to="library"></cxx-ref>) is included into this
+  Technical Specification by reference.</p>
+</cxx-clause>
+
diff --git a/parallelism-ts.html b/parallelism-ts.html
index 3097044..8857fdf 100644
--- a/parallelism-ts.html
+++ b/parallelism-ts.html
@@ -1,6 +1,7 @@
 <!DOCTYPE html>
 <!-- Sources at https://github.com/cplusplus/parallelism-ts -->
-<html><head><!--[if lte IE 8]><script>document.createElement("nav");document.createElement("section");document.createElement("time");document.createElement("CXX-TITLEPAGE");document.createElement("CXX-DOCNUM");document.createElement("CXX-REVISES");document.createElement("CXX-EDITOR");document.createElement("CXX-EMAIL");document.createElement("CXX-TOC");document.createElement("CXX-CLAUSE");document.createElement("CXX-SECTION");document.createElement("CXX-FOOTNOTE");document.createElement("CXX-FOREIGN-INDEX");document.createElement("CXX-REF");document.createElement("CXX-NOTE");document.createElement("CXX-EXAMPLE");document.createElement("CXX-FUNCTION");document.createElement("CXX-SIGNATURE");document.createElement("CXX-EFFECTS");document.createElement("CXX-REMARKS");document.createElement("CXX-RETURNS");document.createElement("CXX-REQUIRES");document.createElement("CXX-COMPLEXITY");document.createElement("CXX-NOTES");document.createElement("CXX-PUBLISH-BUTTON");</script><![endif]--><style>template {display: none !important;} /* injected by platform.js */</style><style>body {transition: opacity ease-in 0.2s; } 
+<html><head>
+<meta http-equiv="content-type" content="text/html; charset=UTF-8"><!--[if lte IE 8]><script>document.createElement("nav");document.createElement("section");document.createElement("time");document.createElement("CXX-TITLEPAGE");document.createElement("CXX-DOCNUM");document.createElement("CXX-REVISES");document.createElement("CXX-EDITOR");document.createElement("CXX-EMAIL");document.createElement("CXX-TOC");document.createElement("CXX-CLAUSE");document.createElement("CXX-SECTION");document.createElement("CXX-FOOTNOTE");document.createElement("CXX-FOREIGN-INDEX");document.createElement("CXX-REF");document.createElement("CXX-NOTE");document.createElement("CXX-EXAMPLE");document.createElement("CXX-FUNCTION");document.createElement("CXX-SIGNATURE");document.createElement("CXX-EFFECTS");document.createElement("CXX-REMARKS");document.createElement("CXX-RETURNS");document.createElement("CXX-REQUIRES");document.createElement("CXX-COMPLEXITY");document.createElement("CXX-NOTES");document.createElement("CXX-PRECONDITIONS");document.createElement("CXX-THROWS");document.createElement("CXX-POSTCONDITIONS");document.createElement("CXX-PUBLISH-BUTTON");</script><![endif]--><style>template {display: none !important;} /* injected by platform.js */</style><style>body {transition: opacity ease-in 0.2s; } 
 body[unresolved] {opacity: 0; display: block; overflow: hidden; position: relative; } 
 </style><style shim-shadowdom-css="">style { display: none !important; }
 cxx-function {
@@ -953,7 +954,7 @@
   box-shadow: 0px 0px 0px 1px rgba(0, 0, 0, 0.1);
   border-radius: 5px 5px 5px 5px;
 }</style>
-<title>Technical Specification for C++ Extensions for Parallelism, Working Draft</title></head>
+<title>Technical Specification for C++ Extensions for Parallelism Version 2, Working Draft</title></head>
 <body class="cxx-draft">
 
 <cxx-titlepage stage="draft">
@@ -962,13 +963,13 @@
       <div class="page">
         <table class="header">
           
-            <tr><th>Document Number:</th><td><cxx-docnum class="docname">N4505</cxx-docnum></td></tr>
+            <tbody><tr><th>Document Number:</th><td><cxx-docnum class="docname">N4578</cxx-docnum></td></tr>
           
           
-            <tr><th>Date:</th><td><time pubdate=""><span class="pubyear">2015</span>-05-05</time></td></tr>
+            <tr><th>Date:</th><td><time pubdate=""><span class="pubyear">2016</span>-02-22</time></td></tr>
           
           
-            <tr><th>Revises:</th><td><cxx-revises><a href="http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2015/n4407.html">N4407</a></cxx-revises></td></tr>
+            <tr><th>Revises:</th><td><cxx-revises><a href="http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2015/n4505.html">N4505</a></cxx-revises></td></tr>
           
           
             <tr><th>Editor:</th><td><cxx-editor>
@@ -977,8 +978,8 @@
     <cxx-email><a href="mailto:jhoberock@nvidia.com">jhoberock@nvidia.com</a></cxx-email>
   </cxx-editor></td></tr>
           
-        </table>
-        <h1>Working Draft, Technical Specification for C++ Extensions for Parallelism</h1>
+        </tbody></table>
+        <h1>Working Draft, Technical Specification for C++ Extensions for Parallelism Version 2</h1>
         <p class="warning"><strong>Note: this is an early draft. It’s known to be
         incomplet and incorrekt, and it has lots of b<span style="margin-left: -1.2pt; margin-right: 1pt">a</span>d<span style="width:1.5em"> </span>for<span style="margin-left:-3pt; margin-right:0.6pt">mat</span>ti<span style="position:relative; top:-0.15ex">n</span>g.</strong></p>
       </div>
@@ -1175,6 +1176,54 @@ <h1>Contents</h1>
             
           </ol>
         
+      </li>
+            
+              <li><span class="marker">5</span><a href="#parallel.task_block">Task Block</a>
+        
+          <ol>
+            
+              <li><span class="marker">5.1</span><a href="#parallel.task_block.synopsis">Header &lt;experimental/task_block&gt; synopsis</a>
+        
+      </li>
+            
+              <li><span class="marker">5.2</span><a href="#parallel.task_block.task_cancelled_exception">Class task_cancelled_exception</a>
+        
+          <ol>
+            
+              <li><span class="marker">5.2.1</span><a href="#parallel.task_block.task_cancelled_exception.what">task_cancelled_exception member function what</a>
+        
+      </li>
+            
+          </ol>
+        
+      </li>
+            
+              <li><span class="marker">5.3</span><a href="#parallel.task_block.class">Class task_block</a>
+        
+          <ol>
+            
+              <li><span class="marker">5.3.1</span><a href="#parallel.task_block.class.run">task_block member function template run</a>
+        
+      </li>
+            
+              <li><span class="marker">5.3.2</span><a href="#parallel.task_block.class.wait">task_block member function wait</a>
+        
+      </li>
+            
+          </ol>
+        
+      </li>
+            
+              <li><span class="marker">5.4</span><a href="#parallel.task_block.define_task_block">Function template define_task_block</a>
+        
+      </li>
+            
+              <li><span class="marker">5.5</span><a href="#parallel.task_block.exceptions">Exception Handling</a>
+        
+      </li>
+            
+          </ol>
+        
       </li>
             
           </ol>
@@ -1264,7 +1313,7 @@ <h1>Contents</h1>
     experimental and not part of the C++ Standard Library, they should not be
     declared directly within namespace <code>std</code>. Unless otherwise specified, all
     components described in this Technical Specification are declared in namespace 
-    <code>std::experimental::parallel::v1</code>.</p>
+    <code>std::experimental::parallel::<ins>v2</ins><del>v1</del></code>.</p>
 
     <cxx-note><span class="nowrap">[ <em>Note:</em></span>
     
@@ -1275,7 +1324,7 @@ <h1>Contents</h1>
 
     <p para_num="2" id="parallel.general.namespaces.2">Unless otherwise specified, references to such entities described in this
     Technical Specification are assumed to be qualified with
-    <code>std::experimental::parallel::v1</code>, and references to entities described in the C++
+    <code>std::experimental::parallel::<ins>v2</ins><del>v1</del></code>, and references to entities described in the C++
     Standard Library are assumed to be qualified with <code>std::</code>.</p>
 
     <p para_num="3" id="parallel.general.namespaces.3">Extensions that are expected to eventually be added to an existing header
@@ -1298,7 +1347,7 @@ <h1>Contents</h1>
   
     <p para_num="1" id="parallel.general.defns.1">For the purposes of this document, the terms and definitions given in the C++ Standard and the following apply.</p>
   
-    <p para_num="2" id="parallel.general.defns.2">A <dfn>parallel algorithm</dfn> is a function template described by this Technical Specification declared in namespace <code>std::experimental::parallel::v1</code> with a formal template parameter named <code>ExecutionPolicy</code>.</p>
+    <p para_num="2" id="parallel.general.defns.2">A <dfn>parallel algorithm</dfn> is a function template described by this Technical Specification declared in namespace <code>std::experimental::parallel::<ins>v2</ins><del>v1</del></code> with a formal template parameter named <code>ExecutionPolicy</code>.</p>
   
     <p para_num="3" id="parallel.general.defns.3">
       Parallel algorithms access objects indirectly accessible via their arguments by invoking the following functions:
@@ -1316,7 +1365,7 @@ <h1>Contents</h1>
           User-provided function objects to be applied during the execution of the algorithm, if required by the specification.
         </li>
 
-        <ins><li>
+        <li>
           Operations on those function objects required by the specification.
 
           <cxx-note><span class="nowrap">[ <em>Note:</em></span>
@@ -1325,7 +1374,7 @@ <h1>Contents</h1>
           
     <span class="nowrap">— <em>end note</em> ]</span>
   </cxx-note>
-        </li></ins>
+        </li>
       </ul>
   
       These functions are herein called <em>element access functions</em>.
@@ -1361,11 +1410,10 @@ <h1>Contents</h1>
     
 
     <section>
-      <header><span class="section-number">1.5</span>  <span style="float:right"><a href="#parallel.general.features">[parallel.general.features]</a></span></header>
+      <header><span class="section-number">1.5</span> <h1 data-bookmark-label="1.5 Feature-testing recommendations">Feature-testing recommendations</h1> <span style="float:right"><a href="#parallel.general.features">[parallel.general.features]</a></span></header>
       
-  <ins>
-    <h1>Feature-testing recommendations</h1>     
-    <p>An implementation that provides support for this Technical Specification shall define the feature test macro(s) in Table 1.</p>
+         
+    <p para_num="1" id="parallel.general.features.1">An implementation that provides support for this Technical Specification shall define the feature test macro(s) in Table 1.</p>
 
     <table is="cxx-table" class="column-rules">
     
@@ -1390,10 +1438,16 @@ <h1>Feature-testing recommendations</h1>
             <code>&lt;experimental/numeric&gt;</code>
           </td>
         </tr>
+        <tr>
+          <td><code>__cpp_lib_experimental_parallel_task_block</code></td>
+          <td>201510</td>
+          <td>
+            <code>&lt;experimental/task_block&gt;</code><br>
+          </td>
+        </tr>
       </thead>
     
   </table>
-  </ins>
   
     </section>
   </cxx-section>
@@ -1472,14 +1526,14 @@ <h1>Feature-testing recommendations</h1>
     
 
     <section>
-      <header><span class="section-number">2.2</span> <h1 data-bookmark-label="2.2 Header <experimental/execution_policy> synopsis">Header <code>&lt;experimental/execution_policy&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.execpol.synopsis">[parallel.execpol.synopsis]</a></span></header>
+      <header><span class="section-number">2.2</span> <h1 data-bookmark-label="2.2 Header &lt;experimental/execution_policy&gt; synopsis">Header <code>&lt;experimental/execution_policy&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.execpol.synopsis">[parallel.execpol.synopsis]</a></span></header>
       
   
 
 <pre>namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace <ins>v2</ins><del>v1</del> {
   <cxx-ref insynopsis="" to="parallel.execpol.type">// <i><a title="parallel.execpol.type" href="#parallel.execpol.type">2.3</a>, Execution policy type trait</i></cxx-ref>
   template&lt;class T&gt; struct is_execution_policy;
   template&lt;class T&gt; constexpr bool is_execution_policy_v = is_execution_policy&lt;T&gt;::value;
@@ -1514,7 +1568,10 @@ <h1>Feature-testing recommendations</h1>
 <pre>template&lt;class T&gt; struct is_execution_policy { <em>see below</em> };
 </pre>
 
-    <p para_num="1" id="parallel.execpol.type.1"><code>is_execution_policy</code> can be used to detect parallel execution policies for the purpose of excluding function signatures from otherwise ambiguous overload resolution participation.</p>
+    <p para_num="1" id="parallel.execpol.type.1"><code>is_execution_policy</code>
+ can be used to detect parallel execution policies for the purpose of 
+excluding function signatures from otherwise ambiguous overload 
+resolution participation.</p>
     
     <p para_num="2" id="parallel.execpol.type.2"><code>is_execution_policy&lt;T&gt;</code> shall be a UnaryTypeTrait with a BaseCharacteristic of <code>true_type</code> if <code>T</code> is the type of a standard or implementation-defined execution policy, otherwise <code>false_type</code>.
 
@@ -1543,7 +1600,10 @@ <h1>Feature-testing recommendations</h1>
     <pre>class sequential_execution_policy{ <i>unspecified</i> };
 </pre>
 
-    <p para_num="1" id="parallel.execpol.seq.1">The class <code>sequential_execution_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and require that a parallel algorithm's execution may not be parallelized.</p>
+    <p para_num="1" id="parallel.execpol.seq.1">The class <code>sequential_execution_policy</code>
+ is an execution policy type used as a unique type to disambiguate 
+parallel algorithm overloading and require that a parallel algorithm's 
+execution may not be parallelized.</p>
 
   
     </section>
@@ -1559,7 +1619,10 @@ <h1>Feature-testing recommendations</h1>
 <pre>class parallel_execution_policy{ <i>unspecified</i> };
 </pre>
 
-    <p para_num="1" id="parallel.execpol.par.1">The class <code>parallel_execution_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and indicate that a parallel algorithm's execution may be parallelized.</p>
+    <p para_num="1" id="parallel.execpol.par.1">The class <code>parallel_execution_policy</code>
+ is an execution policy type used as a unique type to disambiguate 
+parallel algorithm overloading and indicate that a parallel algorithm's 
+execution may be parallelized.</p>
 
   
     </section>
@@ -1575,7 +1638,10 @@ <h1>Feature-testing recommendations</h1>
 <pre>class parallel_vector_execution_policy{ <i>unspecified</i> };
 </pre>
 
-    <p para_num="1" id="parallel.execpol.vec.1">The class <code>class parallel_vector_execution_policy</code> is an execution policy type used as a unique type to disambiguate parallel algorithm overloading and indicate that a parallel algorithm's execution may be vectorized and parallelized.</p>
+    <p para_num="1" id="parallel.execpol.vec.1">The class <code>class parallel_vector_execution_policy</code>
+ is an execution policy type used as a unique type to disambiguate 
+parallel algorithm overloading and indicate that a parallel algorithm's 
+execution may be vectorized and parallelized.</p>
 
   
     </section>
@@ -1781,7 +1847,7 @@ <h1>Feature-testing recommendations</h1>
         </p>
         <p para_num="2" id="parallel.exceptions.behavior.2">
           During the execution of a standard parallel algorithm, if the invocation of an element access function
-          <ins>exits via</ins><del>terminates with</del> an uncaught exception, the behavior of the program is determined by the type of
+          exits via an uncaught exception, the behavior of the program is determined by the type of
           execution policy used to invoke the algorithm:
 
           </p><ul>
@@ -1791,34 +1857,37 @@ <h1>Feature-testing recommendations</h1>
             </li>
             <li>
               If the execution policy object is of type <code>sequential_execution_policy</code> or
-              <code>parallel_execution_policy</code>, the execution of the algorithm <ins>exits via</ins><del>terminates with</del> an
-              <del><code>exception_list</code></del> exception. <ins>The exception shall be an <code>exception_list</code> containing all</ins><del>All</del> uncaught exceptions thrown during
-              the invocations of element access functions<ins>, or optionally the uncaught exception if there was only one</ins><del>shall be contained in the
-              <code>exception_list</code></del>.<pre></pre>
+              <code>parallel_execution_policy</code>, the execution of the algorithm exits via an
+               exception. The exception shall be an <code>exception_list</code> containing all uncaught exceptions thrown during
+              the invocations of element access functions, or optionally the uncaught exception if there was only one.<pre></pre>
 
               <cxx-note><span class="nowrap">[ <em>Note:</em></span>
     
-                For example, <del>the number of invocations of the user-provided function object in
-                <code>for_each</code> is unspecified. W</del><ins>w</ins>hen <code>for_each</code> is executed sequentially,
-                <ins>if an invocation of the user-provided function object throws an exception, <code>for_each</code> can exit via the uncaught exception, or throw an <code>exception_list</code> containing the original exception.
-                <del>only one exception will be contained in the <code>exception_list</code> object.</del>
-              </ins>
+                For example, when <code>for_each</code> is executed sequentially,
+                if an invocation of the user-provided function object throws an exception, <code>for_each</code> can exit via the uncaught exception, or throw an <code>exception_list</code> containing the original exception.
+              
     <span class="nowrap">— <em>end note</em> ]</span>
   </cxx-note><pre></pre>
 
               <cxx-note><span class="nowrap">[ <em>Note:</em></span>
     
                 These guarantees imply that, unless the algorithm has failed to allocate memory and
-                <ins>exits via</ins><del>terminated with</del> <code>std::bad_alloc</code>, all exceptions thrown during the execution of
-                the algorithm are communicated to the caller. It is unspecified whether an algorithm implementation will "forge ahead" after 
+                exits via <code>std::bad_alloc</code>, all exceptions 
+thrown during the execution of
+                the algorithm are communicated to the caller. It is 
+unspecified whether an algorithm implementation will "forge ahead" after
+ 
                 encountering and capturing a user exception.
               
     <span class="nowrap">— <em>end note</em> ]</span>
   </cxx-note><pre></pre>
               <cxx-note><span class="nowrap">[ <em>Note:</em></span>
     
-                The algorithm may <ins>exit via</ins><del>terminate with</del> the <code>std::bad_alloc</code> exception even if one or more
-                user-provided function objects have <ins>exited via</ins><del>terminated with</del> an exception. For example, this can happen when an algorithm fails to allocate memory while
+                The algorithm may exit via the <code>std::bad_alloc</code>
+ exception even if one or more
+                user-provided function objects have exited via an 
+exception. For example, this can happen when an algorithm fails to 
+allocate memory while
                 creating or adding elements to the <code>exception_list</code> object.
               
     <span class="nowrap">— <em>end note</em> ]</span>
@@ -1837,14 +1906,13 @@ <h1>Feature-testing recommendations</h1>
     
 
     <section>
-      <header><span class="section-number">3.2</span> <h1 data-bookmark-label="3.2 Header <experimental/exception_list> synopsis">Header <code>&lt;experimental/exception_list&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.exceptions.synopsis">[parallel.exceptions.synopsis]</a></span></header>
+      <header><span class="section-number">3.2</span> <h1 data-bookmark-label="3.2 Header &lt;experimental/exception_list&gt; synopsis">Header <code>&lt;experimental/exception_list&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.exceptions.synopsis">[parallel.exceptions.synopsis]</a></span></header>
       
       
-      <pre>
-namespace std {
+      <pre>namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace <ins>v2</ins><del>v1</del> {
 
   class exception_list : public exception
   {
@@ -2403,14 +2471,14 @@ <h1>Feature-testing recommendations</h1>
     
 
     <section>
-      <header><span class="section-number">4.3.1</span> <h1 data-bookmark-label="4.3.1 Header <experimental/algorithm> synopsis">Header <code>&lt;experimental/algorithm&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.alg.ops.synopsis">[parallel.alg.ops.synopsis]</a></span></header>
+      <header><span class="section-number">4.3.1</span> <h1 data-bookmark-label="4.3.1 Header &lt;experimental/algorithm&gt; synopsis">Header <code>&lt;experimental/algorithm&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.alg.ops.synopsis">[parallel.alg.ops.synopsis]</a></span></header>
       
       
 
       <pre>namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace <ins>v2</ins><del>v1</del> {
   template&lt;class ExecutionPolicy,
            class InputIterator, class Function&gt;
     void for_each(ExecutionPolicy&amp;&amp; exec,
@@ -2629,14 +2697,14 @@ <h1>Feature-testing recommendations</h1>
     
 
     <section>
-      <header><span class="section-number">4.4.1</span> <h1 data-bookmark-label="4.4.1 Header <experimental/numeric> synopsis">Header <code>&lt;experimental/numeric&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.alg.numeric.synopsis">[parallel.alg.numeric.synopsis]</a></span></header>
+      <header><span class="section-number">4.4.1</span> <h1 data-bookmark-label="4.4.1 Header &lt;experimental/numeric&gt; synopsis">Header <code>&lt;experimental/numeric&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.alg.numeric.synopsis">[parallel.alg.numeric.synopsis]</a></span></header>
       
       
 
       <pre>namespace std {
 namespace experimental {
 namespace parallel {
-inline namespace v1 {
+inline namespace <ins>v2</ins><del>v1</del> {
   template&lt;class InputIterator&gt;
     typename iterator_traits&lt;InputIterator&gt;::value_type
       reduce(InputIterator first, InputIterator last);
@@ -3275,6 +3343,469 @@ <h1>Feature-testing recommendations</h1>
     </section>
   </cxx-clause>
 
+<ins>
+<cxx-clause id="parallel.task_block">
+    
+
+    <section>
+      <header><span class="section-number">5</span> <h1 data-bookmark-label="5 Task Block">Task Block</h1> <span style="float:right"><a href="#parallel.task_block">[parallel.task_block]</a></span></header>
+      
+  
+
+   <cxx-section id="parallel.task_block.synopsis">
+    
+
+    <section>
+      <header><span class="section-number">5.1</span> <h1 data-bookmark-label="5.1 Header &lt;experimental/task_block&gt; synopsis">Header <code>&lt;experimental/task_block&gt;</code> synopsis</h1> <span style="float:right"><a href="#parallel.task_block.synopsis">[parallel.task_block.synopsis]</a></span></header>
+      
+     
+
+     <pre>namespace std {
+namespace experimental {
+namespace parallel {
+inline namespace v2 {
+  class task_cancelled_exception;
+
+  class task_block;
+
+  template&lt;class F&gt;
+    void define_task_block(F&amp;&amp; f);
+
+  template&lt;class f&gt;
+    void define_task_block_restore_thread(F&amp;&amp; f);
+}
+}
+}
+}
+     </pre>
+   
+    </section>
+  </cxx-section>
+
+   <cxx-section id="parallel.task_block.task_cancelled_exception">
+    
+
+    <section>
+      <header><span class="section-number">5.2</span> <h1 data-bookmark-label="5.2 Class task_cancelled_exception">Class <code>task_cancelled_exception</code></h1> <span style="float:right"><a href="#parallel.task_block.task_cancelled_exception">[parallel.task_block.task_cancelled_exception]</a></span></header>
+      
+     
+     <pre>namespace std {
+namespace experimental {
+namespace parallel
+inline namespace v2 {
+
+  class task_cancelled_exception : public exception
+  {
+    public:
+      task_cancelled_exception() noexcept;
+      virtual const char* what() const noexcept;
+  };
+}
+}
+}
+}
+     </pre>
+
+     <p para_num="1" id="parallel.task_block.task_cancelled_exception.1">
+       The class <code>task_cancelled_exception</code> defines the type of objects thrown by
+       <code>task_block::run</code> or <code>task_block::wait</code> if they detect than an
+       exception is pending within the current parallel block. See <cxx-ref to="parallel.task_block.exceptions"><a title="parallel.task_block.exceptions" href="#parallel.task_block.exceptions">5.5</a></cxx-ref>, below.
+     </p>
+
+     <cxx-section id="parallel.task_block.task_cancelled_exception.what">
+    
+
+    <section>
+      <header><span class="section-number">5.2.1</span> <h1 data-bookmark-label="5.2.1 task_cancelled_exception member function what"><code>task_cancelled_exception</code> member function <code>what</code></h1> <span style="float:right"><a href="#parallel.task_block.task_cancelled_exception.what">[parallel.task_block.task_cancelled_exception.what]</a></span></header>
+      
+       
+
+       <cxx-function para_num="1" id="parallel.task_block.task_cancelled_exception.what.1">
+    
+    <pre><code><cxx-signature>virtual const char* what() const noexcept</cxx-signature></code></pre>
+
+    <dl>
+      
+         
+
+         <cxx-returns para_num="2" id="parallel.task_block.task_cancelled_exception.what.2">
+    
+    <dt>Returns:</dt><dd>
+           An implementation-defined NTBS.
+         </dd>
+  </cxx-returns>
+       
+    </dl>
+  </cxx-function>
+     
+    </section>
+  </cxx-section>
+   
+    </section>
+  </cxx-section>
+
+   <cxx-section id="parallel.task_block.class">
+    
+
+    <section>
+      <header><span class="section-number">5.3</span> <h1 data-bookmark-label="5.3 Class task_block">Class <code>task_block</code></h1> <span style="float:right"><a href="#parallel.task_block.class">[parallel.task_block.class]</a></span></header>
+      
+     
+     <pre>namespace std {
+namespace experimental {
+namespace parallel {
+inline namespace v2 {
+
+  class task_block
+  {
+    private:
+      ~task_block();
+
+    public:
+      task_block(const task_block&amp;) = delete;
+      task_block&amp; operator=(const task_block&amp;) = delete;
+      void operator&amp;() const = delete;
+
+      template&lt;class F&gt;
+        void run(F&amp;&amp; f);
+
+      void wait();
+  };
+}
+}
+}
+}
+     </pre>
+
+     <p para_num="1" id="parallel.task_block.class.1">
+       The class <code>task_block</code> defines an interface for forking and joining parallel tasks. The <code>define_task_block</code> and <code>define_task_block_restore_thread</code> function templates create an object of type <code>task_block</code> and pass a reference to that object to a user-provided function object.
+     </p>
+
+     <p para_num="2" id="parallel.task_block.class.2">
+       An object of class <code>task_block</code> cannot be constructed,
+ destroyed, copied, or moved except by the implementation of the task 
+block library. Taking the address of a <code>task_block</code> object via <code>operator&amp;</code> is ill-formed. Obtaining its address by any other means (including <code>addressof</code>) results in a pointer with an unspecified value; dereferencing such a pointer results in undefined behavior.
+     </p>
+
+     <p para_num="3" id="parallel.task_block.class.3">
+       A <code>task_block</code> is <em>active</em> if it was created by the nearest enclosing task block, where “task block” refers to an
+       invocation of <code>define_task_block</code> or <code>define_task_block_restore_thread</code> and “nearest enclosing” means the most
+       recent invocation that has not yet completed. Code designated for execution in another thread by means other
+       than the facilities in this section (e.g., using <code>thread</code> or <code>async</code>) are not enclosed in the task block and a
+       <code>task_block</code> passed to (or captured by) such code is not active within that code. Performing any operation on a
+       <code>task_block</code> that is not active results in undefined behavior.
+     </p>
+
+     <p para_num="4" id="parallel.task_block.class.4">
+       When the argument to <code>task_block::run</code> is called, no <code>task_block</code> is active, not even the <code>task_block</code> on which <code>run</code> was called.
+       (The function object should not, therefore, capture a <code>task_block</code> from the surrounding block.)
+     </p>
+
+     <cxx-example>
+    
+    <span class="nowrap">[ <em>Example:</em></span>
+    
+     <pre>define_task_block([&amp;](auto&amp; tb) {
+  tb.run([&amp;]{
+    tb.run([] { f(); });               // Error: tb is not active within run
+    define_task_block([&amp;](auto&amp; tb2) { // Define new task block
+      tb2.run(f);
+      ...
+    });
+  });
+  ...
+});
+     </pre>
+     
+    <span class="nowrap">— <em>end example</em> ]</span>
+  </cxx-example><pre></pre>
+
+     <cxx-note><span class="nowrap">[ <em>Note:</em></span>
+    
+       Implementations are encouraged to diagnose the above error at translation time.
+     
+    <span class="nowrap">— <em>end note</em> ]</span>
+  </cxx-note>
+
+     <cxx-section id="parallel.task_block.class.run">
+    
+
+    <section>
+      <header><span class="section-number">5.3.1</span> <h1 data-bookmark-label="5.3.1 task_block member function template run"><code>task_block</code> member function template <code>run</code></h1> <span style="float:right"><a href="#parallel.task_block.class.run">[parallel.task_block.class.run]</a></span></header>
+      
+     
+       
+  
+       <cxx-function para_num="1" id="parallel.task_block.class.run.1">
+    
+    <pre><code><cxx-signature>template&lt;class F&gt; void run(F&amp;&amp; f);</cxx-signature></code></pre>
+
+    <dl>
+      
+         
+
+         <cxx-requires para_num="2" id="parallel.task_block.class.run.2">
+    
+    <dt>Requires:</dt><dd>
+           <code>F</code> shall be <code>MoveConstructible</code>. <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))()</code> shall be a valid expression.
+         </dd>
+  </cxx-requires>
+  
+         <cxx-preconditions para_num="3" id="parallel.task_block.class.run.3">
+    
+    <dt>Preconditions:</dt><dd>
+           <code>*this</code> shall be the active <code>task_block</code>.
+         </dd>
+  </cxx-preconditions>
+  
+         <cxx-effects para_num="4" id="parallel.task_block.class.run.4">
+    
+    <dt>Effects:</dt><dd>
+           Evaluates <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))()</code>, where <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))</code>
+           is evaluated synchronously within the current thread. The call to the resulting copy of the function object is
+           permitted to run on an unspecified thread created by the implementation in an unordered fashion relative to
+           the sequence of operations following the call to <code>run(f)</code> (the continuation), or indeterminately sequenced
+           within the same thread as the continuation. The call to <code>run</code> synchronizes with the call to the function
+           object. The completion of the call to the function object synchronizes with the next invocation of <code>wait</code> on
+           the same <code>task_block</code> or completion of the nearest enclosing task block (i.e., the <code>define_task_block</code> or
+           <code>define_task_block_restore_thread</code> that created this <code>task_block</code>).
+         </dd>
+  </cxx-effects>
+  
+         <cxx-throws para_num="5" id="parallel.task_block.class.run.5">
+    
+    <dt>Throws:</dt><dd>
+           <code>task_cancelled_exception</code>, as described in <cxx-ref to="parallel.task_block.exceptions"><a title="parallel.task_block.exceptions" href="#parallel.task_block.exceptions">5.5</a></cxx-ref>.
+         </dd>
+  </cxx-throws>
+  
+         <cxx-remarks para_num="6" id="parallel.task_block.class.run.6">
+    
+    <dt>Remarks:</dt><dd>
+           The <code>run</code> function may return on a thread other than the one on which it was called; in such cases,
+           completion of the call to <code>run</code> synchronizes with the continuation.
+           
+           <cxx-note><span class="nowrap">[ <em>Note:</em></span>
+     The return from <code>run</code> is ordered similarly to an ordinary function call in a single thread.
+    <span class="nowrap">— <em>end note</em> ]</span>
+  </cxx-note>
+         </dd>
+  </cxx-remarks>
+  
+         <cxx-remarks para_num="7" id="parallel.task_block.class.run.7">
+    
+    <dt>Remarks:</dt><dd>
+           The invocation of the user-supplied function object <code>f</code> may be immediate or may be delayed until
+           compute resources are available. <code>run</code> might or might not return before the invocation of <code>f</code> completes.
+         </dd>
+  </cxx-remarks>
+  
+       
+    </dl>
+  </cxx-function>
+     
+    </section>
+  </cxx-section>
+
+     <cxx-section id="parallel.task_block.class.wait">
+    
+
+    <section>
+      <header><span class="section-number">5.3.2</span> <h1 data-bookmark-label="5.3.2 task_block member function wait"><code>task_block</code> member function <code>wait</code></h1> <span style="float:right"><a href="#parallel.task_block.class.wait">[parallel.task_block.class.wait]</a></span></header>
+      
+
+       
+
+       <cxx-function para_num="1" id="parallel.task_block.class.wait.1">
+    
+    <pre><code><cxx-signature>void wait();</cxx-signature></code></pre>
+
+    <dl>
+      
+         
+
+         <cxx-preconditions para_num="2" id="parallel.task_block.class.wait.2">
+    
+    <dt>Preconditions:</dt><dd><code>*this</code> shall be the active <code>task_block</code>.</dd>
+  </cxx-preconditions>
+
+         <cxx-effects para_num="3" id="parallel.task_block.class.wait.3">
+    
+    <dt>Effects:</dt><dd>
+           Blocks until the tasks spawned using this <code>task_block</code> have completed.
+         </dd>
+  </cxx-effects>
+
+         <cxx-throws para_num="4" id="parallel.task_block.class.wait.4">
+    
+    <dt>Throws:</dt><dd>
+           <code>task_cancelled_exception</code>, as described in <cxx-ref to="parallel.task_block.exceptions"><a title="parallel.task_block.exceptions" href="#parallel.task_block.exceptions">5.5</a></cxx-ref>.
+         </dd>
+  </cxx-throws>
+
+         <cxx-postconditions para_num="5" id="parallel.task_block.class.wait.5">
+    
+    <dt>Postconditions:</dt><dd>
+           All tasks spawned by the nearest enclosing task block have completed.
+         </dd>
+  </cxx-postconditions>
+
+         <cxx-remarks para_num="6" id="parallel.task_block.class.wait.6">
+    
+    <dt>Remarks:</dt><dd>
+           The <code>wait</code> function may return on a thread other than the one on which it was called; in such cases, completion of the call to <code>wait</code> synchronizes with subsequent operations.
+           
+           <cxx-note><span class="nowrap">[ <em>Note:</em></span>
+    The return from wait is ordered similarly to an ordinary function call in a single thread.
+    <span class="nowrap">— <em>end note</em> ]</span>
+  </cxx-note>
+
+           <cxx-example>
+    
+    <span class="nowrap">[ <em>Example:</em></span>
+    <pre>define_task_block([&amp;](auto&amp; tb) {
+  tb.run([&amp;]{ process(a, w, x); }); // Process a[w] through a[x]
+  if (y &lt; x) tb.wait();             // Wait if overlap between [w,x) and [y,z)
+  process(a, y, z);                 // Process a[y] through a[z]
+});
+</pre>
+           
+    <span class="nowrap">— <em>end example</em> ]</span>
+  </cxx-example>
+         </dd>
+  </cxx-remarks>
+       
+    </dl>
+  </cxx-function>
+     
+    </section>
+  </cxx-section>
+   
+    </section>
+  </cxx-section>
+
+   <cxx-section id="parallel.task_block.define_task_block">
+    
+
+    <section>
+      <header><span class="section-number">5.4</span> <h1 data-bookmark-label="5.4 Function template define_task_block">Function template <code>define_task_block</code></h1> <span style="float:right"><a href="#parallel.task_block.define_task_block">[parallel.task_block.define_task_block]</a></span></header>
+      
+     
+
+     <cxx-function para_num="1" id="parallel.task_block.define_task_block.1">
+    
+    <pre><code><cxx-signature>template&lt;class F&gt;
+void define_task_block(F&amp;&amp; f);
+       </cxx-signature><cxx-signature>template&lt;class F&gt;
+void define_task_block_restore_thread(F&amp;&amp; f);
+       </cxx-signature></code></pre>
+
+    <dl>
+      
+       
+
+       
+
+       <cxx-requires para_num="2" id="parallel.task_block.define_task_block.2">
+    
+    <dt>Requires:</dt><dd>
+         Given an lvalue <code>tb</code> of type <code>task_block</code>, the expression <code>f(tb)</code> shall be well-formed
+       </dd>
+  </cxx-requires>
+
+       <cxx-effects para_num="3" id="parallel.task_block.define_task_block.3">
+    
+    <dt>Effects:</dt><dd>
+         Constructs a <code>task_block</code> <code>tb</code> and calls <code>f(tb)</code>.
+       </dd>
+  </cxx-effects>
+
+       <cxx-throws para_num="4" id="parallel.task_block.define_task_block.4">
+    
+    <dt>Throws:</dt><dd>
+         <code>exception_list</code>, as specified in <cxx-ref to="parallel.task_block.exceptions"><a title="parallel.task_block.exceptions" href="#parallel.task_block.exceptions">5.5</a></cxx-ref>.
+       </dd>
+  </cxx-throws>
+
+       <cxx-postconditions para_num="5" id="parallel.task_block.define_task_block.5">
+    
+    <dt>Postconditions:</dt><dd>
+         All tasks spawned from <code>f</code> have finished execution.
+       </dd>
+  </cxx-postconditions>
+
+       <cxx-remarks para_num="6" id="parallel.task_block.define_task_block.6">
+    
+    <dt>Remarks:</dt><dd>
+         The <code>define_task_block</code> function may return on a thread other than the one on which it was called
+         unless there are no task blocks active on entry to <code>define_task_block</code> (see <cxx-ref to="parallel.task_block.class"><a title="parallel.task_block.class" href="#parallel.task_block.class">5.3</a></cxx-ref>), in which
+         case the function returns on the original thread. When <code>define_task_block</code> returns on a different thread,
+         it synchronizes with operations following the call. <cxx-note><span class="nowrap">[ <em>Note:</em></span>
+     The return from define_task_block is ordered
+         similarly to an ordinary function call in a single thread.
+    <span class="nowrap">— <em>end note</em> ]</span>
+  </cxx-note> The <code>define_task_block_restore_thread</code>
+         function always returns on the same thread as the one on which it was called.
+       </dd>
+  </cxx-remarks>
+
+       <cxx-notes para_num="7" id="parallel.task_block.define_task_block.7">
+    
+    <dt>Notes:</dt><dd>
+         It is expected (but not mandated) that <code>f</code> will (directly or indirectly) call <code>tb.run(<em>function-object</em>)</code>.
+       </dd>
+  </cxx-notes>
+     
+    </dl>
+  </cxx-function>
+   
+    </section>
+  </cxx-section>
+
+   <cxx-section id="parallel.task_block.exceptions">
+    
+
+    <section>
+      <header><span class="section-number">5.5</span> <h1 data-bookmark-label="5.5 Exception Handling">Exception Handling</h1> <span style="float:right"><a href="#parallel.task_block.exceptions">[parallel.task_block.exceptions]</a></span></header>
+      
+     
+
+     <p para_num="1" id="parallel.task_block.exceptions.1">
+       Every <code>task_block</code> has an associated exception list. When the task block starts, its associated exception list is empty.
+     </p>
+
+     <p para_num="2" id="parallel.task_block.exceptions.2">
+       When an exception is thrown from the user-provided function object passed to <code>define_task_block</code> or
+       <code>define_task_block_restore_thread</code>, it is added to the exception list for that task block. Similarly, when
+       an exception is thrown from the user-provided function object passed into <code>task_block::run</code>, the exception
+       object is added to the exception list associated with the nearest enclosing task block. In both cases, an
+       implementation may discard any pending tasks that have not yet been invoked. Tasks that are already in
+       progress are not interrupted except at a call to <code>task_block::run</code> or <code>task_block::wait</code> as described below.
+     </p>
+
+     <p para_num="3" id="parallel.task_block.exceptions.3">
+       If the implementation is able to detect that an exception has been thrown by another task within
+       the same nearest enclosing task block, then <code>task_block::run</code> or <code>task_block::wait</code> may throw
+       <code>task_canceled_exception</code>; these instances of <code>task_canceled_exception</code> are not added to the exception
+       list of the corresponding task block.
+     </p>
+
+     <p para_num="4" id="parallel.task_block.exceptions.4">
+       When a task block finishes with a non-empty exception list, the exceptions are aggregated into an <code>exception_list</code> object, which is then thrown from the task block.
+     </p>
+
+     <p para_num="5" id="parallel.task_block.exceptions.5">
+       The order of the exceptions in the <code>exception_list</code> object is unspecified.
+     </p>
+   
+    </section>
+  </cxx-section>
+
+    </section>
+  </cxx-clause>
+</ins>
+
+
 
 
 
diff --git a/parallelism-ts.pdf b/parallelism-ts.pdf
deleted file mode 100644
index afabf17..0000000
Binary files a/parallelism-ts.pdf and /dev/null differ
diff --git a/scope.html b/scope.html
new file mode 100644
index 0000000..04db655
--- /dev/null
+++ b/scope.html
@@ -0,0 +1,20 @@
+<cxx-clause id="parallel.scope">
+  <h1>Scope</h1>
+    <p>This Technical Specification describes requirements for implementations of an
+    interface that computer programs written in the C++ programming language may
+    use to invoke algorithms with parallel execution. The algorithms described by
+    this Technical Specification are realizable across a broad class of
+    computer architectures.</p>
+    
+    <p>This Technical Specification is non-normative. Some of the functionality
+    described by this Technical Specification may be considered for standardization
+    in a future version of C++, but it is not currently part of any C++ standard.
+    Some of the functionality in this Technical Specification may never be
+    standardized, and other functionality may be standardized in a substantially
+    changed form.</p>
+    
+    <p>The goal of this Technical Specification is to build widespread existing
+    practice for parallelism in the C++ standard algorithms library. It gives
+    advice on extensions to those vendors who wish to provide them.</p>
+</cxx-clause>
+
diff --git a/task_block.html b/task_block.html
new file mode 100644
index 0000000..e006c52
--- /dev/null
+++ b/task_block.html
@@ -0,0 +1,298 @@
+<cxx-clause id="parallel.task_block">
+  <h1>Task Block</h1>
+
+   <cxx-section id="parallel.task_block.synopsis">
+     <h1>Header <code>&lt;experimental/task_block&gt;</code> synopsis</h1>
+
+     <pre>
+namespace std {
+namespace experimental {
+namespace parallel {
+inline namespace v2 {
+  class task_cancelled_exception;
+
+  class task_block;
+
+  template&lt;class F&gt;
+    void define_task_block(F&& f);
+
+  template&lt;class f&gt;
+    void define_task_block_restore_thread(F&& f);
+}
+}
+}
+}
+     </pre>
+   </cxx-section>
+
+   <cxx-section id="parallel.task_block.task_cancelled_exception">
+     <h1>Class <code>task_cancelled_exception</code></h1>
+     <pre>
+
+namespace std {
+namespace experimental {
+namespace parallel
+inline namespace v2 {
+
+  class task_cancelled_exception : public exception
+  {
+    public:
+      task_cancelled_exception() noexcept;
+      virtual const char* what() const noexcept;
+  };
+}
+}
+}
+}
+     </pre>
+
+     <p>
+       The class <code>task_cancelled_exception</code> defines the type of objects thrown by
+       <code>task_block::run</code> or <code>task_block::wait</code> if they detect than an
+       exception is pending within the current parallel block. See <cxx-ref to="parallel.task_block.exceptions"></cxx-ref>, below.
+     </p>
+
+     <cxx-section id="parallel.task_block.task_cancelled_exception.what">
+       <h1><code>task_cancelled_exception</code> member function <code>what</code></h1>
+
+       <cxx-function>
+         <cxx-signature>virtual const char* what() const noexcept</cxx-signature>
+
+         <cxx-returns>
+           An implementation-defined NTBS.
+         </cxx-returns>
+       </cxx-function>
+     </cxx-section>
+   </cxx-section>
+
+   <cxx-section id="parallel.task_block.class">
+     <h1>Class <code>task_block</code></h1>
+     <pre>
+
+namespace std {
+namespace experimental {
+namespace parallel {
+inline namespace v2 {
+
+  class task_block
+  {
+    private:
+      ~task_block();
+
+    public:
+      task_block(const task_block&) = delete;
+      task_block& operator=(const task_block&) = delete;
+      void operator&() const = delete;
+
+      template&lt;class F&gt;
+        void run(F&& f);
+
+      void wait();
+  };
+}
+}
+}
+}
+     </pre>
+
+     <p>
+       The class <code>task_block</code> defines an interface for forking and joining parallel tasks. The <code>define_task_block</code> and <code>define_task_block_restore_thread</code> function templates create an object of type <code>task_block</code> and pass a reference to that object to a user-provided function object.
+     </p>
+
+     <p>
+       An object of class <code>task_block</code> cannot be constructed, destroyed, copied, or moved except by the implementation of the task block library. Taking the address of a <code>task_block</code> object via <code>operator&</code> is ill-formed. Obtaining its address by any other means (including <code>addressof</code>) results in a pointer with an unspecified value; dereferencing such a pointer results in undefined behavior.
+     </p>
+
+     <p>
+       A <code>task_block</code> is <em>active</em> if it was created by the nearest enclosing task block, where “task block” refers to an
+       invocation of <code>define_task_block</code> or <code>define_task_block_restore_thread</code> and “nearest enclosing” means the most
+       recent invocation that has not yet completed. Code designated for execution in another thread by means other
+       than the facilities in this section (e.g., using <code>thread</code> or <code>async</code>) are not enclosed in the task block and a
+       <code>task_block</code> passed to (or captured by) such code is not active within that code. Performing any operation on a
+       <code>task_block</code> that is not active results in undefined behavior.
+     </p>
+
+     <p>
+       When the argument to <code>task_block::run</code> is called, no <code>task_block</code> is active, not even the <code>task_block</code> on which <code>run</code> was called.
+       (The function object should not, therefore, capture a <code>task_block</code> from the surrounding block.)
+     </p>
+
+     <cxx-example>
+     <pre>define_task_block([&](auto& tb) {
+  tb.run([&]{
+    tb.run([] { f(); });               // Error: tb is not active within run
+    define_task_block([&](auto& tb2) { // Define new task block
+      tb2.run(f);
+      ...
+    });
+  });
+  ...
+});
+     </pre>
+     </cxx-example><pre>
+</pre>
+
+     <cxx-note>
+       Implementations are encouraged to diagnose the above error at translation time.
+     </cxx-note>
+
+     <cxx-section id="parallel.task_block.class.run">
+     
+       <h1><code>task_block</code> member function template <code>run</code></h1>
+  
+       <cxx-function>
+         <cxx-signature>template&lt;class F&gt; void run(F&& f);</cxx-signature>
+
+         <cxx-requires>
+           <code>F</code> shall be <code>MoveConstructible</code>. <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))()</code> shall be a valid expression.
+         </cxx-requires>
+  
+         <cxx-preconditions>
+           <code>*this</code> shall be the active <code>task_block</code>.
+         </cxx-preconditions>
+  
+         <cxx-effects>
+           Evaluates <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))()</code>, where <code><em>DECAY_COPY</em>(std::forward&lt;F&gt;(f))</code>
+           is evaluated synchronously within the current thread. The call to the resulting copy of the function object is
+           permitted to run on an unspecified thread created by the implementation in an unordered fashion relative to
+           the sequence of operations following the call to <code>run(f)</code> (the continuation), or indeterminately sequenced
+           within the same thread as the continuation. The call to <code>run</code> synchronizes with the call to the function
+           object. The completion of the call to the function object synchronizes with the next invocation of <code>wait</code> on
+           the same <code>task_block</code> or completion of the nearest enclosing task block (i.e., the <code>define_task_block</code> or
+           <code>define_task_block_restore_thread</code> that created this <code>task_block</code>).
+         </cxx-effects>
+  
+         <cxx-throws>
+           <code>task_cancelled_exception</code>, as described in <cxx-ref to="parallel.task_block.exceptions"></cxx-ref>.
+         </cxx-throws>
+  
+         <cxx-remarks>
+           The <code>run</code> function may return on a thread other than the one on which it was called; in such cases,
+           completion of the call to <code>run</code> synchronizes with the continuation.
+           
+           <cxx-note> The return from <code>run</code> is ordered similarly to an ordinary function call in a single thread.</cxx-note>
+         </cxx-remarks>
+  
+         <cxx-remarks>
+           The invocation of the user-supplied function object <code>f</code> may be immediate or may be delayed until
+           compute resources are available. <code>run</code> might or might not return before the invocation of <code>f</code> completes.
+         </cxx-remarks>
+  
+       </cxx-function>
+     </cxx-section>
+
+     <cxx-section id="parallel.task_block.class.wait">
+
+       <h1><code>task_block</code> member function <code>wait</code></h1>
+
+       <cxx-function>
+         <cxx-signature>void wait();</cxx-signature>
+
+         <cxx-preconditions><code>*this</code> shall be the active <code>task_block</code>.</cxx-preconditions>
+
+         <cxx-effects>
+           Blocks until the tasks spawned using this <code>task_block</code> have completed.
+         </cxx-effects>
+
+         <cxx-throws>
+           <code>task_cancelled_exception</code>, as described in <cxx-ref to="parallel.task_block.exceptions"></cxx-ref>.
+         </cxx-throws>
+
+         <cxx-postconditions>
+           All tasks spawned by the nearest enclosing task block have completed.
+         </cxx-postconditions>
+
+         <cxx-remarks>
+           The <code>wait</code> function may return on a thread other than the one on which it was called; in such cases, completion of the call to <code>wait</code> synchronizes with subsequent operations.
+           
+           <cxx-note>The return from wait is ordered similarly to an ordinary function call in a single thread.</cxx-note>
+
+           <cxx-example><pre>
+define_task_block([&](auto& tb) {
+  tb.run([&]{ process(a, w, x); }); // Process a[w] through a[x]
+  if (y &lt; x) tb.wait();             // Wait if overlap between [w,x) and [y,z)
+  process(a, y, z);                 // Process a[y] through a[z]
+});
+</pre>
+           </cxx-example>
+         </cxx-remarks>
+       </cxx-function>
+     </cxx-section>
+   </cxx-section>
+
+   <cxx-section id="parallel.task_block.define_task_block">
+     <h1>Function template <code>define_task_block</code></h1>
+
+     <cxx-function>
+       <cxx-signature>template&lt;class F&gt;
+void define_task_block(F&& f);
+       </cxx-signature>
+
+       <cxx-signature>template&lt;class F&gt;
+void define_task_block_restore_thread(F&& f);
+       </cxx-signature>
+
+       <cxx-requires>
+         Given an lvalue <code>tb</code> of type <code>task_block</code>, the expression <code>f(tb)</code> shall be well-formed
+       </cxx-requires>
+
+       <cxx-effects>
+         Constructs a <code>task_block</code> <code>tb</code> and calls <code>f(tb)</code>.
+       </cxx-effects>
+
+       <cxx-throws>
+         <code>exception_list</code>, as specified in <cxx-ref to="parallel.task_block.exceptions"></cxx-ref>.
+       </cxx-throws>
+
+       <cxx-postconditions>
+         All tasks spawned from <code>f</code> have finished execution.
+       </cxx-postconditions>
+
+       <cxx-remarks>
+         The <code>define_task_block</code> function may return on a thread other than the one on which it was called
+         unless there are no task blocks active on entry to <code>define_task_block</code> (see <cxx-ref to="parallel.task_block.class"></cxx-ref>), in which
+         case the function returns on the original thread. When <code>define_task_block</code> returns on a different thread,
+         it synchronizes with operations following the call. <cxx-note> The return from define_task_block is ordered
+         similarly to an ordinary function call in a single thread.</cxx-note> The <code>define_task_block_restore_thread</code>
+         function always returns on the same thread as the one on which it was called.
+       </cxx-remarks>
+
+       <cxx-notes>
+         It is expected (but not mandated) that <code>f</code> will (directly or indirectly) call <code>tb.run(<em>function-object</em>)</code>.
+       </cxx-notes>
+     </cxx-function>
+   </cxx-section>
+
+   <cxx-section id="parallel.task_block.exceptions">
+     <h1>Exception Handling</h1>
+
+     <p>
+       Every <code>task_block</code> has an associated exception list. When the task block starts, its associated exception list is empty.
+     </p>
+
+     <p>
+       When an exception is thrown from the user-provided function object passed to <code>define_task_block</code> or
+       <code>define_task_block_restore_thread</code>, it is added to the exception list for that task block. Similarly, when
+       an exception is thrown from the user-provided function object passed into <code>task_block::run</code>, the exception
+       object is added to the exception list associated with the nearest enclosing task block. In both cases, an
+       implementation may discard any pending tasks that have not yet been invoked. Tasks that are already in
+       progress are not interrupted except at a call to <code>task_block::run</code> or <code>task_block::wait</code> as described below.
+     </p>
+
+     <p>
+       If the implementation is able to detect that an exception has been thrown by another task within
+       the same nearest enclosing task block, then <code>task_block::run</code> or <code>task_block::wait</code> may throw
+       <code>task_canceled_exception</code>; these instances of <code>task_canceled_exception</code> are not added to the exception
+       list of the corresponding task block.
+     </p>
+
+     <p>
+       When a task block finishes with a non-empty exception list, the exceptions are aggregated into an <code>exception_list</code> object, which is then thrown from the task block.
+     </p>
+
+     <p>
+       The order of the exceptions in the <code>exception_list</code> object is unspecified.
+     </p>
+   </cxx-section>
+</cxx-clause>
+
diff --git a/terms_and_definitions.html b/terms_and_definitions.html
new file mode 100644
index 0000000..22aa5e4
--- /dev/null
+++ b/terms_and_definitions.html
@@ -0,0 +1,54 @@
+<cxx-clause id="parallel.defns">
+  <h1>Terms and definitions</h1>
+
+  <p>For the purposes of this document, the terms and definitions given in the C++ Standard and the following apply.</p>
+
+  <p>A <dfn>parallel algorithm</dfn> is a function template described by this Technical Specification declared in namespace <code>std::experimental::parallel::v2</code> with a formal template parameter named <code>ExecutionPolicy</code>.</p>
+
+  <p>
+    Parallel algorithms access objects indirectly accessible via their arguments by invoking the following functions:
+
+    <ul>
+      <li>
+        All operations of the categories of the iterators that the algorithm is instantiated with.
+      </li>
+
+      <li>
+        Functions on those sequence elements that are required by its specification.
+      </li>
+
+      <li>
+        User-provided function objects to be applied during the execution of the algorithm, if required by the specification.
+      </li>
+
+      <li>
+        Operations on those function objects required by the specification.
+
+        <cxx-note>
+          See clause 25.1 of <em>C++ Standard Algorithms Library</em>.
+        </cxx-note>
+      </li>
+    </ul>
+
+    These functions are herein called <em>element access functions</em>.
+
+    <cxx-example>
+      The <code>sort</code> function may invoke the following element access functions:
+
+      <ul>
+        <li>
+          Methods of the random-access iterator of the actual template argument, as per 24.2.7, as implied by the name of the
+          template parameters <code>RandomAccessIterator</code>.
+        </li>
+
+        <li>
+          The <code>swap</code> function on the elements of the sequence (as per 25.4.1.1 [sort]/2).
+        </li>
+
+        <li>
+          The user-provided <code>Compare</code> function object.
+        </li>
+      </ul>
+    </cxx-example>
+</cxx-clause>
+

Doc. No.	Title	Primary Section	Macro Name	Value	Header
N4505	Working Draft, Technical Specification for C++ Extensions for Parallelism		`__cpp_lib_experimental_parallel_algorithm`	201505	+ `<experimental/algorithm>` + `<experimental/exception_list>` + `<experimental/execution_policy>` + `<experimental/numeric>` +
P0155R0	Task Block R5		`__cpp_lib_experimental_parallel_task_block`	201510	+ `<experimental/task_block>` +
P0076R4	Vector and Wavefront Policies	,	`__cpp_lib_experimental_execution_vector_policy`	201707	+ `<experimental/algorithm>` + `<experimental/execution>` +