From 8db35f3eddb46f5162dcef4ed47658fab576fe8c Mon Sep 17 00:00:00 2001
From: Robert Collins <robertc@robertcollins.net>
Date: Mon, 13 Jul 2015 12:09:31 +1200
Subject: Rename README to README.rst for github rendering

---
 Makefile.am |   2 +-
 README      | 470 ------------------------------------------------------------
 README.rst  | 470 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 setup.py    |   2 +-
 4 files changed, 472 insertions(+), 472 deletions(-)
 delete mode 100644 README
 create mode 100644 README.rst

diff --git a/Makefile.am b/Makefile.am
index 765ae40..82cd61e 100644
--- a/Makefile.am
+++ b/Makefile.am
@@ -5,7 +5,7 @@ EXTRA_DIST =  \
 	INSTALL \
 	Makefile.am \
 	NEWS \
-	README \
+	README.rst \
 	all_tests.py \
 	c++/README \
 	c/README \
diff --git a/README b/README
deleted file mode 100644
index 76a7418..0000000
--- a/README
+++ /dev/null
@@ -1,470 +0,0 @@
-
-  subunit: A streaming protocol for test results
-  Copyright (C) 2005-2013 Robert Collins <robertc@robertcollins.net>
-
-  Licensed under either the Apache License, Version 2.0 or the BSD 3-clause
-  license at the users choice. A copy of both licenses are available in the
-  project source as Apache-2.0 and BSD. You may not use this file except in
-  compliance with one of these two licences.
-
-  Unless required by applicable law or agreed to in writing, software
-  distributed under these licenses is distributed on an "AS IS" BASIS, WITHOUT
-  WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.  See the
-  license you chose for the specific language governing permissions and
-  limitations under that license.
-
-  See the COPYING file for full details on the licensing of Subunit.
-
-  subunit reuses iso8601 by Michael Twomey, distributed under an MIT style
-  licence - see python/iso8601/LICENSE for details.
-
-Subunit
--------
-
-Subunit is a streaming protocol for test results.
-
-There are two major revisions of the protocol. Version 1 was trivially human
-readable but had significant defects as far as highly parallel testing was
-concerned - it had no room for doing discovery and execution in parallel,
-required substantial buffering when multiplexing and was fragile - a corrupt
-byte could cause an entire stream to be misparsed. Version 1.1 added
-encapsulation of binary streams which mitigated some of the issues but the
-core remained.
-
-Version 2 shares many of the good characteristics of Version 1 - it can be
-embedded into a regular text stream (e.g. from a build system) and it still
-models xUnit style test execution. It also fixes many of the issues with
-Version 1 - Version 2 can be multiplexed without excessive buffering (in
-time or space), it has a well defined recovery mechanism for dealing with
-corrupted streams (e.g. where two processes write to the same stream
-concurrently, or where the stream generator suffers a bug).
-
-More details on both protocol version s can be found in the 'Protocol' section
-of this document.
-
-Subunit comes with command line filters to process a subunit stream and
-language bindings for python, C, C++ and shell. Bindings are easy to write
-for other languages.
-
-A number of useful things can be done easily with subunit:
- * Test aggregation: Tests run separately can be combined and then
-   reported/displayed together. For instance, tests from different languages
-   can be shown as a seamless whole, and tests running on multiple machines
-   can be aggregated into a single stream through a multiplexer.
- * Test archiving: A test run may be recorded and replayed later.
- * Test isolation: Tests that may crash or otherwise interact badly with each
-   other can be run seperately and then aggregated, rather than interfering
-   with each other or requiring an adhoc test->runner reporting protocol.
- * Grid testing: subunit can act as the necessary serialisation and
-   deserialiation to get test runs on distributed machines to be reported in
-   real time.
-
-Subunit supplies the following filters:
- * tap2subunit - convert perl's TestAnythingProtocol to subunit.
- * subunit2csv - convert a subunit stream to csv.
- * subunit2pyunit - convert a subunit stream to pyunit test results.
- * subunit2gtk - show a subunit stream in GTK.
- * subunit2junitxml - convert a subunit stream to JUnit's XML format.
- * subunit-diff - compare two subunit streams.
- * subunit-filter - filter out tests from a subunit stream.
- * subunit-ls - list info about tests present in a subunit stream.
- * subunit-stats - generate a summary of a subunit stream.
- * subunit-tags - add or remove tags from a stream.
-
-Integration with other tools
-----------------------------
-
-Subunit's language bindings act as integration with various test runners like
-'check', 'cppunit', Python's 'unittest'. Beyond that a small amount of glue
-(typically a few lines) will allow Subunit to be used in more sophisticated
-ways.
-
-Python
-======
-
-Subunit has excellent Python support: most of the filters and tools are written
-in python and there are facilities for using Subunit to increase test isolation
-seamlessly within a test suite.
-
-The most common way is to run an existing python test suite and have it output
-subunit via the ``subunit.run`` module::
-
-  $ python -m subunit.run mypackage.tests.test_suite
-
-For more information on the Python support Subunit offers , please see
-``pydoc subunit``, or the source in ``python/subunit/``
-
-C
-=
-
-Subunit has C bindings to emit the protocol. The 'check' C unit testing project
-has included subunit support in their project for some years now. See
-'c/README' for more details.
-
-C++
-===
-
-The C library is includable and usable directly from C++. A TestListener for
-CPPUnit is included in the Subunit distribution. See 'c++/README' for details.
-
-shell
-=====
-
-There are two sets of shell tools. There are filters, which accept a subunit
-stream on stdin and output processed data (or a transformed stream) on stdout.
-
-Then there are unittest facilities similar to those for C : shell bindings
-consisting of simple functions to output protocol elements, and a patch for
-adding subunit output to the 'ShUnit' shell test runner. See 'shell/README' for
-details.
-
-Filter recipes
---------------
-
-To ignore some failing tests whose root cause is already known::
-
-  subunit-filter --without 'AttributeError.*flavor'
-
-
-The xUnit test model
---------------------
-
-Subunit implements a slightly modified xUnit test model. The stock standard
-model is that there are tests, which have an id(), can be run, and when run
-start, emit an outcome (like success or failure) and then finish.
-
-Subunit extends this with the idea of test enumeration (find out about tests
-a runner has without running them), tags (allow users to describe tests in
-ways the test framework doesn't apply any semantic value to), file attachments
-(allow arbitrary data to make analysing a failure easy) and timestamps.
-
-The protocol
-------------
-
-Version 2, or v2 is new and still under development, but is intended to
-supercede version 1 in the very near future. Subunit's bundled tools accept
-only version 2 and only emit version 2, but the new filters subunit-1to2 and
-subunit-2to1 can be used to interoperate with older third party libraries.
-
-Version 2
-=========
-
-Version 2 is a binary protocol consisting of independent packets that can be
-embedded in the output from tools like make - as long as each packet has no
-other bytes mixed in with it (which 'make -j N>1' has a tendency of doing).
-Version 2 is currently in draft form, and early adopters should be willing
-to either discard stored results (if protocol changes are made), or bulk
-convert them back to v1 and then to a newer edition of v2.
-
-The protocol synchronises at the start of the stream, after a packet, or
-after any 0x0A byte. That is, a subunit v2 packet starts after a newline or
-directly after the end of the prior packet.
-
-Subunit is intended to be transported over a reliable streaming protocol such
-as TCP. As such it does not concern itself with out of order delivery of
-packets. However, because of the possibility of corruption due to either
-bugs in the sender, or due to mixed up data from concurrent writes to the same
-fd when being embedded, subunit strives to recover reasonably gracefully from
-damaged data.
-
-A key design goal for Subunit version 2 is to allow processing and multiplexing
-without forcing buffering for semantic correctness, as buffering tends to hide
-hung or otherwise misbehaving tests. That said, limited time based buffering
-for network efficiency is a good idea - this is ultimately implementator
-choice. Line buffering is also discouraged for subunit streams, as dropping
-into a debugger or other tool may require interactive traffic even if line
-buffering would not otherwise be a problem.
-
-In version two there are two conceptual events - a test status event and a file
-attachment event. Events may have timestamps, and the path of multiplexers that
-an event is routed through is recorded to permit sending actions back to the
-source (such as new tests to run or stdin for driving debuggers and other
-interactive input). Test status events are used to enumerate tests, to report
-tests and test helpers as they run. Tests may have tags, used to allow
-tunnelling extra meanings through subunit without requiring parsing of
-arbitrary file attachments. Things that are not standalone tests get marked
-as such by setting the 'Runnable' flag to false. (For instance, individual
-assertions in TAP are not runnable tests, only the top level TAP test script
-is runnable).
-
-File attachments are used to provide rich detail about the nature of a failure.
-File attachments can also be used to encapsulate stdout and stderr both during
-and outside tests.
-
-Most numbers are stored in network byte order - Most Significant Byte first
-encoded using a variation of http://www.dlugosz.com/ZIP2/VLI.html. The first
-byte's top 2 high order bits encode the total number of octets in the number.
-This encoding can encode values from 0 to 2**30-1, enough to encode a
-nanosecond. Numbers that are not variable length encoded are still stored in
-MSB order.
-
- prefix   octets   max       max
-+-------+--------+---------+------------+
-| 00    |      1 |  2**6-1 |         63 |
-| 01    |      2 | 2**14-1 |      16383 |
-| 10    |      3 | 2**22-1 |    4194303 |
-| 11    |      4 | 2**30-1 | 1073741823 |
-+-------+--------+---------+------------+
-
-All variable length elements of the packet are stored with a length prefix
-number allowing them to be skipped over for consumers that don't need to
-interpret them.
-
-UTF-8 strings are with no terminating NUL and should not have any embedded NULs
-(implementations SHOULD validate any such strings that they process and take
-some remedial action (such as discarding the packet as corrupt).
-
-In short the structure of a packet is:
-PACKET := SIGNATURE FLAGS PACKET_LENGTH TIMESTAMP? TESTID? TAGS? MIME?
-          FILECONTENT? ROUTING_CODE? CRC32
-
-In more detail...
-
-Packets are identified by a single byte signature - 0xB3, which is never legal
-in a UTF-8 stream as the first byte of a character. 0xB3 starts with the first
-bit set and the second not, which is the UTF-8 signature for a continuation
-byte. 0xB3 was chosen as 0x73 ('s' in ASCII') with the top two bits replaced by
-the 1 and 0 for a continuation byte.
-
-If subunit packets are being embedded in a non-UTF-8 text stream, where 0x73 is
-a legal character, consider either recoding the text to UTF-8, or using
-subunit's 'file' packets to embed the text stream in subunit, rather than the
-other way around.
-
-Following the signature byte comes a 16-bit flags field, which includes a
-4-bit version field - if the version is not 0x2 then the packet cannot be
-read. It is recommended to signal an error at this point (e.g. by emitting
-a synthetic error packet and returning to the top level loop to look for
-new packets, or exiting with an error). If recovery is desired, treat the
-packet signature as an opaque byte and scan for a new synchronisation point.
-NB: Subunit V1 and V2 packets may legitimately included 0xB3 internally,
-as they are an 8-bit safe container format, so recovery from this situation
-may involve an arbitrary number of false positives until an actual packet
-is encountered : and even then it may still be false, failing after passing
-the version check due to coincidence.
-
-Flags are stored in network byte order too.
-+-------------------------+------------------------+
-| High byte               | Low byte               |
-| 15 14 13 12 11 10  9  8 | 7  6  5  4  3  2  1  0 |
-| VERSION    |feature bits|                        |
-+------------+------------+------------------------+
-
-Valid version values are:
-0x2 - version 2
-
-Feature bits:
-Bit 11 - mask 0x0800 - Test id present.
-Bit 10 - mask 0x0400 - Routing code present.
-Bit  9 - mask 0x0200 - Timestamp present.
-Bit  8 - mask 0x0100 - Test is 'runnable'.
-Bit  7 - mask 0x0080 - Tags are present.
-Bit  6 - mask 0x0040 - File content is present.
-Bit  5 - mask 0x0020 - File MIME type is present.
-Bit  4 - mask 0x0010 - EOF marker.
-Bit  3 - mask 0x0008 - Must be zero in version 2.
-
-Test status gets three bits:
-Bit 2 | Bit 1 | Bit 0 - mask 0x0007 - A test status enum lookup:
-000 - undefined / no test
-001 - Enumeration / existence
-002 - In progress
-003 - Success
-004 - Unexpected Success
-005 - Skipped
-006 - Failed
-007 - Expected failure
-
-After the flags field is a number field giving the length in bytes for the
-entire packet including the signature and the checksum. This length must
-be less than 4MiB - 4194303 bytes. The encoding can obviously record a larger
-number but one of the goals is to avoid requiring large buffers, or causing
-large latency in the packet forward/processing pipeline. Larger file
-attachments can be communicated in multiple packets, and the overhead in such a
-4MiB packet is approximately 0.2%.
-
-The rest of the packet is a series of optional features as specified by the set
-feature bits in the flags field. When absent they are entirely absent.
-
-Forwarding and multiplexing of packets can be done without interpreting the
-remainder of the packet until the routing code and checksum (which are both at
-the end of the packet). Additionally, routers can often avoid copying or moving
-the bulk of the packet, as long as the routing code size increase doesn't force
-the length encoding to take up a new byte (which will only happen to packets
-less than or equal to 16KiB in length) - large packets are very efficient to
-route.
-
-Timestamp when present is a 32 bit unsigned integer for secnods, and a variable
-length number for nanoseconds, representing UTC time since Unix Epoch in
-seconds and nanoseconds.
-
-Test id when present is a UTF-8 string. The test id should uniquely identify
-runnable tests such that they can be selected individually. For tests and other
-actions which cannot be individually run (such as test
-fixtures/layers/subtests) uniqueness is not required (though being human
-meaningful is highly recommended).
-
-Tags when present is a length prefixed vector of UTF-8 strings, one per tag.
-There are no restrictions on tag content (other than the restrictions on UTF-8
-strings in subunit in general). Tags have no ordering.
-
-When a MIME type is present, it defines the MIME type for the file across all
-packets same file (routing code + testid + name uniquely identifies a file,
-reset when EOF is flagged). If a file never has a MIME type set, it should be
-treated as application/octet-stream.
-
-File content when present is a UTF-8 string for the name followed by the length
-in bytes of the content, and then the content octets.
-
-If present routing code is a UTF-8 string. The routing code is used to
-determine which test backend a test was running on when doing data analysis,
-and to route stdin to the test process if interaction is required.
-
-Multiplexers SHOULD add a routing code if none is present, and prefix any
-existing routing code with a routing code ('/' separated) if one is already
-present. For example, a multiplexer might label each stream it is multiplexing
-with a simple ordinal ('0', '1' etc), and given an incoming packet with route
-code '3' from stream '0' would adjust the route code when forwarding the packet
-to be '0/3'.
-
-Following the end of the packet is a CRC-32 checksum of the contents of the
-packet including the signature.
-
-Example packets
-~~~~~~~~~~~~~~~
-
-Trivial test "foo" enumeration packet, with test id, runnable set,
-status=enumeration. Spaces below are to visually break up signature / flags /
-length / testid / crc32
-
-b3 2901 0c 03666f6f 08555f1b
-
-
-Version 1 (and 1.1)
-===================
-
-Version 1 (and 1.1) are mostly human readable protocols.
-
-Sample subunit wire contents
-----------------------------
-
-The following::
-  test: test foo works
-  success: test foo works.
-  test: tar a file.
-  failure: tar a file. [
-  ..
-   ]..  space is eaten.
-  foo.c:34 WARNING foo is not defined.
-  ]
-  a writeln to stdout
-
-When run through subunit2pyunit::
-  .F
-  a writeln to stdout
-
-  ========================
-  FAILURE: tar a file.
-  -------------------
-  ..
-  ]..  space is eaten.
-  foo.c:34 WARNING foo is not defined.
-
-
-Subunit protocol description
-============================
-
-This description is being ported to an EBNF style. Currently its only partly in
-that style, but should be fairly clear all the same. When in doubt, refer the
-source (and ideally help fix up the description!). Generally the protocol is
-line orientated and consists of either directives and their parameters, or
-when outside a DETAILS region unexpected lines which are not interpreted by
-the parser - they should be forwarded unaltered.
-
-test|testing|test:|testing: test LABEL
-success|success:|successful|successful: test LABEL
-success|success:|successful|successful: test LABEL DETAILS
-failure: test LABEL
-failure: test LABEL DETAILS
-error: test LABEL
-error: test LABEL DETAILS
-skip[:] test LABEL
-skip[:] test LABEL DETAILS
-xfail[:] test LABEL
-xfail[:] test LABEL DETAILS
-uxsuccess[:] test LABEL
-uxsuccess[:] test LABEL DETAILS
-progress: [+|-]X
-progress: push
-progress: pop
-tags: [-]TAG ...
-time: YYYY-MM-DD HH:MM:SSZ
-
-LABEL: UTF8*
-NAME: UTF8*
-DETAILS ::= BRACKETED | MULTIPART
-BRACKETED ::= '[' CR UTF8-lines ']' CR
-MULTIPART ::= '[ multipart' CR PART* ']' CR
-PART ::= PART_TYPE CR NAME CR PART_BYTES CR
-PART_TYPE ::= Content-Type: type/sub-type(;parameter=value,parameter=value)
-PART_BYTES ::= (DIGITS CR LF BYTE{DIGITS})* '0' CR LF
-
-unexpected output on stdout -> stdout.
-exit w/0 or last test completing -> error
-
-Tags given outside a test are applied to all following tests
-Tags given after a test: line and before the result line for the same test
-apply only to that test, and inherit the current global tags.
-A '-' before a tag is used to remove tags - e.g. to prevent a global tag
-applying to a single test, or to cancel a global tag.
-
-The progress directive is used to provide progress information about a stream
-so that stream consumer can provide completion estimates, progress bars and so
-on. Stream generators that know how many tests will be present in the stream
-should output "progress: COUNT". Stream filters that add tests should output
-"progress: +COUNT", and those that remove tests should output
-"progress: -COUNT". An absolute count should reset the progress indicators in
-use - it indicates that two separate streams from different generators have
-been trivially concatenated together, and there is no knowledge of how many
-more complete streams are incoming. Smart concatenation could scan each stream
-for their count and sum them, or alternatively translate absolute counts into
-relative counts inline. It is recommended that outputters avoid absolute counts
-unless necessary. The push and pop directives are used to provide local regions
-for progress reporting. This fits with hierarchically operating test
-environments - such as those that organise tests into suites - the top-most
-runner can report on the number of suites, and each suite surround its output
-with a (push, pop) pair. Interpreters should interpret a pop as also advancing
-the progress of the restored level by one step. Encountering progress
-directives between the start and end of a test pair indicates that a previous
-test was interrupted and did not cleanly terminate: it should be implicitly
-closed with an error (the same as when a stream ends with no closing test
-directive for the most recently started test).
-
-The time directive acts as a clock event - it sets the time for all future
-events. The value should be a valid ISO8601 time.
-
-The skip, xfail and uxsuccess outcomes are not supported by all testing
-environments. In Python the testttools (https://launchpad.net/testtools)
-library is used to translate these automatically if an older Python version
-that does not support them is in use. See the testtools documentation for the
-translation policy.
-
-skip is used to indicate a test was discovered but not executed. xfail is used
-to indicate a test that errored in some expected fashion (also know as "TODO"
-tests in some frameworks). uxsuccess is used to indicate and unexpected success
-where a test though to be failing actually passes. It is complementary to
-xfail.
-
-Hacking on subunit
-------------------
-
-Releases
-========
-
-* Update versions in configure.ac and python/subunit/__init__.py.
-* Update NEWS.
-* Do a make distcheck, which will update Makefile etc.
-* Do a PyPI release: PYTHONPATH=../../python python ../../setup.py sdist bdist_wheel upload -s
-* Upload the regular one to LP.
-* Push a tagged commit.
-
diff --git a/README.rst b/README.rst
new file mode 100644
index 0000000..76a7418
--- /dev/null
+++ b/README.rst
@@ -0,0 +1,470 @@
+
+  subunit: A streaming protocol for test results
+  Copyright (C) 2005-2013 Robert Collins <robertc@robertcollins.net>
+
+  Licensed under either the Apache License, Version 2.0 or the BSD 3-clause
+  license at the users choice. A copy of both licenses are available in the
+  project source as Apache-2.0 and BSD. You may not use this file except in
+  compliance with one of these two licences.
+
+  Unless required by applicable law or agreed to in writing, software
+  distributed under these licenses is distributed on an "AS IS" BASIS, WITHOUT
+  WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.  See the
+  license you chose for the specific language governing permissions and
+  limitations under that license.
+
+  See the COPYING file for full details on the licensing of Subunit.
+
+  subunit reuses iso8601 by Michael Twomey, distributed under an MIT style
+  licence - see python/iso8601/LICENSE for details.
+
+Subunit
+-------
+
+Subunit is a streaming protocol for test results.
+
+There are two major revisions of the protocol. Version 1 was trivially human
+readable but had significant defects as far as highly parallel testing was
+concerned - it had no room for doing discovery and execution in parallel,
+required substantial buffering when multiplexing and was fragile - a corrupt
+byte could cause an entire stream to be misparsed. Version 1.1 added
+encapsulation of binary streams which mitigated some of the issues but the
+core remained.
+
+Version 2 shares many of the good characteristics of Version 1 - it can be
+embedded into a regular text stream (e.g. from a build system) and it still
+models xUnit style test execution. It also fixes many of the issues with
+Version 1 - Version 2 can be multiplexed without excessive buffering (in
+time or space), it has a well defined recovery mechanism for dealing with
+corrupted streams (e.g. where two processes write to the same stream
+concurrently, or where the stream generator suffers a bug).
+
+More details on both protocol version s can be found in the 'Protocol' section
+of this document.
+
+Subunit comes with command line filters to process a subunit stream and
+language bindings for python, C, C++ and shell. Bindings are easy to write
+for other languages.
+
+A number of useful things can be done easily with subunit:
+ * Test aggregation: Tests run separately can be combined and then
+   reported/displayed together. For instance, tests from different languages
+   can be shown as a seamless whole, and tests running on multiple machines
+   can be aggregated into a single stream through a multiplexer.
+ * Test archiving: A test run may be recorded and replayed later.
+ * Test isolation: Tests that may crash or otherwise interact badly with each
+   other can be run seperately and then aggregated, rather than interfering
+   with each other or requiring an adhoc test->runner reporting protocol.
+ * Grid testing: subunit can act as the necessary serialisation and
+   deserialiation to get test runs on distributed machines to be reported in
+   real time.
+
+Subunit supplies the following filters:
+ * tap2subunit - convert perl's TestAnythingProtocol to subunit.
+ * subunit2csv - convert a subunit stream to csv.
+ * subunit2pyunit - convert a subunit stream to pyunit test results.
+ * subunit2gtk - show a subunit stream in GTK.
+ * subunit2junitxml - convert a subunit stream to JUnit's XML format.
+ * subunit-diff - compare two subunit streams.
+ * subunit-filter - filter out tests from a subunit stream.
+ * subunit-ls - list info about tests present in a subunit stream.
+ * subunit-stats - generate a summary of a subunit stream.
+ * subunit-tags - add or remove tags from a stream.
+
+Integration with other tools
+----------------------------
+
+Subunit's language bindings act as integration with various test runners like
+'check', 'cppunit', Python's 'unittest'. Beyond that a small amount of glue
+(typically a few lines) will allow Subunit to be used in more sophisticated
+ways.
+
+Python
+======
+
+Subunit has excellent Python support: most of the filters and tools are written
+in python and there are facilities for using Subunit to increase test isolation
+seamlessly within a test suite.
+
+The most common way is to run an existing python test suite and have it output
+subunit via the ``subunit.run`` module::
+
+  $ python -m subunit.run mypackage.tests.test_suite
+
+For more information on the Python support Subunit offers , please see
+``pydoc subunit``, or the source in ``python/subunit/``
+
+C
+=
+
+Subunit has C bindings to emit the protocol. The 'check' C unit testing project
+has included subunit support in their project for some years now. See
+'c/README' for more details.
+
+C++
+===
+
+The C library is includable and usable directly from C++. A TestListener for
+CPPUnit is included in the Subunit distribution. See 'c++/README' for details.
+
+shell
+=====
+
+There are two sets of shell tools. There are filters, which accept a subunit
+stream on stdin and output processed data (or a transformed stream) on stdout.
+
+Then there are unittest facilities similar to those for C : shell bindings
+consisting of simple functions to output protocol elements, and a patch for
+adding subunit output to the 'ShUnit' shell test runner. See 'shell/README' for
+details.
+
+Filter recipes
+--------------
+
+To ignore some failing tests whose root cause is already known::
+
+  subunit-filter --without 'AttributeError.*flavor'
+
+
+The xUnit test model
+--------------------
+
+Subunit implements a slightly modified xUnit test model. The stock standard
+model is that there are tests, which have an id(), can be run, and when run
+start, emit an outcome (like success or failure) and then finish.
+
+Subunit extends this with the idea of test enumeration (find out about tests
+a runner has without running them), tags (allow users to describe tests in
+ways the test framework doesn't apply any semantic value to), file attachments
+(allow arbitrary data to make analysing a failure easy) and timestamps.
+
+The protocol
+------------
+
+Version 2, or v2 is new and still under development, but is intended to
+supercede version 1 in the very near future. Subunit's bundled tools accept
+only version 2 and only emit version 2, but the new filters subunit-1to2 and
+subunit-2to1 can be used to interoperate with older third party libraries.
+
+Version 2
+=========
+
+Version 2 is a binary protocol consisting of independent packets that can be
+embedded in the output from tools like make - as long as each packet has no
+other bytes mixed in with it (which 'make -j N>1' has a tendency of doing).
+Version 2 is currently in draft form, and early adopters should be willing
+to either discard stored results (if protocol changes are made), or bulk
+convert them back to v1 and then to a newer edition of v2.
+
+The protocol synchronises at the start of the stream, after a packet, or
+after any 0x0A byte. That is, a subunit v2 packet starts after a newline or
+directly after the end of the prior packet.
+
+Subunit is intended to be transported over a reliable streaming protocol such
+as TCP. As such it does not concern itself with out of order delivery of
+packets. However, because of the possibility of corruption due to either
+bugs in the sender, or due to mixed up data from concurrent writes to the same
+fd when being embedded, subunit strives to recover reasonably gracefully from
+damaged data.
+
+A key design goal for Subunit version 2 is to allow processing and multiplexing
+without forcing buffering for semantic correctness, as buffering tends to hide
+hung or otherwise misbehaving tests. That said, limited time based buffering
+for network efficiency is a good idea - this is ultimately implementator
+choice. Line buffering is also discouraged for subunit streams, as dropping
+into a debugger or other tool may require interactive traffic even if line
+buffering would not otherwise be a problem.
+
+In version two there are two conceptual events - a test status event and a file
+attachment event. Events may have timestamps, and the path of multiplexers that
+an event is routed through is recorded to permit sending actions back to the
+source (such as new tests to run or stdin for driving debuggers and other
+interactive input). Test status events are used to enumerate tests, to report
+tests and test helpers as they run. Tests may have tags, used to allow
+tunnelling extra meanings through subunit without requiring parsing of
+arbitrary file attachments. Things that are not standalone tests get marked
+as such by setting the 'Runnable' flag to false. (For instance, individual
+assertions in TAP are not runnable tests, only the top level TAP test script
+is runnable).
+
+File attachments are used to provide rich detail about the nature of a failure.
+File attachments can also be used to encapsulate stdout and stderr both during
+and outside tests.
+
+Most numbers are stored in network byte order - Most Significant Byte first
+encoded using a variation of http://www.dlugosz.com/ZIP2/VLI.html. The first
+byte's top 2 high order bits encode the total number of octets in the number.
+This encoding can encode values from 0 to 2**30-1, enough to encode a
+nanosecond. Numbers that are not variable length encoded are still stored in
+MSB order.
+
+ prefix   octets   max       max
++-------+--------+---------+------------+
+| 00    |      1 |  2**6-1 |         63 |
+| 01    |      2 | 2**14-1 |      16383 |
+| 10    |      3 | 2**22-1 |    4194303 |
+| 11    |      4 | 2**30-1 | 1073741823 |
++-------+--------+---------+------------+
+
+All variable length elements of the packet are stored with a length prefix
+number allowing them to be skipped over for consumers that don't need to
+interpret them.
+
+UTF-8 strings are with no terminating NUL and should not have any embedded NULs
+(implementations SHOULD validate any such strings that they process and take
+some remedial action (such as discarding the packet as corrupt).
+
+In short the structure of a packet is:
+PACKET := SIGNATURE FLAGS PACKET_LENGTH TIMESTAMP? TESTID? TAGS? MIME?
+          FILECONTENT? ROUTING_CODE? CRC32
+
+In more detail...
+
+Packets are identified by a single byte signature - 0xB3, which is never legal
+in a UTF-8 stream as the first byte of a character. 0xB3 starts with the first
+bit set and the second not, which is the UTF-8 signature for a continuation
+byte. 0xB3 was chosen as 0x73 ('s' in ASCII') with the top two bits replaced by
+the 1 and 0 for a continuation byte.
+
+If subunit packets are being embedded in a non-UTF-8 text stream, where 0x73 is
+a legal character, consider either recoding the text to UTF-8, or using
+subunit's 'file' packets to embed the text stream in subunit, rather than the
+other way around.
+
+Following the signature byte comes a 16-bit flags field, which includes a
+4-bit version field - if the version is not 0x2 then the packet cannot be
+read. It is recommended to signal an error at this point (e.g. by emitting
+a synthetic error packet and returning to the top level loop to look for
+new packets, or exiting with an error). If recovery is desired, treat the
+packet signature as an opaque byte and scan for a new synchronisation point.
+NB: Subunit V1 and V2 packets may legitimately included 0xB3 internally,
+as they are an 8-bit safe container format, so recovery from this situation
+may involve an arbitrary number of false positives until an actual packet
+is encountered : and even then it may still be false, failing after passing
+the version check due to coincidence.
+
+Flags are stored in network byte order too.
++-------------------------+------------------------+
+| High byte               | Low byte               |
+| 15 14 13 12 11 10  9  8 | 7  6  5  4  3  2  1  0 |
+| VERSION    |feature bits|                        |
++------------+------------+------------------------+
+
+Valid version values are:
+0x2 - version 2
+
+Feature bits:
+Bit 11 - mask 0x0800 - Test id present.
+Bit 10 - mask 0x0400 - Routing code present.
+Bit  9 - mask 0x0200 - Timestamp present.
+Bit  8 - mask 0x0100 - Test is 'runnable'.
+Bit  7 - mask 0x0080 - Tags are present.
+Bit  6 - mask 0x0040 - File content is present.
+Bit  5 - mask 0x0020 - File MIME type is present.
+Bit  4 - mask 0x0010 - EOF marker.
+Bit  3 - mask 0x0008 - Must be zero in version 2.
+
+Test status gets three bits:
+Bit 2 | Bit 1 | Bit 0 - mask 0x0007 - A test status enum lookup:
+000 - undefined / no test
+001 - Enumeration / existence
+002 - In progress
+003 - Success
+004 - Unexpected Success
+005 - Skipped
+006 - Failed
+007 - Expected failure
+
+After the flags field is a number field giving the length in bytes for the
+entire packet including the signature and the checksum. This length must
+be less than 4MiB - 4194303 bytes. The encoding can obviously record a larger
+number but one of the goals is to avoid requiring large buffers, or causing
+large latency in the packet forward/processing pipeline. Larger file
+attachments can be communicated in multiple packets, and the overhead in such a
+4MiB packet is approximately 0.2%.
+
+The rest of the packet is a series of optional features as specified by the set
+feature bits in the flags field. When absent they are entirely absent.
+
+Forwarding and multiplexing of packets can be done without interpreting the
+remainder of the packet until the routing code and checksum (which are both at
+the end of the packet). Additionally, routers can often avoid copying or moving
+the bulk of the packet, as long as the routing code size increase doesn't force
+the length encoding to take up a new byte (which will only happen to packets
+less than or equal to 16KiB in length) - large packets are very efficient to
+route.
+
+Timestamp when present is a 32 bit unsigned integer for secnods, and a variable
+length number for nanoseconds, representing UTC time since Unix Epoch in
+seconds and nanoseconds.
+
+Test id when present is a UTF-8 string. The test id should uniquely identify
+runnable tests such that they can be selected individually. For tests and other
+actions which cannot be individually run (such as test
+fixtures/layers/subtests) uniqueness is not required (though being human
+meaningful is highly recommended).
+
+Tags when present is a length prefixed vector of UTF-8 strings, one per tag.
+There are no restrictions on tag content (other than the restrictions on UTF-8
+strings in subunit in general). Tags have no ordering.
+
+When a MIME type is present, it defines the MIME type for the file across all
+packets same file (routing code + testid + name uniquely identifies a file,
+reset when EOF is flagged). If a file never has a MIME type set, it should be
+treated as application/octet-stream.
+
+File content when present is a UTF-8 string for the name followed by the length
+in bytes of the content, and then the content octets.
+
+If present routing code is a UTF-8 string. The routing code is used to
+determine which test backend a test was running on when doing data analysis,
+and to route stdin to the test process if interaction is required.
+
+Multiplexers SHOULD add a routing code if none is present, and prefix any
+existing routing code with a routing code ('/' separated) if one is already
+present. For example, a multiplexer might label each stream it is multiplexing
+with a simple ordinal ('0', '1' etc), and given an incoming packet with route
+code '3' from stream '0' would adjust the route code when forwarding the packet
+to be '0/3'.
+
+Following the end of the packet is a CRC-32 checksum of the contents of the
+packet including the signature.
+
+Example packets
+~~~~~~~~~~~~~~~
+
+Trivial test "foo" enumeration packet, with test id, runnable set,
+status=enumeration. Spaces below are to visually break up signature / flags /
+length / testid / crc32
+
+b3 2901 0c 03666f6f 08555f1b
+
+
+Version 1 (and 1.1)
+===================
+
+Version 1 (and 1.1) are mostly human readable protocols.
+
+Sample subunit wire contents
+----------------------------
+
+The following::
+  test: test foo works
+  success: test foo works.
+  test: tar a file.
+  failure: tar a file. [
+  ..
+   ]..  space is eaten.
+  foo.c:34 WARNING foo is not defined.
+  ]
+  a writeln to stdout
+
+When run through subunit2pyunit::
+  .F
+  a writeln to stdout
+
+  ========================
+  FAILURE: tar a file.
+  -------------------
+  ..
+  ]..  space is eaten.
+  foo.c:34 WARNING foo is not defined.
+
+
+Subunit protocol description
+============================
+
+This description is being ported to an EBNF style. Currently its only partly in
+that style, but should be fairly clear all the same. When in doubt, refer the
+source (and ideally help fix up the description!). Generally the protocol is
+line orientated and consists of either directives and their parameters, or
+when outside a DETAILS region unexpected lines which are not interpreted by
+the parser - they should be forwarded unaltered.
+
+test|testing|test:|testing: test LABEL
+success|success:|successful|successful: test LABEL
+success|success:|successful|successful: test LABEL DETAILS
+failure: test LABEL
+failure: test LABEL DETAILS
+error: test LABEL
+error: test LABEL DETAILS
+skip[:] test LABEL
+skip[:] test LABEL DETAILS
+xfail[:] test LABEL
+xfail[:] test LABEL DETAILS
+uxsuccess[:] test LABEL
+uxsuccess[:] test LABEL DETAILS
+progress: [+|-]X
+progress: push
+progress: pop
+tags: [-]TAG ...
+time: YYYY-MM-DD HH:MM:SSZ
+
+LABEL: UTF8*
+NAME: UTF8*
+DETAILS ::= BRACKETED | MULTIPART
+BRACKETED ::= '[' CR UTF8-lines ']' CR
+MULTIPART ::= '[ multipart' CR PART* ']' CR
+PART ::= PART_TYPE CR NAME CR PART_BYTES CR
+PART_TYPE ::= Content-Type: type/sub-type(;parameter=value,parameter=value)
+PART_BYTES ::= (DIGITS CR LF BYTE{DIGITS})* '0' CR LF
+
+unexpected output on stdout -> stdout.
+exit w/0 or last test completing -> error
+
+Tags given outside a test are applied to all following tests
+Tags given after a test: line and before the result line for the same test
+apply only to that test, and inherit the current global tags.
+A '-' before a tag is used to remove tags - e.g. to prevent a global tag
+applying to a single test, or to cancel a global tag.
+
+The progress directive is used to provide progress information about a stream
+so that stream consumer can provide completion estimates, progress bars and so
+on. Stream generators that know how many tests will be present in the stream
+should output "progress: COUNT". Stream filters that add tests should output
+"progress: +COUNT", and those that remove tests should output
+"progress: -COUNT". An absolute count should reset the progress indicators in
+use - it indicates that two separate streams from different generators have
+been trivially concatenated together, and there is no knowledge of how many
+more complete streams are incoming. Smart concatenation could scan each stream
+for their count and sum them, or alternatively translate absolute counts into
+relative counts inline. It is recommended that outputters avoid absolute counts
+unless necessary. The push and pop directives are used to provide local regions
+for progress reporting. This fits with hierarchically operating test
+environments - such as those that organise tests into suites - the top-most
+runner can report on the number of suites, and each suite surround its output
+with a (push, pop) pair. Interpreters should interpret a pop as also advancing
+the progress of the restored level by one step. Encountering progress
+directives between the start and end of a test pair indicates that a previous
+test was interrupted and did not cleanly terminate: it should be implicitly
+closed with an error (the same as when a stream ends with no closing test
+directive for the most recently started test).
+
+The time directive acts as a clock event - it sets the time for all future
+events. The value should be a valid ISO8601 time.
+
+The skip, xfail and uxsuccess outcomes are not supported by all testing
+environments. In Python the testttools (https://launchpad.net/testtools)
+library is used to translate these automatically if an older Python version
+that does not support them is in use. See the testtools documentation for the
+translation policy.
+
+skip is used to indicate a test was discovered but not executed. xfail is used
+to indicate a test that errored in some expected fashion (also know as "TODO"
+tests in some frameworks). uxsuccess is used to indicate and unexpected success
+where a test though to be failing actually passes. It is complementary to
+xfail.
+
+Hacking on subunit
+------------------
+
+Releases
+========
+
+* Update versions in configure.ac and python/subunit/__init__.py.
+* Update NEWS.
+* Do a make distcheck, which will update Makefile etc.
+* Do a PyPI release: PYTHONPATH=../../python python ../../setup.py sdist bdist_wheel upload -s
+* Upload the regular one to LP.
+* Push a tagged commit.
+
diff --git a/setup.py b/setup.py
index d42d3d7..c6a3808 100755
--- a/setup.py
+++ b/setup.py
@@ -43,7 +43,7 @@ setup(
     name='python-subunit',
     version=VERSION,
     description=('Python implementation of subunit test streaming protocol'),
-    long_description=open('README').read(),
+    long_description=open('README.rst').read(),
     classifiers=[
         'Intended Audience :: Developers',
         'Programming Language :: Python :: 3',
-- 
cgit v1.2.1