add reason for not dumping to string

and "workaround" for those that think they really need this
author: Anthon van der Neut <anthon@mnt.org> 2017-07-27 22:59:24 +0200
committer: Anthon van der Neut <anthon@mnt.org> 2017-07-27 22:59:24 +0200
commit: 2565253f6b61d567280027719181c39250f8a0bb (patch)
tree: e2b434075784eeedfc6552d23bf6de4ca2bbca83 /_doc
parent: f19c96b16b2cc0aeb2fae84a3ddcf0f1bcb46883 (diff)
download: ruamel.yaml-2565253f6b61d567280027719181c39250f8a0bb.tar.gz
1 files changed, 56 insertions, 3 deletions
diff --git a/_doc/example.ryd b/_doc/example.ryd
index 11fa358..a792f48 100644
--- a/_doc/example.ryd
+++ b/_doc/example.ryd
@@ -122,8 +122,8 @@ posted by *demux* on StackOverflow.
 ----
 
 By default ``ruamel.yaml`` indents with two positions in block style, for
-both mappings and sequences. For sequences the indent is counted to the 
-beginning of the scalar, with the dash taking the first position of the 
+both mappings and sequences. For sequences the indent is counted to the
+beginning of the scalar, with the dash taking the first position of the
 indented "space".
 
 The following program with three dumps::
@@ -170,6 +170,59 @@ The following program with three dumps::
 gives as output::
 
 --- |
-The transform example was inspired by a `question posted by *nowox* 
+The transform example was inspired by a `question posted by *nowox*
 <https://stackoverflow.com/q/44388701/1307905>`_ on
 StackOverflow.
+
+-----
+
+Output of ``dump()`` as a string
+++++++++++++++++++++++++++++++++
+
+The single most abused "feature" of the old API is not providing the (second)
+stream parameter to one of the ``dump()`` variants, in order to get a monolithic string
+representation of the stream back.
+
+Apart from being memory inefficient and slow, quite often people using this did not
+realise that ``print(round_trip_dump(dict(a=1, b=2)))`` gets you an extra,
+empty, line after ``b: 2``.
+
+The real question is why this functionality, which is seldom really
+necessary, is available in the old API (and in PyYAML) in the first place. One
+explanation you get by looking at what someone would need to do to make this
+available if it weren't there already. Apart from subclassing the ``Serializer``
+and providing a new ``dump`` method, which would take ten or so lines, another
+**hundred** lines, essentially the whole ``dumper.py`` file, would need to be
+copied to make use of this serializer.
+
+The fact is that one should normally be doing ``round_trip_dump(dict(a=1, b=2),
+sys.stdout)``, which does away with 90% of the cases for returning the string, and
+that all post-processing of YAML, before writing to the stream, can be handled by
+using the ``transform=`` parameter of dump, which covers most of the rest. But
+it is also much easier in the new API to provide that YAML output as a string if
+you really need to have it (or think you do)::
+
+--- !python |
+import sys
+from ruamel.yaml import YAML; from ruamel.yaml.compat import StringIO
+
+class MyYAML(YAML):
+    def dump(self, data, stream=None, **kw):
+        inefficient = False
+        if stream is None:
+            inefficient = True
+            stream = StringIO()
+        YAML.dump(self, data, stream, **kw)
+        if inefficient:
+            return stream.getvalue()
+
+yaml = MyYAML()   # or typ='safe'/'unsafe' etc
+--- |
+with about one tenth of the lines needed for the old interface, you can once more do::
+--- !code |
+print(yaml.dump(dict(a=1, b=2)))
+--- |
+instead of::
+--- !code |
+yaml.dump(dict(a=1, b=2), sys.stdout)
+print()  # or sys.stdout.write('\n')
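
Note on the added documentation: the extra empty line it mentions, and the stream-or-string pattern used by ``MyYAML.dump``, can be reproduced with the standard library alone. The following is a minimal sketch, not ruamel.yaml code: ``write_yaml_like`` is a hypothetical stand-in for a stream-based dumper and only handles flat mappings, and ``dump_or_return`` is an illustrative name that mirrors the shape of the subclass in the patch.

```python
import io


def write_yaml_like(data, stream):
    """Hypothetical stand-in for a stream-based dumper (flat mappings only)."""
    for key, value in data.items():
        stream.write("{}: {}\n".format(key, value))


def dump_or_return(data, stream=None):
    """Write to `stream` when given; otherwise buffer and return the text."""
    if stream is not None:
        write_yaml_like(data, stream)
        return None
    buf = io.StringIO()
    write_yaml_like(data, buf)
    return buf.getvalue()


# The returned text already ends with a newline ...
text = dump_or_return(dict(a=1, b=2))

# ... so print() appends a second one, producing the extra empty line.
printed = io.StringIO()
print(text, file=printed)

# Writing straight to a stream avoids both the buffer and the extra newline.
direct = io.StringIO()
result = dump_or_return(dict(a=1, b=2), direct)
```

Under these assumptions, ``printed`` ends with two newlines while ``direct`` ends with one, which is the difference between ``print(yaml.dump(...))`` and dumping to ``sys.stdout`` described in the patch.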