# CONTRIBUTING
Thank you for your interest in working on pyparsing! Pyparsing has become a popular module for creating simple
text parsing and data scraping applications. It has been incorporated in several widely-used packages, and is
often used by beginners as part of their first Python project.
## Raising questions / asking for help
If you have a question on using pyparsing, there are a number of resources available online.
- [StackOverflow](https://stackoverflow.com/questions/tagged/pyparsing) - about 10 years of SO questions and answers
can be searched on StackOverflow, tagged with the `pyparsing` tag. Note that some of the older posts will refer
to features in Python 2, or to versions and coding practices for pyparsing that have been replaced by newer classes
and coding idioms.
- [pyparsing sub-reddit](https://www.reddit.com/r/pyparsing/) - still very lightly attended, but open to anyone
wishing to post questions or links related to pyparsing. An alternative channel to StackOverflow for asking
questions.
- [online docs](https://pyparsing-docs.readthedocs.io/en/latest/index.html) and a separately maintained set of class
library docs [here](https://pyparsing-doc.neocities.org/) - These docs are auto-generated from the docstrings
embedded in the pyparsing classes, so the same content can also be viewed with the `help` command in the
interactive Python console or in a Jupyter Notebook.
- [the pyparsing Wikispaces archive](https://github.com/pyparsing/wikispaces_archive) - Before hosting on GitHub,
pyparsing had a separate wiki on the wikispaces.com website. In 2018 that site was discontinued. The discussion
content archive has been reformatted into Markdown and can be viewed by year at the GitHub repository. Just as
with some of the older questions on StackOverflow, some of these older posts may reflect out-of-date pyparsing
and Python features.
- [submit an issue](https://github.com/pyparsing/pyparsing/issues) - If you have a problem with pyparsing that looks
like an actual bug, or have an idea for a feature to add to pyparsing, please submit an issue on GitHub. Some
pyparsing behavior may be counter-intuitive, so try to review some of the other resources first, or some of the
other open and closed issues, or post your question on SO or reddit. But don't wait until you are desperate and
frustrated - just ask! :)
## Submitting examples
If you have an example you wish to submit, please follow these guidelines.
- **License - Submitted example code must be available for distribution with the rest of pyparsing under the MIT
open source license.**
- Please follow PEP8 naming and coding guidelines, and use the black formatter
to auto-format code.
- Examples should import pyparsing and the common namespace classes as:

      import pyparsing as pp
      # if necessary
      ppc = pp.pyparsing_common
      ppu = pp.pyparsing_unicode

- Submitted examples *must* be compatible with Python 3.6.8 or later. (It is acceptable if examples use Python
features added after 3.6.)
- Where possible use operators to create composite parse expressions:

      expr = expr_a + expr_b | expr_c

  instead of:

      expr = pp.MatchFirst([pp.And([expr_a, expr_b]), expr_c])

  Exception: if using a generator to create an expression:

      import keyword
      python_keywords = keyword.kwlist
      any_keyword = pp.MatchFirst(pp.Keyword(kw)
                                  for kw in python_keywords)

- Learn [Common Pitfalls When Writing Parsers](https://github.com/pyparsing/pyparsing/wiki/Common-Pitfalls-When-Writing-Parsers) and
how to avoid them when developing new examples.
- See additional notes under [Some Coding Points](#some-coding-points).
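
As a rough illustration of the guidelines above, the sketch below builds the same expression with operators and with explicit classes, and builds a keyword matcher from a generator. The names `assignment`, `explicit`, and the sample inputs are hypothetical, and the PEP8-style method names `parse_string` and `as_list` assume pyparsing 3.0 or later.

```python
import keyword

import pyparsing as pp

# Common namespace class, imported as recommended above.
ppc = pp.pyparsing_common

# Hypothetical sub-expressions, chosen only for illustration.
identifier = ppc.identifier
integer = ppc.integer

# Operator form: '+' builds a pp.And, '|' builds a pp.MatchFirst.
# '+' binds more tightly than '|', so this reads as
# (identifier + "=" + integer) | identifier.
assignment = identifier + "=" + integer | identifier

# The same expression spelled out with explicit classes (harder to read).
explicit = pp.MatchFirst(
    [
        pp.And([identifier, pp.Literal("="), integer]),
        identifier,
    ]
)

# The generator exception: explicit MatchFirst is reasonable here.
any_keyword = pp.MatchFirst(pp.Keyword(kw) for kw in keyword.kwlist)

result_ops = assignment.parse_string("x = 42").as_list()
result_explicit = explicit.parse_string("x = 42").as_list()
keyword_match = any_keyword.parse_string("while").as_list()
```

Both forms should produce the same parse results for the same input; the operator form is preferred because it reads closer to the grammar being described.
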
## Submitting changes
If you are considering proposing updates to pyparsing, please bear in mind the following guidelines.

Please review the article [_The Zen of Pyparsing_ and _The Zen of Pyparsing
Development_](https://github.com/pyparsing/pyparsing/wiki/Zen)
on the pyparsing wiki to get a general feel for the historical and future approaches to pyparsing's
design, and intended developer experience as an embedded DSL.

If you are using new Python features or changing usage of the Python stdlib, please check that they work as
intended on prior versions of Python (currently back to Python 3.6.8).
## Some design points
- Minimize additions to the module namespace. Over time, pyparsing's namespace has acquired a *lot* of names.
New features have been encapsulated into namespace classes to try to hold back the name flooding when importing
pyparsing.
- New operator overloads for ParserElement will need to show broad applicability, and should be related to
parser construction.
- Performance tuning should focus on parse time performance. Optimizing parser definition performance is secondary.
- New external dependencies will require substantial justification, and if included, will need to be guarded for
`ImportError`s raised if the external module is not installed.
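
One possible way to guard an optional dependency is sketched below. The module name `hypothetical_fast_matcher` is invented for illustration and is not a real or proposed pyparsing dependency; the fallback to the standard library `re` module is likewise only an assumption about how such a guard might look.

```python
# "hypothetical_fast_matcher" is a made-up module name, used only to show
# the guard pattern; it is assumed to offer the same findall() as re.
try:
    import hypothetical_fast_matcher as matcher

    HAVE_FAST_MATCHER = True
except ImportError:
    # Optional module is not installed: fall back to the standard library.
    import re as matcher

    HAVE_FAST_MATCHER = False


def find_words(text):
    """Return all word-like runs in text, using whichever matcher loaded."""
    return matcher.findall(r"\w+", text)


words = find_words("hello world")
```

With a guard like this, importing the package never fails just because the optional module is missing, and callers can check the flag if they need to know which implementation is active.
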
## Some coding points
These coding styles are encouraged whether submitting code for core pyparsing or for submitting an example.
- PEP8 - pyparsing has historically been very non-compliant with many PEP8 guidelines, especially those regarding
name casing. I had just finished several years of Java and Smalltalk development, and camel case seemed to be the
future trend in coding styles. As of version 3.0.0, pyparsing is moving over to PEP8 naming, while maintaining
compatibility with existing parser code by defining synonyms using the legacy names. These names will be
retained until a future release (probably 4.0), to provide a migration path for current pyparsing-dependent
applications - DO NOT MODIFY OR REMOVE THESE NAMES.
See more information at the [PEP8 wiki page](https://github.com/pyparsing/pyparsing/wiki/PEP-8-planning).
- No backslashes for line continuations.
  Continuation lines for expressions in ()'s should start with the continuing operator:

      really_long_line = (something
                          + some_other_long_thing
                          + even_another_long_thing)

- Maximum line length is 120 characters. (Black will override this.)
- Changes to core pyparsing must be compatible back to Py3.6 without conditionalizing. Later Py3 features may be
used in examples by way of illustration.
- str.format() statements should use named format arguments (unless this proves to be a slowdown at parse time).
- List, tuple, and dict literals should include a trailing comma after the last element, which reduces changeset
clutter when another element gets added to the end.
- New features should be accompanied by updates to unitTests.py and a bullet in the CHANGES file.
- Do not modify pyparsing_archive.py. This file is kept as a reference artifact from when pyparsing was distributed
as a single source file.
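
A few of the coding points above can be seen together in the short sketch below: named `str.format()` arguments, trailing commas in literals, and parenthesized continuation lines that start with the operator. The function and variable names are hypothetical and do not come from the pyparsing source.

```python
def describe_token(name, line_number, column):
    # Named format arguments make the template self-describing.
    return "{name} found at line {line}, column {col}".format(
        name=name,
        line=line_number,
        col=column,
    )


# Trailing comma after the last element: adding a new entry later
# changes only one line in the diff.
SUPPORTED_UNITS = [
    "bytes",
    "kilobytes",
    "megabytes",
]

header = "Supported units: "

# No backslashes: wrap in parentheses and lead each continuation line
# with the continuing operator.
summary = (
    header
    + ", ".join(SUPPORTED_UNITS)
    + "."
)

message = describe_token("integer", 3, 7)
```
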