summaryrefslogtreecommitdiff
path: root/TODO.html
blob: a06bcbbd5053b7c723a903475f211b6ff21a4f64 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
<?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
  <title>Raptor RDF Parser Toolkit - To Do List</title>
</head>
<body>

<h1 style="text-align:center">Raptor RDF Parser Toolkit - To Do List</h1>

<h2>Bugs and Features</h2>

<ul>
<li>Update the rdf/rdfs schema documents. Convert the turtle labels
from unicode character form into UTF-8 (DATA, DOCS)</li>
<li>LSID URN support (FEATURE)</li>
<li>Ensure there is support to allow
<a href="http://librdf.org/rasqal/">Rasqal</a>
to register a parser outside raptor that provides RDF query results as
triples.  Alternatively, provide a skeleton wrapper parser that
allows the same by delegating all triple generation to callbacks. (FEATURE)</li>
<li>Add a guessing parser that sends an Accept: HTTP header for all
supported mime types and uses the returned headers to select a
parser. (FEATURE)</li>
<li>Describe in libraptor.3 relevant raptor.h structures -
describe important ones such as raptor_statement, raptor_locator and
raptor_feature as well as reference internal ones such as raptor_identifier,
raptor_identifier_type, raptor_uri_source, raptor_ntriples_term_type,
raptor_genid_type and raptor_uri_handler for the raptor_uri class.
(DOCS)</li>
<li>RSS tag soup parser junks/overwrites element content if it is
delivered in chunks rather than as one big CDATA, such as when
libxml2 sees entities, it emits the content in bits (BUG)</li>
<li>Add a gzip2/bzip2 content reading interface, using libxml2 to do the hard work (FEATURE)</li>
<li>Turtle parser: the lexer <code>lval-&gt;string</code> values get overwritten on errors (BUG)</li>
<li>Record the xml parser used and make available from API (FEATURE)</li>
<li>Add a N3 parser (FEATURE)</li>
<li>Add an XSLT 'parser' based on <a href="http://xmlsoft.org/XSLT/">libxslt</a> (FEATURE)</li>
<li>Provide a perl interface (FEATURE)</li>
<li>Add namespace/prefix declaration user level callback (FEATURE)</li>
<li>Other <code>rdf:parseType</code> support (FEATURE)</li>
<li>Add a PNG parser, like
<a href="http://pmt.sourceforge.net/pngmeta/">pngmeta</a> and parse
the embedded RDF/XML into triples (FEATURE)</li>
</ul>

<h2>Done</h2>

<p>The most recent changes are at the top, with the first
release version where appropriate.</p>

<ul>
<li>Allow the supported parsers to be selected by configure (1.3.3)</li>
<li>Errors which happened when fetching WWW content were always
printed to stderr.  They are now passed to the main error routines
which allows applications to retrieve them. (1.3.2)</li>
<li>In lax mode, warns when unknown rdf:parseType values are seen to prevent
things like 'owl:collection' and 'collection' passing through (1.3.1)</li>
<li>Turtle parser: a bare ':' and qnames such as 'rdf:_1' now work (1.3.1)</li>
<li>Describe in libraptor.3 the use of UTF-8 for strings and URIs. (1.3.1)</li>
<li>Send an HTTP Accept: header with WWW requests corresponding to the
mime type of the parser selected, accepting all others at lower q (1.3.0)</li>
<li>Guess parser from a mime type, content fragment and/or
content name such as a filename or URI (1.3.0)</li>
<li>Turtle parser: use <code>raptor_generate_id</code> for blank node identifiers (1.3.0)</li>
<li>Added <code>--enable-xml-1-1-names</code> to enable XML 1.1 name checking instead of XML 1.0 (1.1.0)</li>
<li>Updated <a href="http://www.w3.org/TR/xml11/">XML 1.1</a> name checking for ranges in the XML/Namespaces in XML 1.1 proposed recommendations (1.1.0)</li>
<li>Added <code>--disable-nfc-check</code> to disable the NFC linking/checking with GNOME glib, even if that library is present (1.1.0)</li>
<li>Made the N-Triples parser use <code>raptor_generate_id</code> for blank nodes identifiers (1.1.0)</li>
<li>Updated the RDF/XML parser to handle libxml 2.6.0 SAX2 API which
changes the names of all of the SAX1 calls (1.1.0).</li>
<li>Added an <a href="http://www.ilrt.bristol.ac.uk/discovery/2003/11/ntriplesplus/">N-Triples Plus</a> parser (1.1.0)</li>
<li>Correct line counting for N-Triples with \r\n line (DOS) files and
when the line crosses a chunk. (1.1.0)</li>
<li>Handle WIN32 file URIs starting <code>file://c:</code>... (1.1.0)</li>
<li>Scanning (<tt>rapper --scan</tt>) for <tt>rdf:RDF</tt> in embedded XML does not work (0.9.13)</li>
<li>URI retrieval - make sure it chops off the fragment before
fetching. (0.9.12)</li>
<li>Make 'make check' not die if NFC tests fail, possible if
no GNOME glib2 is present. (0.9.12)</li>
<li>Unicode character normalization NFC checks not implemented (0.9.11)</li>
<li>XML (Exclusive) Canonicalization for XML Literals not implemented (0.9.11)</li>
<li>libxml2 currently does not do XML attribute normalization i.e.
removing whitespace around attribute content. Added a fix (0.9.11)</li>
<li>Added raptor_www_no_www_library_init_finish to allow once-only
www library startup/shutdown to be prevented.  Default to performed
so that most higher level apps do not need to know or care (0.9.10)</li>
<li>Docs updated for 0.9.7 to 0.9.10 API changes in the <a href="libraptor.html">libraptor.3</a> manual page</li>
<li>Escape XML attribute values in parseType literal content generation (0.9.10)</li>
<li>Passing NULL to raptor_start_parse base URI for rdfxml parser
crashed it.  Now the RDF/XML parser fails. (0.9.9)</li>
<li>rdf:parseType="Literal" content with &amp;, &lt;, &gt; and
unicode characters did not get escaped in the encoded string into
entities/character entities. (0.9.9)</li>
<li>XML Namespaces declared with a prefix and no namespace name (URI)
were accepted - this is illegal (0.9.8)</li>
<li>Empty files made rdf/xml parser crash with libxml2 and expat; now both
return failure since an empty doc is not allowed (0.9.8)</li>
<li><tt>rdf:bagID</tt> handling added (0.9.7)</li>
<li>Can now configure on system which has expat alone using
<tt>./configure --with-xml-parser=expat</tt> (0.9.7)</li>
<li>Fix compiling libxml 2.3.5 and nearby versions failing on FreeBSD (which is 20 months old) caused by a change to the xmlSAXHandler structure (0.9.7)</li>
<li><tt>file:</tt> URIs were not correctly handled (0.9.6)</li>
<li>Resilience to XML parser errors, RDF/XML grammar errors (0.9.6)</li>
<li>Manual pages (0.9.6)</li>
<li>Made CDATA section work with libxml (0.9.6)</li>
<li>daml:collection is generating some wrong triples (BUG) (0.9.6)</li>
<li>Compiling on OSX fails on most systems since it requires libtool 1.4.2
or CVS version which requires automake 2.50 and a newer version of
autoconf.  This needs significant changes to the autoconfigure
system.  See
<a href="http://fink.sourceforge.net/doc/porting/libtool.php">Fink porting libtool to OSX.</a>
The fix for now - <strong>for the packaged sources only, not CVS</strong>
- is to use a patched libtool 1.4.2 (from Debian) that can generate a libtool
that knows OSX. (0.9.6)
</li>
<li>URI resolving to a base URI now working (0.9.5)</li>
<li>Handle &lt;prop:Elt rdf:ID="id" rdf:resource="http://example.org/obj"/&gt; (RDF Core WG syntax change) (0.9.5)</li>
<li>Add xml:base support - RDF Core WG syntax change (0.9.4)</li>
<li>Perform xml:lang processing and pass to application (0.9.4)</li>
<li>parseType literal broken (0.9.4)</li>
<li>Tracking of user IDs/generated IDs available to user code (0.9.3)</li>
<li>daml:collection parseType support (0.9.3)</li>
<li>rdf:li used as a propertyElt does not work (0.9.3)</li>
<li>parseType literal support complete (0.9.3)</li>
<li>Fixed many crashes (0.9.3)</li>
</ul>


<h2>No need to do list</h2>

<p>After decisions from the
<a href="http://www.w3.org/2001/sw/RDFCore/">RDF Core WG</a>
as recorded in the
<a href="http://www.w3.org/2000/03/rdf-tracking/#attention-developers">attention developers</a>
area of the
<a href="http://www.w3.org/2000/03/rdf-tracking/#attention-developers">RDF Issue Tracking document</a>.</p>

<ul>
<li>aboutEach support - removed from syntax</li>
<li>aboutEachPrefix support - removed from syntax</li>
<li>Special container support - not special anymore, just typed nodes</li>
</ul>

<hr />

<p>Copyright 2001-2004 <a href="http://purl.org/net/dajobe/">Dave Beckett</a>, <a href="http://www.ilrt.bristol.ac.uk/">Institute for Learning and Research Technology</a>, <a href="http://www.bristol.ac.uk/">University of Bristol</a></p>

</body>
</html>